Skip to content

Conversation

TNTran92
Copy link

@TNTran92 TNTran92 commented Apr 6, 2024

The PR Update rocm .42 to 0.43. Functionalities added include

  • CMakeLists supporting ROCM (remove CUDA and Apple)
  • add FSDP support and related test
  • Fix some unit tests to make them populate
  • More documentation, especially in optimizers

pnunna93 pushed a commit that referenced this pull request Sep 23, 2025
* Add SYCL Kernels for XPU backend

* fix transpose

Signed-off-by: jiqing-feng <[email protected]>

* fix log and format

Signed-off-by: jiqing-feng <[email protected]>

* revert cpu changes

Signed-off-by: jiqing-feng <[email protected]>

* clean ipex_xpu

Signed-off-by: jiqing-feng <[email protected]>

* clean ipex import

Signed-off-by: jiqing-feng <[email protected]>

* fix ipex cpu import

Signed-off-by: jiqing-feng <[email protected]>

* fix typo

Signed-off-by: jiqing-feng <[email protected]>

* fix comments

Signed-off-by: jiqing-feng <[email protected]>

* refine gemv_4bit kernel

* enable FP4 for dequant_4bit and gemv_4bit

* refine FP4 dequantization performance

* remove check for better performance

Signed-off-by: jiqing-feng <[email protected]>

* fix doc

Signed-off-by: jiqing-feng <[email protected]>

* clean code

* fix tests

Signed-off-by: jiqing-feng <[email protected]>

* rm comments

Signed-off-by: jiqing-feng <[email protected]>

* fix memory issue

* fix ut failure

* adjust threshold

Signed-off-by: jiqing-feng <[email protected]>

* fix xpu check

Signed-off-by: jiqing-feng <[email protected]>

* change test_functional check

Signed-off-by: jiqing-feng <[email protected]>

* fix test_module

Signed-off-by: jiqing-feng <[email protected]>

* fix device check

Signed-off-by: jiqing-feng <[email protected]>

* fix tests

Signed-off-by: jiqing-feng <[email protected]>

* Enable Windows build and refine code

* fix xpu log

Signed-off-by: jiqing-feng <[email protected]>

* remove ipex entirely

Signed-off-by: jiqing-feng <[email protected]>

* fix cpu int8 CB

Signed-off-by: jiqing-feng <[email protected]>

* fix lint

Signed-off-by: jiqing-feng <[email protected]>

* fix logs (#12)

* fix logs

Signed-off-by: jiqing-feng <[email protected]>

* fix format

Signed-off-by: jiqing-feng <[email protected]>

---------

Signed-off-by: jiqing-feng <[email protected]>

* Fix sycl lint error and tests (#13)

* fix sycl nd

Signed-off-by: jiqing-feng <[email protected]>

* fix tests

Signed-off-by: jiqing-feng <[email protected]>

---------

Signed-off-by: jiqing-feng <[email protected]>

* skip typo check for xpu kernel codes (#14)

* skip test for xpu ops

Signed-off-by: jiqing-feng <[email protected]>

* fix lint

Signed-off-by: jiqing-feng <[email protected]>

* skip typo for xpu

Signed-off-by: jiqing-feng <[email protected]>

* skip

Signed-off-by: jiqing-feng <[email protected]>

* skip

Signed-off-by: jiqing-feng <[email protected]>

---------

Signed-off-by: jiqing-feng <[email protected]>

* register triton kernel for quantization (#15)

Signed-off-by: jiqing-feng <[email protected]>

* Fix version comparison issue (#18)

# Description

The version comparison expression miss reference the .release property from the version object. This lead to compare between the tuple and the string

# Error message
```
The 8-bit optimizer is not available on your device, only available on CUDA for now.
🦥 Unsloth: Will patch your computer to enable 2x faster free finetuning.
Traceback (most recent call last):
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/unsloth_validation/run.py", line 1, in <module>
    import unsloth
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/unsloth/__init__.py", line 235, in <module>
    from .models import *
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/unsloth/models/__init__.py", line 15, in <module>
    from .llama     import FastLlamaModel
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/unsloth/models/llama.py", line 23, in <module>
    from ._utils import *
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/unsloth/models/_utils.py", line 89, in <module>
    from unsloth_zoo.patching_utils import (
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/unsloth_zoo/patching_utils.py", line 629, in <module>
    import transformers.integrations.bitsandbytes
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/transformers/integrations/bitsandbytes.py", line 20, in <module>
    import bitsandbytes as bnb
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/bitsandbytes/bitsandbytes/__init__.py", line 39, in <module>
    from .backends.xpu import ops as xpu_ops
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/bitsandbytes/bitsandbytes/backends/xpu/ops.py", line 17, in <module>
    if version.parse(torch.__version__).release >= version.parse("2.9"):
TypeError: '>=' not supported between instances of 'tuple' and 'Version'
```

---------

Signed-off-by: jiqing-feng <[email protected]>
Co-authored-by: jiqing-feng <[email protected]>
Co-authored-by: Er-Xin (Edwin) Shang <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant