Skip to content

(WIP) Multi backend refactor -> main (full diff of all already merged PRs) #1220

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 279 commits into
base: main
Choose a base branch
from

Conversation

Titus-von-Koeller
Copy link
Collaborator

@Titus-von-Koeller Titus-von-Koeller commented May 25, 2024

This PR to main serves the purpose to keep an overview of all the extensive changes that have been introduced to multi-backend-refactor to the iterative PRs around this topic.

We will eventually merge this into master and before that do a thorough final review and, as well, get Tim's final sign-off on this extensive refactor.

For now, it mainly serves the purpose of providing a public diff of the entirety of the changes. However, already feel free to leave constructive feedback and review comments.

jianan-gu and others added 30 commits December 4, 2023 00:56
Enable igemmlt int test on rocm
jiqing-feng and others added 10 commits January 22, 2025 16:17
* fix dequant 8bit

Signed-off-by: jiqing-feng <[email protected]>

* support double quant on intel cpu and xpu

Signed-off-by: jiqing-feng <[email protected]>

* fix format

Signed-off-by: jiqing-feng <[email protected]>

* fix shape

Signed-off-by: jiqing-feng <[email protected]>

* fix 4bit format

Signed-off-by: jiqing-feng <[email protected]>

* fix device error for xpu

Signed-off-by: jiqing-feng <[email protected]>

* fix 4bit tensor shape

Signed-off-by: jiqing-feng <[email protected]>

* fix nf4 xpu finetune

Signed-off-by: jiqing-feng <[email protected]>

---------

Signed-off-by: jiqing-feng <[email protected]>
* new matmul8bit

Signed-off-by: jiqing-feng <[email protected]>

* fix cxb

Signed-off-by: jiqing-feng <[email protected]>

---------

Signed-off-by: jiqing-feng <[email protected]>
Copy link

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

matthewdouglas and others added 5 commits February 10, 2025 15:40
* fix xpu dtypoe

Signed-off-by: jiqing-feng <[email protected]>

* fix nf4 dtype

Signed-off-by: jiqing-feng <[email protected]>

---------

Signed-off-by: jiqing-feng <[email protected]>
* fix version

Signed-off-by: jiqing-feng <[email protected]>

* fix setup version

Signed-off-by: jiqing-feng <[email protected]>

---------

Signed-off-by: jiqing-feng <[email protected]>
jiqing-feng and others added 9 commits March 4, 2025 20:39
* enable benchmark script

Signed-off-by: jiqing-feng <[email protected]>

* Small fixes to non_cuda_backends.mdx

---------

Signed-off-by: jiqing-feng <[email protected]>
Co-authored-by: Titus <[email protected]>
Signed-off-by: jiqing-feng <[email protected]>
* enable quant storage

Signed-off-by: jiqing-feng <[email protected]>

* fix to numpy

Signed-off-by: jiqing-feng <[email protected]>

---------

Signed-off-by: jiqing-feng <[email protected]>
* fix 4bit XPU dequant 4bit

Signed-off-by: jiqing-feng <[email protected]>

* fix default value

Signed-off-by: jiqing-feng <[email protected]>

* fix ipex linear set

Signed-off-by: jiqing-feng <[email protected]>

* fix ipex linear set to false when calling state dict

Signed-off-by: jiqing-feng <[email protected]>

* fix Int8Param device patch

Signed-off-by: jiqing-feng <[email protected]>

---------

Signed-off-by: jiqing-feng <[email protected]>
* fix xpu to cpu

Signed-off-by: jiqing-feng <[email protected]>

* fix xpu cpu data device

Signed-off-by: jiqing-feng <[email protected]>

---------

Signed-off-by: jiqing-feng <[email protected]>
* fix intel cpu/xpu warning

Signed-off-by: jiqing-feng <[email protected]>

* fix error log

Signed-off-by: jiqing-feng <[email protected]>

* fix lib

Signed-off-by: jiqing-feng <[email protected]>

* rm return Nonr

Signed-off-by: jiqing-feng <[email protected]>

* error log only without ipex

Signed-off-by: jiqing-feng <[email protected]>

* fix import eerror

Signed-off-by: jiqing-feng <[email protected]>

* fix format

Signed-off-by: jiqing-feng <[email protected]>

---------

Signed-off-by: jiqing-feng <[email protected]>
@anadon
Copy link

anadon commented Apr 14, 2025

Could someone post about the status/progress of this PR? Like a list of checked and unchecked known items to do.

Liangliang-Ma and others added 2 commits April 15, 2025 11:13
* enable xpu 8bit optim

* add deqaunt_blockwise

* dequantize_blockwise

* add bakcend synchronize

* refine code

* ipex dep

* ipex dep too

* ipex version check

---------

Co-authored-by: jiqing-feng <[email protected]>
Authored by: Chetan Kumar Verma <[email protected]>
Co-authored-by: Ruheena Suhani Shaik <[email protected]>
Co-authored-by: Bhargav Eede <[email protected]>
Co-authored-by: Vivek Goel <[email protected]>

Co-authored-by: Ruheena Suhani Shaik <[email protected]>
@Titus-von-Koeller
Copy link
Collaborator Author

Please see this short update about the multi-backend refactor #1596.

cc @anadon

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.