HPU (Intel Gaudi) support for bitsandbytes #1592


Conversation

@ckvermaAI commented Apr 14, 2025

This PR enables bitsandbytes support for HPU (Intel Gaudi) devices.

  1. Add HPU as a supported device.
  2. Create a backend for HPU devices (bitsandbytes/backends/hpu.py).
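The second step follows the usual multi-backend pattern: map each torch device type to its own implementation and dispatch at call time. A minimal sketch of that idea, with hypothetical names (`register_backend`, `HPUBackend`) that are illustrative and not the actual bitsandbytes internals:

```python
# Hypothetical sketch of per-device backend dispatch.
# Names are illustrative, not the real bitsandbytes API.
_backends = {}

def register_backend(device_type, backend):
    """Associate a torch device type string (e.g. "cuda", "hpu") with a backend."""
    _backends[device_type] = backend

class HPUBackend:
    """Stand-in for bitsandbytes/backends/hpu.py."""
    def quantize_4bit(self, values):
        # A real backend would call Gaudi kernels here; identity placeholder.
        return values

register_backend("hpu", HPUBackend())

def get_backend(device_type):
    """Look up the backend for a tensor's device type."""
    if device_type not in _backends:
        raise ValueError(f"no bitsandbytes backend registered for '{device_type}'")
    return _backends[device_type]
```

With this shape, adding a new accelerator is a matter of registering one more backend object rather than branching on device type throughout the library.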

Authored by: Chetan Kumar Verma <[email protected]>
Co-authored-by: Ruheena Suhani Shaik <[email protected]>
Co-authored-by: Bhargav Eede <[email protected]>
Co-authored-by: Vivek Goel <[email protected]>
@vivekgoe

@jiqing-feng @Titus-von-Koeller Please help review these changes. These changes add support for NF4 quantization/dequantization using Intel Gaudi hardware. https://www.intel.com/content/www/us/en/products/details/processors/ai-accelerators/gaudi.html
For now this PR adds support for single-level NF4 quantization only; we are working on second-level NF4 quantization and will add it in a follow-up PR in the near future. Thanks!
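For context on the single-level vs. second-level distinction: NF4 quantization stores, per block of weights, one floating-point absmax scale plus 4-bit indices into a fixed 16-value codebook; "second level" additionally quantizes those absmax scales to save more memory. A rough sketch under that assumption (codebook values approximate the NF4 levels from the QLoRA paper; helper names are illustrative, not the bitsandbytes API):

```python
# Approximate NF4 codebook (16 levels in [-1, 1]), per the QLoRA paper.
NF4_CODE = [-1.0, -0.6962, -0.5251, -0.3949, -0.2844, -0.1848, -0.0911,
            0.0, 0.0796, 0.1609, 0.2461, 0.3379, 0.4407, 0.5626, 0.7230, 1.0]

def quantize_block(values):
    """Single-level quantization of one block:
    keep a full-precision absmax scale plus a 4-bit index per value."""
    absmax = max(abs(v) for v in values) or 1.0  # avoid div-by-zero on all-zero blocks
    idx = [min(range(16), key=lambda i: abs(NF4_CODE[i] - v / absmax))
           for v in values]
    return absmax, idx

def dequantize_block(absmax, idx):
    """Rescale codebook values back to the original range."""
    return [NF4_CODE[i] * absmax for i in idx]
```

Second-level quantization would additionally compress the per-block `absmax` values themselves (e.g. 8-bit with a shared scale), which is the part deferred to the follow-up PR.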


The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@Titus-von-Koeller Titus-von-Koeller marked this pull request as ready for review April 15, 2025 15:26
@Titus-von-Koeller Titus-von-Koeller merged commit b090d85 into bitsandbytes-foundation:multi-backend-refactor Apr 15, 2025
1 of 2 checks passed
@Titus-von-Koeller (Collaborator)

The code looks good. Thanks for your work on this; a promising first step!

@vivekgoe @ckvermaAI

Please see this short update about the multi-backend refactor #1596.

Regarding the Intel backend, as discussed in parallel with Ke Ding, PRs migrating existing work from multi-backend-refactor (rather than main) should target the new bitsandbytes-intel repo.

However, some of the pure torch ops and generic CPU functionality still make more sense in the main branch of bitsandbytes, provided they don't depend on Intel IPEX. Please align with @matthewdouglas and me on those; it's probably best to discuss that in our shared Slack channel.

@vivekgoe

@Titus-von-Koeller Thanks for reviewing and merging our PR! If possible, please add me to the shared Slack channel you mentioned; if that needs to be done by someone on the Intel team, let me know and I will follow up internally.

5 participants