add support for 64 block size on 32 warp size supported amd gpus #1748

electron271 · 2025-09-06T21:07:19Z

According to https://rocm.docs.amd.com/en/latest/reference/gpu-arch-specs.html, most non-Instinct GPUs support a warp size of 32.

Tested on an RX 9070 XT. I'm looking into getting this tested on AMD Instinct accelerators to ensure GPUs with a warp size of 64 still work.

matthewdouglas · 2025-09-08T18:36:35Z

Thanks for the PR! I don't have the bandwidth to test this personally at the moment, so will defer to AMD team. Also I do not have any RDNA GPUs on hand.

cc: @pnunna93

github-actions · 2025-09-09T16:17:03Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

pnunna93

Thanks for the PR! It's good to go once the warp size change is made.

pnunna93 · 2025-09-24T21:28:17Z

csrc/ops.hip

   hipLaunchKernelGGL(( kQuantizeBlockwise<T, 128, 2, 0, DATA_TYPE>), dim3(num_blocks), dim3(64), 0, 0, code, A, absmax, out, rand, rand_offset, n);
-  //else if(blocksize == 64)
-  // hipLaunchKernelGGL(( kQuantizeBlockwise<T, 64, 2, 0, DATA_TYPE>), dim3(num_blocks), dim3(32), 0, 0, code, A, absmax, out, rand, rand_offset, n);
+  else if(blocksize == 64 && warpSize == 32)


warpSize will be deprecated in 7.0. We just added a WARP_SIZE macro; please use it instead.
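For illustration only, the gating discussed in this review can be modeled on the host side in plain C++. This is a rough sketch, not the code in csrc/ops.hip: `threads_for_blocksize`, `threads_for_current_target`, and `SKETCH_WARP_SIZE` are hypothetical names, and `SKETCH_WARP_SIZE` only stands in for the WARP_SIZE macro mentioned above, whose actual definition is not shown in this thread. The thread counts mirror the two launches visible in the diff (block size 128 with dim3(64), block size 64 with dim3(32)).

```cpp
// Hypothetical stand-in for the WARP_SIZE macro mentioned in the review.
// The value 32 is an assumption for illustration; a real build would take
// it from the target architecture.
#ifndef SKETCH_WARP_SIZE
#define SKETCH_WARP_SIZE 32
#endif

// Threads per block for a quantization block size, or 0 when that
// combination is not dispatched. Only the two branches visible in the
// diff are modeled: blocksize 128 always launches 64 threads, and
// blocksize 64 launches 32 threads only when the warp size is 32.
constexpr int threads_for_blocksize(int blocksize, int warp_size) {
    return blocksize == 128 ? 64
         : (blocksize == 64 && warp_size == 32) ? 32
         : 0;
}

// Compile-time checks of the gating.
static_assert(threads_for_blocksize(128, 64) == 64, "128 works on any warp size");
static_assert(threads_for_blocksize(64, 32) == 32, "64 is enabled for warp size 32");
static_assert(threads_for_blocksize(64, 64) == 0, "64 stays disabled for warp size 64");

// Convenience wrapper that reads the macro instead of a runtime variable.
inline int threads_for_current_target(int blocksize) {
    return threads_for_blocksize(blocksize, SKETCH_WARP_SIZE);
}
```

Passing the warp size as a parameter keeps the check easy to test for both 32-wide and 64-wide targets, while the wrapper shows where a macro would replace the `warpSize` variable used in the diff.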

electron271 added 2 commits September 6, 2025 00:28

add support for 64 block size on 32 warp size supported amd gpus

d607127

uncomment 64 block size support in csrc

f7b4430

electron271 mentioned this pull request Sep 6, 2025

ROCM support unslothai/unsloth#3279

Open

only enable 64 block size support on architectures with 32 warp size

6e2e4d2

matthewdouglas added the ROCm label Sep 8, 2025

pnunna93 suggested changes Sep 24, 2025
