Add gfx1152 support #97
|
Hmm, the resulting build detects my GPU but I used llama-ubuntu-rocm-gfx1152-x64.zip for my testing. |
|
@ppanchad-amd Can you please follow up with TheRock team to check if this is a known issue on their side? |
Can also attest to this: I built llama.cpp for Windows with the 7.14 gfx1152 nightly (https://therock-nightly-tarball.s3.amazonaws.com/therock-dist-windows-gfx1152-7.14.0a20260521.tar.gz) and only got gibberish out of qwen3 and gemma models, no matter the quant/size. Might be an issue on TheRock's end, as the builds aren't sanity tested yet. CPU is Ryzen 7 350 (GPU is 860M). Interestingly enough, Lemonade Server recognises it, but says it's only supported on Linux as a backend. |
|
In the Lemonade Discord, some users had success with rocm-stable and gfx1152. Models: gemma4 E4B Q4_K, Bonsai 1.7B and 8B. Just commenting here in case it helps identify what's wrong.
Just to clarify as the user who did those tests on gfx1152: |
|
that was me with the working bonsai and broken gemma4. happy to test stuff and gather diagnostics. |
|
Which checkpoint files created gibberish? Do you have a command line to reproduce? |
|
not near the computer right now so I'm pulling this from discord...
Also tried with and I got rid of the extra ctk/ctv/ub/b/ctx-size flags too |
|
I've made a separate issue for the ? issues, as they might be more of an upstream thing, lmk if it should just be merged here #98 |
|
I think the issue was on the llama.cpp side. Pull Request #24129, which has now been merged, should fix it. Update:
|
|
using upstream llama.cpp, rocm 7.13 I'm seeing repeated and serious success. running an overnight test with |
|
llama-b9628.txt |
Summary
Enable nightly builds for the gfx1152 target (Krackan Point). TheRock now publishes nightly tarballs for this target at therock-nightly-tarball.s3.amazonaws.com (therock-dist-{windows,linux}-gfx1152-*.tar.gz).
Related discussions: #50
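
For anyone wanting to fetch one of these tarballs to reproduce the results above, here is a minimal sketch of how the URL appears to be put together. It is based only on the naming pattern in the summary and the one Windows link quoted in the thread. The helper name `nightly_tarball_url` is hypothetical, and whether a given version string exists for a given platform is an assumption you would need to check against the bucket.

```python
# Sketch only: builds a TheRock nightly tarball URL from the pattern
# therock-dist-{windows,linux}-gfx1152-*.tar.gz quoted in the summary.
# nightly_tarball_url is a hypothetical helper, not part of any TheRock tooling.

BASE_URL = "https://therock-nightly-tarball.s3.amazonaws.com"
KNOWN_PLATFORMS = ("windows", "linux")


def nightly_tarball_url(platform: str, target: str, version: str) -> str:
    """Return the tarball URL for a platform, GPU target, and version string."""
    if platform not in KNOWN_PLATFORMS:
        raise ValueError(f"unexpected platform: {platform!r}")
    return f"{BASE_URL}/therock-dist-{platform}-{target}-{version}.tar.gz"


if __name__ == "__main__":
    # The version string below is the one from the Windows link in the thread.
    print(nightly_tarball_url("windows", "gfx1152", "7.14.0a20260521"))
```

The function only formats a string and does not check that the tarball is actually published, so a 404 on download would mean that particular nightly does not exist for that platform.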