Add gfx1152 support by danielholanda · Pull Request #97 · lemonade-sdk/llamacpp-rocm

danielholanda · 2026-05-20T02:14:46Z

Summary

Enable nightly builds for the gfx1152 target (Krackan Point). TheRock now publishes nightly tarballs for this target at therock-nightly-tarball.s3.amazonaws.com (therock-dist-{windows,linux}-gfx1152-*.tar.gz).

Related discussions: #50

clee · 2026-05-20T05:22:31Z

Hmm, the resulting build detects my GPU but llama-cli shows gibberish when I enter text using any of the qwen3.5 models I have locally. Using llama-cli from the Vulkan build works as expected.

I used llama-ubuntu-rocm-gfx1152-x64.zip for my testing.

danielholanda · 2026-05-20T16:56:02Z

@ppanchad-amd Can you please follow up with TheRock team to check if this is a known issue on their side?

petmav · 2026-05-21T07:57:28Z

Hmm, the resulting build detects my GPU but llama-cli shows gibberish when I enter text using any of the qwen3.5 models I have locally. Using llama-cli from the Vulkan build works as expected.

I used llama-ubuntu-rocm-gfx1152-x64.zip for my testing.

Can also attest to this, built llama.cpp for windows 7.14 gfx1152 (https://therock-nightly-tarball.s3.amazonaws.com/therock-dist-windows-gfx1152-7.14.0a20260521.tar.gz) and only got gibberish out of qwen3 and gemma models, no matter the quant/size. Might be an issue on TheRock's end as the builds arent sanity tested yet. CPU is Ryzen 7 350 (GPU is 860M). Interestingly enough lemonade server recognises it, but says it's only supported on linux as a backend.

sofiageo · 2026-05-21T08:52:40Z

In lemonade discord some users had success with the rocm-stable and gfx1152. Models: gemma4 E4B Q4_K, Bonsai 1.7B and 8B.

Just commenting it here in case it helps identify what's wrong

soulafein83 · 2026-05-21T11:16:06Z

In lemonade discord some users had success with the rocm-stable and gfx1152. Models: gemma4 E4B Q4_K, Bonsai 1.7B and 8B.

Just commenting it here in case it helps identify what's wrong

Just to clarify as the user who did those tests on gfx1152:
Bonsai models actually work fine. Gemma 3 and 4, Llama 3.2 both get stuck in infinite loops, spitting out either tokens or question marks.
However, llama-bench runs on the ROCm backend and finishes without any issues.

ckuethe · 2026-05-21T11:38:22Z

that was me with the working bonsai and broken gemma4. happy to try test stuff and gather diagnostics.

mgehre-amd · 2026-05-21T11:41:07Z

Which checkpoint files created gibberish? Do you have a command line to reproduce?

ckuethe · 2026-05-21T11:53:24Z

not near the computer right now so I'm pulling this from discord...

./llama-cli --no-mmap -b 4096 -ub 4096 -fa 1 -ctk q8_0 -ctv q8_0 -m /var/lib/lemonade/.cache/huggingface/hub/models--unsloth--Qwen3.6-35B-A3B-GGUF/snapshots/a483e9e6cbd595906af30beda3187c2663a1118c/Qwen3.6-35B-A3B-UD-Q4_K_XL.gguf --ctx-size 262144 --jinja

Also tried with
models--unsloth--gemma-4-E4B-it-GGUF/snapshots/653803f092503c04a65164346f3208a36e707693/gemma-4-E4B-it-Q4_K_M.gguf

and I got rid of the extra ctk/ctv/ub/b/ctx-size flags too

petmav · 2026-05-21T12:04:39Z

I've made a separate issue for the ? issues, as they might be more of an upstream thing, lmk if it should just be merged here #98

soulafein83 · 2026-06-08T13:01:50Z

ggml-org/llama.cpp#24129

I think the issue was on the llama.cpp side. Pull Request #24129, which has now been merged, should fix it.

Update:
Just wanted to confirm that the llama.cpp ROCm 7.13 build (tag b9559) from here:
https://github.com/lemonade-sdk/llama.cpp/releases/tag/b9559
works completely out of the box on gfx1152 (tested on a laptop with Ryzen 7 AI 350).
Verified with:

Gemma-4-e4b
Qwen3.5 9b

ckuethe · 2026-06-11T07:26:18Z

using upstream llama.cpp, rocm 7.13 I'm seeing repeated and serious success.

running an overnight test with lemonade bench, but a few multiturn sessions have worked with bonsai-1.7B, through gemma4-12B and qwen3.6-35B, up to qwen3-coder-next-80B

soulafein83 · 2026-06-14T09:57:54Z

llama-b9628.txt
These are my logs from llama-bench on my laptop with Ryzen 7 AI 350 (KrackanPoint) 16 gb

Add gfx1152 support

b3e647d

danielholanda self-assigned this May 20, 2026

danielholanda mentioned this pull request May 20, 2026

Support for gfx1152 and gfx1153 #50

Open

simpler description

1b2877c

danielholanda changed the title ~~Add gfx1152 support [DRAFT]~~ Add gfx1152 support May 20, 2026

danielholanda marked this pull request as draft May 20, 2026 04:32

danielholanda marked this pull request as ready for review May 20, 2026 04:32

soulafein83 mentioned this pull request Jun 15, 2026

Feature Request: Add ROCm support for gfx1152 (RDNA 3.5 APUs like Ryzen AI 300 series #112

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add gfx1152 support#97

Add gfx1152 support#97
danielholanda wants to merge 2 commits into
mainfrom
dholanda/gfx1152

danielholanda commented May 20, 2026

Uh oh!

clee commented May 20, 2026

Uh oh!

danielholanda commented May 20, 2026

Uh oh!

petmav commented May 21, 2026 •

edited

Loading

Uh oh!

sofiageo commented May 21, 2026

Uh oh!

soulafein83 commented May 21, 2026

Uh oh!

ckuethe commented May 21, 2026

Uh oh!

mgehre-amd commented May 21, 2026 •

edited

Loading

Uh oh!

ckuethe commented May 21, 2026

Uh oh!

petmav commented May 21, 2026 •

edited

Loading

Uh oh!

soulafein83 commented Jun 8, 2026 •

edited

Loading

Uh oh!

ckuethe commented Jun 11, 2026

Uh oh!

soulafein83 commented Jun 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Uh oh!

Conversation

danielholanda commented May 20, 2026

Summary

Uh oh!

clee commented May 20, 2026

Uh oh!

danielholanda commented May 20, 2026

Uh oh!

petmav commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sofiageo commented May 21, 2026

Uh oh!

soulafein83 commented May 21, 2026

Uh oh!

ckuethe commented May 21, 2026

Uh oh!

mgehre-amd commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ckuethe commented May 21, 2026

Uh oh!

petmav commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

soulafein83 commented Jun 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ckuethe commented Jun 11, 2026

Uh oh!

soulafein83 commented Jun 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

petmav commented May 21, 2026 •

edited

Loading

mgehre-amd commented May 21, 2026 •

edited

Loading

petmav commented May 21, 2026 •

edited

Loading

soulafein83 commented Jun 8, 2026 •

edited

Loading