@gshtras gshtras commented Oct 1, 2025

Follow-up to #25896, which skipped the ROCm attention backend when adding the new parameter to get_kv_cache_shape.

Signed-off-by: Gregory Shtrasberg <[email protected]>
@mergify mergify bot added the rocm (Related to AMD ROCm) and v1 labels Oct 1, 2025

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request is a bugfix that aligns the get_kv_cache_shape method in RocmAttentionBackend with the base class interface by adding the cache_dtype_str parameter. This change is a follow-up to a broader update that was applied to other attention backends. The new parameter is currently unused within the method, which is consistent with other backends where the KV cache shape does not depend on the cache data type. The change is correct and addresses the interface inconsistency.
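For readers without the diff at hand, the interface mismatch described above can be sketched roughly as follows. This is a minimal illustration, not the actual vLLM code: the class names, the `allocate_shape` helper, the `"auto"` default, and the returned shape layout are all assumptions made for the example. Only the method name `get_kv_cache_shape` and the parameter name `cache_dtype_str` come from this PR.

```python
class BaseBackendSketch:
    """Hypothetical stand-in for the base attention backend interface."""

    @staticmethod
    def get_kv_cache_shape(
        num_blocks: int,
        block_size: int,
        num_kv_heads: int,
        head_size: int,
        cache_dtype_str: str = "auto",  # default value is an assumption
    ) -> tuple[int, ...]:
        raise NotImplementedError


class OutdatedBackendSketch(BaseBackendSketch):
    """A backend that missed the interface update (hypothetical)."""

    @staticmethod
    def get_kv_cache_shape(num_blocks, block_size, num_kv_heads, head_size):
        # Illustrative layout only; the real backend's layout may differ.
        return (2, num_blocks, block_size, num_kv_heads, head_size)


class UpdatedBackendSketch(BaseBackendSketch):
    """The same backend after adding the parameter (hypothetical)."""

    @staticmethod
    def get_kv_cache_shape(
        num_blocks, block_size, num_kv_heads, head_size, cache_dtype_str="auto"
    ):
        # cache_dtype_str is accepted for interface compatibility but unused,
        # because the shape here does not depend on the cache data type.
        return (2, num_blocks, block_size, num_kv_heads, head_size)


def allocate_shape(backend, cache_dtype_str="auto"):
    """Hypothetical caller that always forwards cache_dtype_str by keyword."""
    return backend.get_kv_cache_shape(
        4, 16, 8, 128, cache_dtype_str=cache_dtype_str
    )
```

Under these assumptions, a caller that forwards `cache_dtype_str` gets a `TypeError` from the outdated backend and a normal shape tuple from the updated one, which is the kind of inconsistency this change removes.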

@gshtras gshtras added the ready (ONLY add when PR is ready to merge/full CI is needed) label Oct 1, 2025
@vllm-bot vllm-bot merged commit 0b018d8 into vllm-project:main Oct 2, 2025
48 of 52 checks passed
pdasigi pushed a commit to pdasigi/vllm that referenced this pull request Oct 2, 2025
yewentao256 pushed a commit that referenced this pull request Oct 3, 2025
Signed-off-by: Gregory Shtrasberg <[email protected]>
Signed-off-by: yewentao256 <[email protected]>