Mozoltov821 commented Nov 16, 2025

We should not pad head_dim in weight_load, since it will be padded in the ragged page attention kernel.
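
As a rough illustration of why padding inside the kernel is enough, here is a minimal pure-Python sketch. It is not code from this PR or the repository: `pad_head_dim`, `dot`, and the `multiple=4` alignment are hypothetical names and values chosen for the example. It only shows that zero-padding the head dimension leaves the query-key score unchanged, so weights can stay unpadded until the kernel pads them.

```python
# Hypothetical illustration; none of these names come from the PR or the repository.

def pad_head_dim(vec, multiple=4):
    """Zero-pad one head vector so its length is a multiple of `multiple`."""
    pad = (-len(vec)) % multiple
    return list(vec) + [0.0] * pad

def dot(a, b):
    """Plain dot product, standing in for a single query-key attention score."""
    return sum(x * y for x, y in zip(a, b))

# head_dim = 3, kept unpadded after weight loading (assumed setup).
q = [0.5, -1.0, 2.0]
k = [1.5, 0.25, -0.5]

# Padding applied later, as the attention kernel is described as doing.
q_pad = pad_head_dim(q, multiple=4)
k_pad = pad_head_dim(k, multiple=4)

# The padded positions contribute only 0 * 0 terms, so the score is unchanged.
assert dot(q, k) == dot(q_pad, k_pad)

# Padding an already aligned vector adds nothing.
assert pad_head_dim(q_pad, multiple=4) == q_pad
```

Under these assumptions, padding once at the kernel boundary gives the same scores as padding at load time.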

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

  • Please use English, otherwise it will be closed.
  • The purpose of the PR, or link existing issues this PR will resolve.
  • The test plan, such as providing test command.
  • (Optional) The necessary documentation update.

SiqiLi-Fighting self-requested a review November 16, 2025 05:28