@adobrzyn
Collaborator

@adobrzyn adobrzyn commented Oct 6, 2025

No description provided.

…utput shape (#176)

- After [#24772](vllm-project/vllm#24772) an
assertion on input shape in `Qwen3MoeSparseMoeBlock` exposes a design
compatibility issue: the block's input
is expected to flatten `batch_size x seqlen`
- This PR aligns the attention output as such, and restores the output
shape of the forward pass afterwards
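
The flatten-then-restore pattern described above can be sketched as follows. This is a minimal illustration in plain Python with nested lists standing in for tensors, not the code from this PR: `flatten_tokens`, `restore_shape`, `toy_moe_block`, and `forward` are hypothetical names, and the toy block only mimics a layer that asserts a 2-D `(batch_size * seqlen, hidden)` input.

```python
from typing import List

Token = List[float]          # one hidden-state vector
Batch3D = List[List[Token]]  # [batch_size][seqlen][hidden]


def flatten_tokens(hidden: Batch3D) -> List[Token]:
    """Collapse [batch_size][seqlen][hidden] into [batch_size * seqlen][hidden]."""
    return [token for sequence in hidden for token in sequence]


def restore_shape(flat: List[Token], batch_size: int, seq_len: int) -> Batch3D:
    """Undo flatten_tokens, regrouping tokens into their original sequences."""
    assert len(flat) == batch_size * seq_len, "token count does not match shape"
    return [flat[b * seq_len:(b + 1) * seq_len] for b in range(batch_size)]


def toy_moe_block(tokens: List[Token]) -> List[Token]:
    """Hypothetical stand-in for a block that asserts a flattened 2-D input."""
    assert all(isinstance(t[0], (int, float)) for t in tokens), "expected 2-D input"
    return [[2.0 * x for x in t] for t in tokens]


def forward(hidden: Batch3D) -> Batch3D:
    """Flatten before the block, then restore the caller-visible shape."""
    batch_size, seq_len = len(hidden), len(hidden[0])
    flat = flatten_tokens(hidden)   # shape the block expects
    out = toy_moe_block(flat)
    return restore_shape(out, batch_size, seq_len)
```

With real tensors the same idea would typically be a reshape before the block and a reshape back afterwards; the list version only makes the bookkeeping explicit.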

---------

Signed-off-by: attafosu <[email protected]>
Co-authored-by: Chendi.Xue <[email protected]>
@michalkuligowski
Collaborator

/run-gaudi-tests

@xuechendi
Collaborator

Duplicated PR to: #212

@michalkuligowski
Collaborator

/run-gaudi-tests

@github-actions

github-actions bot commented Oct 8, 2025

✅ CI Passed

All checks passed successfully against the following vllm commit:
da3fa78dc98f3001e5fb703729a77311146e0cd3

@wpyszka wpyszka merged commit 69bdc7a into v0.10.2_next Oct 8, 2025
44 of 45 checks passed
adobrzyn added a commit that referenced this pull request Oct 13, 2025
…odel o… (#316)

Signed-off-by: attafosu <[email protected]>
Co-authored-by: Thomas Atta-Fosu <[email protected]>
Co-authored-by: Chendi.Xue <[email protected]>
Co-authored-by: Michał Kuligowski <[email protected]>
@adobrzyn adobrzyn deleted the adobrzyn/port_176_next branch October 13, 2025 08:56
5 participants