[GAUDISW-245117] add b2b matmul #770

linoybu · 2026-01-01T13:11:48Z

No description provided.

github-actions · 2026-01-01T13:11:59Z

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

Copilot

Pull request overview

This PR introduces a B2BMatmul class for batch-to-block matrix multiplication operations. The change replaces the generic Matmul class with the new B2BMatmul class for batch2block_matmul and block2batch_matmul operations in the attention backend.

Added new B2BMatmul class that inherits from Matmul
Updated attention backend to use B2BMatmul for batch-to-block operations
Modified import statement to include the new class

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

File	Description
vllm_gaudi/extension/utils.py	Defines the new `B2BMatmul` class inheriting from `Matmul`
vllm_gaudi/attention/backends/hpu_attn.py	Updates `batch2block_matmul` and `block2batch_matmul` to use `B2BMatmul` instead of `Matmul`
vllm_gaudi/extension/ops.py	Contains an apparent typo in variable name

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

vllm_gaudi/extension/ops.py

github-actions · 2026-01-05T12:47:25Z

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

Signed-off-by: linoy buchnik <[email protected]>

Copilot

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

vllm_gaudi/extension/utils.py

linoybu · 2026-01-06T07:27:00Z

@copilot open a new pull request to apply changes based on the comments in this thread

Co-authored-by: Copilot <[email protected]> Signed-off-by: Linoy Buchnik <[email protected]>

linoybu · 2026-01-06T07:29:13Z

@copilot open a new pull request to apply changes based on the comments in this thread

dudilester · 2026-01-06T07:36:07Z

vllm_gaudi/extension/utils.py

+    This class is intentionally kept functionally identical to ``Matmul``.
+    It exists to provide semantic distinction in the codebase (e.g., for
+    patterns that specifically require back-to-back matmul) and to allow
+    future customization without changing call sites.


maybe edit the comment to be more specific, change back-to-back to batch2block/block2batch and explain the reasoning for it, that it is used by the INC to adjust the scale to the needed values of the input tensor as some of them are discarded by the 2nd input which is kind of a mask mapping

Signed-off-by: Linoy Buchnik <[email protected]>

Copilot AI review requested due to automatic review settings January 1, 2026 13:11

linoybu requested review from adobrzyn, afierka-intel, iboiko-habana, kamil-kaczor, ksmusz, kzawora-intel, mgawarkiewicz-intel, michalkuligowski and xuechendi as code owners January 1, 2026 13:11

Copilot AI reviewed Jan 1, 2026

View reviewed changes

vllm_gaudi/extension/ops.py Outdated Show resolved Hide resolved

dudilester approved these changes Jan 1, 2026

View reviewed changes

github-actions bot mentioned this pull request Jan 1, 2026

🚦 Team Review Dashboard #701

Open

[GAUDISW-245117] add b2b matmul

bbebcc1

Signed-off-by: linoy buchnik <[email protected]>

linoybu force-pushed the add_b2b_matmul branch from 7cd4a46 to bbebcc1 Compare January 5, 2026 12:52

linoybu requested a review from Copilot January 6, 2026 07:25

Copilot AI reviewed Jan 6, 2026

View reviewed changes

vllm_gaudi/extension/utils.py Show resolved Hide resolved

Update vllm_gaudi/extension/utils.py

4626d66

Co-authored-by: Copilot <[email protected]> Signed-off-by: Linoy Buchnik <[email protected]>

linoybu closed this Jan 6, 2026

linoybu reopened this Jan 6, 2026

dudilester reviewed Jan 6, 2026

View reviewed changes

linoybu added 3 commits January 7, 2026 11:22

Update utils.py

8a8ea09

Signed-off-by: Linoy Buchnik <[email protected]>

Update hpu_attn.py

fc8aab7

Signed-off-by: Linoy Buchnik <[email protected]>

Update utils.py

c64b088

Signed-off-by: Linoy Buchnik <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[GAUDISW-245117] add b2b matmul #770

[GAUDISW-245117] add b2b matmul #770

Uh oh!

linoybu commented Jan 1, 2026

Uh oh!

github-actions bot commented Jan 1, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

github-actions bot commented Jan 5, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

linoybu commented Jan 6, 2026

Uh oh!

linoybu commented Jan 6, 2026

Uh oh!

dudilester Jan 6, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[GAUDISW-245117] add b2b matmul #770

Are you sure you want to change the base?

[GAUDISW-245117] add b2b matmul #770

Uh oh!

Conversation

linoybu commented Jan 1, 2026

Uh oh!

github-actions bot commented Jan 1, 2026

🚧 CI Blocked

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

github-actions bot commented Jan 5, 2026

🚧 CI Blocked

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

linoybu commented Jan 6, 2026

Uh oh!

linoybu commented Jan 6, 2026

Uh oh!

dudilester Jan 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

dudilester Jan 6, 2026 •

edited

Loading