@nsingh-habana nsingh-habana commented Nov 10, 2025

Description

Enables the collectiveMma and collectiveEpilogue with the new MMA atoms and new copy atoms for the xe_gemm_s8_s8_s32_tensor_op_s32 unit test. Moves the old test to legacy.

Type

Performance

Testing

Xe20

Dependencies (requires changes from the PR below, which might not be merged yet)

#573

Performance

(image attachment not preserved)

@nsingh-habana nsingh-habana force-pushed the gemm_s8_s8_s32_tensor_op_s32 branch from 91a333d to 3414a0c Compare November 10, 2025 13:14
