Gate invalid triton autotune configs in AOTInductor for GFX95+ #4940

JChunX · 2025-09-26T23:48:23Z

Summary:
Saw lowering error when lowering models on MI350X with FP8 PyTorch:
P1966277532

Issue arises from lack of instruction support for BLOCK_K <= 64 when matrix_instr_nonkdim=16 on GFX95+ Hardware. This was previously patched for FP8 Triton in D81180838, but now error is showing up in AOTI codepaths with FP8 PyTorch.

Differential Revision: D83383625

Summary: Saw lowering error when lowering models on MI350X with FP8 PyTorch: P1966277532 Issue arises from lack of instruction support for BLOCK_K <= 64 when matrix_instr_nonkdim=16 on GFX95+ Hardware. This was previously patched for FP8 Triton in D81180838, but now error is showing up in AOTI codepaths with FP8 PyTorch. Differential Revision: D83383625

netlify · 2025-09-26T23:48:28Z

✅ Deploy Preview for pytorch-fbgemm-docs ready!

Name	Link
🔨 Latest commit	`66d3d30`
🔍 Latest deploy log	https://app.netlify.com/projects/pytorch-fbgemm-docs/deploys/68d7264a1236520008f4811f
😎 Deploy Preview	https://deploy-preview-4940--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

facebook-github-bot · 2025-09-26T23:48:38Z

@JChunX has exported this pull request. If you are a Meta employee, you can view the originating diff in D83383625.

meta-cla bot added the cla signed label Sep 26, 2025

facebook-github-bot added fb-exported meta-exported labels Sep 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Gate invalid triton autotune configs in AOTInductor for GFX95+ #4940

Gate invalid triton autotune configs in AOTInductor for GFX95+ #4940

Uh oh!

JChunX commented Sep 26, 2025

Uh oh!

netlify bot commented Sep 26, 2025 •

edited

Loading

Uh oh!

facebook-github-bot commented Sep 26, 2025

Uh oh!

Uh oh!

Gate invalid triton autotune configs in AOTInductor for GFX95+ #4940

Are you sure you want to change the base?

Gate invalid triton autotune configs in AOTInductor for GFX95+ #4940

Uh oh!

Conversation

JChunX commented Sep 26, 2025

Uh oh!

netlify bot commented Sep 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for pytorch-fbgemm-docs ready!

Uh oh!

facebook-github-bot commented Sep 26, 2025

Uh oh!

Uh oh!

netlify bot commented Sep 26, 2025 •

edited

Loading