Arm backend: Add Qwen3 VL language model MXFP8 test #20533
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20533
Note: Links to docs will display an error until the docs builds have been completed.
❌ You can merge normally! (3 Unrelated Failures), 1 Unclassified Failure
As of commit 0728a47 with merge base a2438e4:
UNCLASSIFIED FAILURE - DrCI could not classify the following job because the workflow did not run on the merge base. The failure may be pre-existing on trunk or introduced by this PR:
BROKEN TRUNK - The following jobs failed but were present on the merge base:
👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes.
One or more co-authors of this pull request were not found. You must specify co-authors in commit message trailer via: Supported
Alternatively, if the co-author should not be included, remove the Please update your commit message(s) by doing
Pull request overview
Adds an end-to-end Arm TOSA test to validate MXFP8 (FP8 e4m3) conversion for the Qwen3-VL TextModel path, ensuring BF16 + MXFP lowering and accuracy checks work together for this model family.
Changes:
- Introduces an MXFP8 + BF16 TOSA pipeline test for `TextModelWrapper` using `MXFPTosaPipelineFP`.
- Adds a Linear-module filter and a post-export op-count assertion to verify all `nn.Linear` layers are rewritten to `torch.ops.tosa_mxfp.linear.default`.
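For readers unfamiliar with the MXFP8 (FP8 e4m3) conversion named in the overview, the following is a minimal numerical sketch of block-scaled quantization in plain Python. It is not the code added by this PR, nor the ExecuTorch or TOSA implementation. It assumes the common microscaling convention of 32-element blocks that share one power-of-two scale, with each element rounded to the e4m3 grid (3 mantissa bits, largest finite value 448). All function and constant names are hypothetical.

```python
import math

E4M3_MAX = 448.0      # largest finite e4m3 magnitude (1.75 * 2**8)
E4M3_EMAX = 8         # exponent of that largest value
E4M3_MIN_EXP = -6     # smallest normal exponent; below this the grid is subnormal
MANTISSA_BITS = 3
BLOCK_SIZE = 32       # assumed number of elements sharing one scale


def round_to_e4m3(x: float) -> float:
    """Round x to the nearest value on the e4m3 grid, saturating at +/-448."""
    if x == 0.0:
        return 0.0
    # frexp returns |x| = m * 2**e with 0.5 <= m < 1, so floor(log2|x|) = e - 1.
    exp = max(math.frexp(abs(x))[1] - 1, E4M3_MIN_EXP)
    step = 2.0 ** (exp - MANTISSA_BITS)
    q = round(x / step) * step  # Python's round() is round-half-to-even
    return max(-E4M3_MAX, min(E4M3_MAX, q))


def quantize_block(block):
    """Return (shared_exponent, e4m3 elements) for one block of floats."""
    amax = max(abs(v) for v in block)
    if amax == 0.0:
        return 0, [0.0] * len(block)
    # Pick the scale so the largest element lands in the top e4m3 binade.
    shared_exp = (math.frexp(amax)[1] - 1) - E4M3_EMAX
    shared_exp = max(-127, min(127, shared_exp))  # 8-bit exponent-only scale
    scale = 2.0 ** shared_exp
    return shared_exp, [round_to_e4m3(v / scale) for v in block]


def dequantize_block(shared_exp, elements):
    """Undo the shared scale to recover approximate float values."""
    scale = 2.0 ** shared_exp
    return [e * scale for e in elements]
```

Calling `quantize_block` on a 32-element slice of a weight row and then `dequantize_block` on the result gives the values a block-scaled linear layer would effectively compute with. The gap between the original and round-tripped values is the kind of error an accuracy check on the converted model has to tolerate.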
* Add an E2E TOSA test for the Qwen3 VL language model with linear layers converted to MXFP8.
  Signed-off-by: Baris Demir <baris.demir@arm.com>
  Signed-off-by: Tom Allsop <tom.allsop@arm.com>
  Co-authored-by: Baris Demir <baris.demir@arm.com>
1038f14 to 0728a47
/easycla
cc @digantdesai @freddan80 @per @zingo @oscarandersson8218 @mansnils @Sebastian-Larsson @robell @rascani