What's Changed
Breaking Changes
- [rewriter] Decouple llama rule sets and make API explicit by @copilot-swe-agent in #2388
Core ONNX Script
- Update autocast.py to fix attribute creation error by @justinchuby in #2365
Optimizer and rewriter
- SDPA fusion cleanup by @gramalingam in #2352
- Enable CSE in optimizer by @titaiwangms in #2361
- Test SDPA fusion via MHA by @gramalingam in #2366
- Always fold the Transpose node in the constant folder by @justinchuby in #2355
- Fix fused matmul check/rewrite functions by @bmehta001 in #2331
- Cast-cast elimination by @gramalingam in #2368
- Re-enable fused matmul rules by @bmehta001 in #2370
- Fusion extensions to improve GQA fusion by @gramalingam in #2374
- fix: pattern match gelu from contrib and onnx ops🐛 by @KarelZe in #2364
- [Rewriter]: Add ∘ MatMul -> Gemm by @AyoubMDL in #2356
- Fixes to MHA fusion by @gramalingam in #2380
- fix: check for rank of bias in bias-gelu fusion🐛 by @KarelZe in #2393
- [torchlib] Implement aten__upsample_bicubic2d_aa and aten__upsample_bilinear2d_aa functions by @copilot-swe-agent in #2383
- Move gemm_to_matmul_add rule to ort fusion rules by @justinchuby in #2398
- Fix fusion ordering for partial rotary embedding by @gramalingam in #2402
- [optimizer] Replace value.nbytes with value.size by @titaiwangms in #2399
Torch Lib
- Add missing converter for _local_scalar_dense by @xadupre in #2367
- [torchlib] Unregister aten::max.other by @titaiwangms in #2377
- Support dynamic shapes for aten_unfold by @xenova in #2407
Documentation
- docs: cleanup documentation for function-based rewrites📄 by @KarelZe in #2359
- Updates to the rewriter tutorial by @gramalingam in #2397
- Add sphinx inter link to onnx_ir by @justinchuby in #2415
Other Changes
- Fix test names for pytest by @justinchuby in #2358
- Require onnx-ir 0.1.1 by @justinchuby in #2360
- [CI] Fix execnet.gateway_base.DumpError: can't serialize <class 'numpy.int64'> by @titaiwangms in #2379
- Move _c_api_utils.py to version_converter package by @copilot-swe-agent in #2401
- Fix proto handling in version converter by @justinchuby in #2411
- Add phi2/phi4 test cases for mha/gqa fusion by @gramalingam in #2409
- Update VERSION to 0.3.1 by @justinchuby in #2414
- Bump onnx ir requirement to 0.1.3 by @justinchuby in #2418
- Create torch_2_8 apis by @justinchuby in #2419
New Contributors
- @KarelZe made their first contribution in #2359
- @copilot-swe-agent made their first contribution in #2383
- @xenova made their first contribution in #2407
Full Changelog: v0.3.0...v0.3.1