v0.3.1

Latest

Latest

justinchuby released this 26 Jun 19:26

2d603fb

What's Changed

Breaking Changes

[rewriter] Decouple llama rule sets and make API explicit by @copilot-swe-agent in #2388

Core ONNX Script

Update autocast.py to fix attribute creation error by @justinchuby in #2365

Optimizer and rewriter

SDPA fusion cleanup by @gramalingam in #2352
Enable CSE in optimizer by @titaiwangms in #2361
Test SDPA fusion via MHA by @gramalingam in #2366
Always fold the Transpose node in the constant folder by @justinchuby in #2355
Fix fused matmul check/rewrite functions by @bmehta001 in #2331
Cast-cast elimination by @gramalingam in #2368
Re-enable fused matmul rules by @bmehta001 in #2370
Fusion extensions to improve GQA fusion by @gramalingam in #2374
fix: pattern match gelu from contrib and onnx ops🐛 by @KarelZe in #2364
[Rewriter]: Add ∘ MatMul -> Gemm by @AyoubMDL in #2356
Fixes to MHA fusion by @gramalingam in #2380
fix: check for rank of bias in bias-gelu fusion🐛 by @KarelZe in #2393
[torchlib] Implement aten__upsample_bicubic2d_aa and aten__upsample_bilinear2d_aa functions by @copilot-swe-agent in #2383
Move gemm_to_matmul_add rule to ort fusion rules by @justinchuby in #2398
Fix fusion ordering for partial rotary embedding by @gramalingam in #2402
[optimizer] Replace value.nbytes with value.size by @titaiwangms in #2399

Torch Lib

Add missing converter for _local_scalar_dense by @xadupre in #2367
[torchlib] Unregister aten::max.other by @titaiwangms in #2377
Support dynamic shapes for aten_unfold by @xenova in #2407

Documentation

docs: cleanup documentation for function-based rewrites📄 by @KarelZe in #2359
Updates to the rewriter tutorial by @gramalingam in #2397
Add sphinx inter link to onnx_ir by @justinchuby in #2415

Other Changes

Fix test names for pytest by @justinchuby in #2358
Require onnx-ir 0.1.1 by @justinchuby in #2360
[CI] Fix execnet.gateway_base.DumpError: can't serialize <class 'numpy.int64'> by @titaiwangms in #2379
Move _c_api_utils.py to version_converter package by @copilot-swe-agent in #2401
Fix proto handling in version converter by @justinchuby in #2411
Add phi2/phi4 test cases for mha/gqa fusion by @gramalingam in #2409
Update VERSION to 0.3.1 by @justinchuby in #2414
Bump onnx ir requirement to 0.1.3 by @justinchuby in #2418
Create torch_2_8 apis by @justinchuby in #2419

New Contributors

@KarelZe made their first contribution in #2359
@copilot-swe-agent made their first contribution in #2383
@xenova made their first contribution in #2407

Full Changelog: v0.3.0...v0.3.1

Contributors

gramalingam, justinchuby, and 7 other contributors

Assets 2