Releases: EnzymeAD/Reactant.jl
v0.2.18
Reactant v0.2.18
Merged pull requests:
- linearize kernel args (#497) (@mofeing)
- Ka2 (#498) (@wsmoses)
- Regenerate MLIR Bindings (#501) (@github-actions[bot])
- linearize aliased kernel args (#504) (@jumerckx)
- Split `should_rewrite_ft` for `call` and `invoke` expressions, and overlay `Base._unique_dims` (#505) (@mofeing)
- feat: add rsqrt simplification (#506) (@avik-pal)
- Regenerate MLIR Bindings (#507) (@github-actions[bot])
- Format code of branch "main" (#509) (@github-actions[bot])
- feat: optimization passes (#510) (@avik-pal)
- Regenerate MLIR Bindings (#513) (@github-actions[bot])
- Make vcat and hcat with numbers work. (#514) (@jaeminoh)
- XLA Allocator stats (#517) (@Pangoraw)
- fix: generalize broadcast_in_dims for setindex (#518) (@avik-pal)
- Format code of branch "main" (#520) (@github-actions[bot])
- WIP: adapt to sroa jll (#521) (@wsmoses)
- Kernel: support constant input arg (#522) (@wsmoses)
- Implement `isnan` and `isfinite` for TracedRNumber (#525) (@Pangoraw)
- Format code of branch "main" (#528) (@github-actions[bot])
- feat: sorting and related functions (#529) (@avik-pal)
- Regenerate MLIR Bindings (#531) (@github-actions[bot])
- Format code of branch "main" (#533) (@github-actions[bot])
- Generalize precompilation support (#534) (@wsmoses)
- More constprop (#536) (@wsmoses)
- Fix tolerance on loggamma integration test (#537) (@wsmoses)
- Fix missing dialects in docs (#538) (@wsmoses)
Closed issues:
- minimize XLA error in gemm_autotuner for CUDA (#444)
- Not support `partialsortperm`? (#485)
- KernelAbstractions + Reactant: UndefVarError: `pop` not defined (#488)
- Infinite recursion on `unique(::Vector{Symbol})` within Reactant (#493)
- Concatenation of scalar and ConcreteRArray gives a Vector (#511)
- Incorrect `broadcast_to_size` implementation (#512)
- How to set NaN values in an RArray to a certain number? (#524)
- Precompilation of Reactant 0.2.1x fails (#527)
v0.2.17
Reactant v0.2.17
Merged pull requests:
- gpu, llvm and nvvm dialect bindings (#315) (@jumerckx)
- fix: recursion in AbsInt working (#483) (@avik-pal)
- add test for #226 (#486) (@jumerckx)
- fix: missing scalar indexing check for setindex (#491) (@avik-pal)
- fix: incorrect IR for traced RNGs (#494) (@avik-pal)
Closed issues:
v0.2.16
Reactant v0.2.16
Merged pull requests:
- CuArray tracing (#475) (@wsmoses)
- Improve error message on broadcast (#476) (@wsmoses)
- Fix traced rnumber promotion (#477) (@wsmoses)
- Fix type unstable stack (#478) (@wsmoses)
- Rewrite apply iterate (#479) (@wsmoses)
- Improve cuda error (#480) (@wsmoses)
- Format code of branch "main" (#481) (@github-actions[bot])
v0.2.15
Reactant v0.2.15
Merged pull requests:
- precompile first interpretation (#353) (@glou-nes)
- [GHA] Set author of PRs created by scheduled jobs to Enzyme bot (#455) (@giordano)
- feat: partial support for boolean indexing (#457) (@avik-pal)
- feat: `no_nan` mode and more passes (#458) (@avik-pal)
- Fix cuda isdefined error (#459) (@wsmoses)
- feat: more trigonometric functions (#462) (@avik-pal)
- fix: check for name and module in function (#463) (@avik-pal)
- Fix offsetarrays support (#464) (@wsmoses)
- Fix opaqueclosure Worldage issue (#465) (@wsmoses)
- Format code of branch "main" (#466) (@github-actions[bot])
- CompatHelper: add new compat entry for OffsetArrays in [weakdeps] at version 1 for package ., (keep existing compat) (#467) (@github-actions[bot])
- CompatHelper: add new compat entry for OffsetArrays at version 1 for package test, (keep existing compat) (#468) (@github-actions[bot])
- Update index.md (#469) (@x66ccff)
- Format code of branch "main" (#471) (@github-actions[bot])
- feat: missing mean(f, ...) dispatches (#472) (@avik-pal)
- Update index.md, delete the
- Provide better error message if calling thunk with wrong types (#474) (@wsmoses)
Closed issues:
v0.2.14
Reactant v0.2.14
Merged pull requests:
- More mul overloads (#446) (@wsmoses)
- fix: use pip for jax (#450) (@avik-pal)
- Pipeline for nested enzyme differentiation (#452) (@wsmoses)
- Revert modularize bazel build to unbreak yggy (#453) (@wsmoses)
- CompatHelper: add new compat entry for SpecialFunctions in [weakdeps] at version 2 for package ., (keep existing compat) (#454) (@github-actions[bot])
Closed issues:
v0.2.13
Reactant v0.2.13
Merged pull requests:
- `SpecialFunctions` simple functions (#384) (@glou-nes)
- Modularize Bazel build (#421) (@mofeing)
- feat: support Base.stack (#433) (@avik-pal)
- Fix mul overload (#440) (@wsmoses)
- fix: handle aos for mul (#441) (@avik-pal)
- feat: missing BF16 dispatches (#443) (@avik-pal)
- refactor: rework TracedRNG to be similar to other types (#448) (@avik-pal)
Closed issues:
- 2nd order AD fails (#298)
v0.2.12
Reactant v0.2.12
Merged pull requests:
- feat: indexing using traced values (#434) (@avik-pal)
- Regenerate MLIR Bindings (#435) (@github-actions[bot])
- fix: try building with cudnn 9.4 (#436) (@avik-pal)
- chore: bump jll (#437) (@avik-pal)
Closed issues:
- Scalar indexing triggered when requesting a view of a traced array (#408)
v0.2.11
Reactant v0.2.11
Merged pull requests:
- CompatHelper: add new compat entry for YaoBlocks in [weakdeps] at version 0.13, (keep existing compat) (#312) (@github-actions[bot])
- WIP: kernels (#314) (@wsmoses)
- test: unbreak CUDA CI (#337) (@avik-pal)
- feat: correct handling of wrapped arrays functionalities (#342) (@avik-pal)
- reduce codegen on `YaoBlocks.mat` methods (#343) (@mofeing)
- refactor: use Ops instead of direct stablehlo calls (#347) (@avik-pal)
- Run CI on aarch64 (#350) (@mofeing)
- add missing @test (#352) (@glou-nes)
- Add Ops.hlo_call(::String, args...) (#358) (@Pangoraw)
- Kernels: support Julia 1.11 (#359) (@jumerckx)
- Define outer `repeat` method for `TracedRArray` (#361) (@AntonOresten)
- docs: add Reactant.Ops docs to the website (#362) (@avik-pal)
- feat: tracing Random.jl functionality correctly (#363) (@avik-pal)
- Interp2 (#365) (@wsmoses)
- fix: ensure printing of wrapped ConcreteRArrays goes through our show (#367) (@avik-pal)
- feat: add support for the remaining wrapper types (#369) (@avik-pal)
- Format code of branch "main" (#371) (@github-actions[bot])
- `stablehlo.sort` Ops (#374) (@glou-nes)
- No Transpose emission for 0 rank tensor (#375) (@glou-nes)
- fix: include files if they end with .jl (#377) (@avik-pal)
- refactor: move overrides into a separate file (#379) (@avik-pal)
- CompatHelper: bump compat for Flux to 0.16 for package test, (keep existing compat) (#383) (@github-actions[bot])
- Fix traced_getfield (#385) (@wsmoses)
- fix `backtrace` calls on debug mode (#387) (@mofeing)
- Kernel-supporting jll (#389) (@wsmoses)
- fix: override to overlay (#390) (@avik-pal)
- CompatHelper: add new compat entry for CUDA in [weakdeps] at version 5 for package ., (keep existing compat) (#391) (@github-actions[bot])
- CompatHelper: add new compat entry for CUDA at version 5 for package test, (keep existing compat) (#392) (@github-actions[bot])
- fix: missing interpolation in compile (#394) (@avik-pal)
- CompatHelper: add new compat entry for Random123 at version 1 for package test, (keep existing compat) (#395) (@github-actions[bot])
- CompatHelper: add new compat entry for StatsBase at version 0.34 for package test, (keep existing compat) (#396) (@github-actions[bot])
- CompatHelper: add new compat entry for StableRNGs at version 1 for package test, (keep existing compat) (#397) (@github-actions[bot])
- CompatHelper: add new compat entry for Distributions at version 0.25 for package test, (keep existing compat) (#398) (@github-actions[bot])
- CompatHelper: add new compat entry for HypothesisTests at version 0.11 for package test, (keep existing compat) (#399) (@github-actions[bot])
- feat: bump jll version and add opt pass (#400) (@avik-pal)
- ci: disable non-functional CI (#401) (@avik-pal)
- feat: add a no_enzyme option (#402) (@avik-pal)
- bump reactant jll (#404) (@Pangoraw)
- Inference cache (#405) (@jumerckx)
- Pythoncall (#407) (@wsmoses)
- JLL: enable throwing better errors (#409) (@wsmoses)
- Regenerate MLIR Bindings (#410) (@github-actions[bot])
- Format code of branch "main" (#411) (@github-actions[bot])
- CompatHelper: add new compat entry for PythonCall in [weakdeps] at version 0.9 for package ., (keep existing compat) (#412) (@github-actions[bot])
- CompatHelper: add new compat entry for PythonCall at version 0.9 for package test, (keep existing compat) (#413) (@github-actions[bot])
- generate EnzymeXLA dialect bindings (#414) (@mofeing)
- refactor: move python testing to integration (#415) (@avik-pal)
- fix: handle traced array returns inside objects (#417) (@avik-pal)
- Improve reactant error messages (#418) (@wsmoses)
- Fix ReactantPythonCallExt.jl (#419) (@wsmoses)
- feat: add zero and fill! for ConcreteRArray (#420) (@avik-pal)
- Fix error on global (#422) (@wsmoses)
- Cuv2 (#423) (@wsmoses)
- disable absint of absint (#424) (@wsmoses)
- fix: make eltype of Traced/Concrete Arrays to be respective RNumbers (#426) (@avik-pal)
- CUDA kernels take 3 (#427) (@wsmoses)
- Format code of branch "main" (#432) (@github-actions[bot])
Closed issues:
- `getindex` assumes contiguous indexing (#242)
- Incorrect stableHLO for grouped convolution gradient (#331)
- `stablehlo.cholesky` inconsistent on GPU vs CPU (#338)
- Incorrect traced code for upsampling functions (#339)
- Handling Wrapped Arrays Correctly (#345)
- `dynamic_slice` (#349)
- Handling random numbers correctly (#356)
- Cannot compile function with Module as parameter (#357)
- Cannot repeat >2-dimensional arrays (#360)
- Segfault on Integer Overflow (#368)
- [BUG]: Segfault with large amount of data (#380)
- Compile time scales with MPI ranks (and it shouldn't) (#386)
- Graceful termination on out of memory (#406)
- Autodiff code generation fails when caching values (#416)