forked from EnzymeAD/Reactant.jl
Thunk Change #10
Closed
Use `[sources]` section of `docs/Project.toml` instead of messing with `LOAD_PATH`.
* fix: inconsistent return dims * test: inconsistent indexing * fix: inconsistent dimensions inside gather getindex
* fix: define getindexing into sub reshaped array * test: failure case
Co-authored-by: enzyme-ci-bot[bot] <78882869+enzyme-ci-bot[bot]@users.noreply.github.com>
…nzymeAD#560) * [CI] Format generated files twice to work around JuliaFormatter bug * [CI] Don't run benchmarks on PRs from forks
* CUDA: fix nv intrinsic errs * Update ext/ReactantCUDAExt.jl Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> --------- Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Restrict running different workflows based on the touched files. This should _slightly_ reduce CI workload.
* respect scoping rules in for * `@isdefined` * Update ReactantCore.jl * fix
Co-authored-by: enzyme-ci-bot[bot] <78882869+enzyme-ci-bot[bot]@users.noreply.github.com>
* feat: support arbitrary structures in while loop * test: add nested structure test * feat: if conditions can now handle arbitrary structures * fix: mutation support inside if condition * fix: restore while support * Update src/TracedUtils.jl
* profiler: Add option to generate perfetto url * remove sort * Update api.md
* More jll/cuda stuff * Apply suggestions from code review Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * force xla * add set arg attr * Update WORKSPACE * fix support * Now with noalias * Update ext/ReactantCUDAExt.jl Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * Update Project.toml * we are now sane * now with alignment * Update src/Profiler.jl Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> --------- Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: enzyme-ci-bot[bot] <78882869+enzyme-ci-bot[bot]@users.noreply.github.com>
* Allocator stats docs * Add API to create profiler annotations * profiling tutorial * sig cleanup * vitepress config * vitepress config 2 * note about allocatorstats
* PTX fma and other flags * we can keep cuda debug info now, that's cool * version bump * Update XLA.jl * Update src/XLA.jl Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> --------- Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
* [CI] Move tests on aarch64 linux to GitHub Actions * Fix name of aarch64-linux runners Co-authored-by: Ian McInerney <[email protected]> * Skip CUDA integration tests that could fail --------- Co-authored-by: Ian McInerney <[email protected]>
* Fix condition to skip CUDA tests on aarch64 * Skip more tests
* feat: expose more XLA GPU options to the user * feat: check for bazel install * fix: load env vars before * Update deps/build_local.jl Co-authored-by: Mosè Giordano <[email protected]> * Update Project.toml --------- Co-authored-by: Mosè Giordano <[email protected]> Co-authored-by: William Moses <[email protected]>
Co-authored-by: enzyme-ci-bot[bot] <78882869+enzyme-ci-bot[bot]@users.noreply.github.com>
…zymeAD#825) * feat: support implicit padding from XLA * feat: use XLA for shard-info if we need padding * test: padding for sharding * fix: return type
Co-authored-by: enzyme-ci-bot[bot] <78882869+enzyme-ci-bot[bot]@users.noreply.github.com>
* Fix LLVM dialect registration * Add functionality for parsing single operations * Bump Enzyme-JAX commit
macOS runners occasionally hang. A shorter timeout makes the job fail early instead of waiting uselessly for 90 minutes, which matters because there are only 5 concurrent macOS runners across the entire organisation.
* docs: housekeeping + memref dialect * docs: autoadd dialects
* `stablehlo.sort` Ops (EnzymeAD/Reactant.jl#374)
* `SpecialFunctions` simple functions (EnzymeAD/Reactant.jl#384)
* feat: `no_nan` mode and more passes (EnzymeAD/Reactant.jl#458)
* Implement `isnan` and `isfinite` for TracedRNumber (EnzymeAD/Reactant.jl#525)
* Split `should_rewrite_ft` for `call` and `invoke` expressions, and overlay `Base._unique_dims` (EnzymeAD/Reactant.jl#505)
* [GHA] Add `paths` settings for workflow triggers (EnzymeAD/Reactant.jl#563)
* [CI] Remove useless call to `Pkg.instantiate` (EnzymeAD/Reactant.jl#576)
* Fix dense elements attribute in `Enzyme.autodiff` #593, "Tenet + Reactant + Enzyme.gradient broken on last releases" (EnzymeAD/Reactant.jl#604)
* Use `xla/stream_executor/cuda:cuda_compute_capability_proto_cc_impl` only on non CUDA (EnzymeAD/Reactant.jl#646)
* Replace `trim` -> `strip` (EnzymeAD/Reactant.jl#661)
* [ReactantExtra] Add argument to `ClientCompile` to pass CUDA data dir (EnzymeAD/Reactant.jl#683)
* Use `LLVMOpenMP_jll` to call OpenMP functions (EnzymeAD/Reactant.jl#673)
* make `similar` return empty tensors. (EnzymeAD/Reactant.jl#632)
* `@trace` function calls (EnzymeAD/Reactant.jl#366)
* [deps] Some improvements to the `build_local.jl` script (EnzymeAD/Reactant.jl#689)
* [ReactantExtra] Stop removing references to `hardware_interference_size` (EnzymeAD/Reactant.jl#704)
* fix `Type(value)` instead of `type(value)` (EnzymeAD/Reactant.jl#733)
* Support tracing of `rem` with only one operand being a `ConcreteRNumber` (EnzymeAD/Reactant.jl#754)
* Fix implementation of `mod` (EnzymeAD/Reactant.jl#758)
* [build_local] Add argument to set `--color` option for Bazel (EnzymeAD/Reactant.jl#776)
* feat: add `Ops.sharding_constraint` (EnzymeAD/Reactant.jl#798)
* [Compiler] Make `raise` a keyword argument (EnzymeAD/Reactant.jl#797)
* `make-bindings.jl` to generate MemRef dialect