Skip to content

Conversation

@glou-nes
Copy link
Owner

@glou-nes glou-nes commented Mar 7, 2025

giordano and others added 30 commits January 17, 2025 16:17
Use `[sources]` section of `docs/Project.toml` instead of messing with
`LOAD_PATH`.
* fix: inconsistent return dims

* test: inconsistent indexing

* fix: inconsistent dimensions inside gather getindex
* fix: define getindexing into sub reshaped array

* test: failure case
Co-authored-by: enzyme-ci-bot[bot] <78882869+enzyme-ci-bot[bot]@users.noreply.github.com>
Co-authored-by: enzyme-ci-bot[bot] <78882869+enzyme-ci-bot[bot]@users.noreply.github.com>
…nzymeAD#560)

* [CI] Format generated files twice to work around JuliaFormatter bug

* [CI] Don't run benchmarks on PRs from forks
* CUDA: fix nv intrinsic errs

* Update ext/ReactantCUDAExt.jl

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

---------

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Restrict running different workflows based on the touched files.  This should
_slightly_ reduce CI workload.
* respect scopping rules in for

* `@isdefined`

* Update ReactantCore.jl

* fix
Co-authored-by: enzyme-ci-bot[bot] <78882869+enzyme-ci-bot[bot]@users.noreply.github.com>
* feat: support arbitrary structures in while loop

* test: add nested structure test

* feat: if conditions can now handle arbitrary strucutres

* fix: mutation support inside if condition

* fix: restore while support

* Update src/TracedUtils.jl
* profiler: Add option to generate perfetto url

* remove sort

* Update api.md
* More jll/cuda stuff

* Apply suggestions from code review

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

* force xla

* add set arg attr

* Update WORKSPACE

* fix support

* Now with noalias

* Update ext/ReactantCUDAExt.jl

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

* Update Project.toml

* we are now sane

* now with alignment

* Update src/Profiler.jl

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

---------

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: enzyme-ci-bot[bot] <78882869+enzyme-ci-bot[bot]@users.noreply.github.com>
* Allocator stats docs

* Add API to create profiler annotations

* profiling tutorial

* sig cleanup

* vitepress config

* vitepress config 2

* note about allocatorstats
* PTX fma and other flags

* we can keep cuda debug info now, that's cool

* version bump

* Update XLA.jl

* Update src/XLA.jl

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

---------

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
* [CI] Move tests on aarch64 linux to GitHub Actions

* Fix name of aarch64-linux runners

Co-authored-by: Ian McInerney <mcianster@gmail.com>

* Skip CUDA integration tests that could fail

---------

Co-authored-by: Ian McInerney <mcianster@gmail.com>
* Fix condition to skip CUDA tests on aarch64

* Skip more tests
* feat: expose more XLA GPU options to the user

* feat: check for bazel install

* fix: load env vars before

* Update deps/build_local.jl

Co-authored-by: Mosè Giordano <765740+giordano@users.noreply.github.com>

* Update Project.toml

---------

Co-authored-by: Mosè Giordano <765740+giordano@users.noreply.github.com>
Co-authored-by: William Moses <gh@wsmoses.com>
avik-pal and others added 27 commits February 28, 2025 23:53
Co-authored-by: enzyme-ci-bot[bot] <78882869+enzyme-ci-bot[bot]@users.noreply.github.com>
…zymeAD#825)

* feat: support implicit padding from XLA

* feat: use XLA for shard-info if we need padding

* test: padding for sharding

* fix: return type
Co-authored-by: enzyme-ci-bot[bot] <78882869+enzyme-ci-bot[bot]@users.noreply.github.com>
* Fix LLVM dialect registration

* Add functionality for parsing single operations

* Bump Enzyme-JAX commit
macOS runners are occasionally hanging, having a shorter timeout helps by making
the job fail earlier than waiting uselessly for 90 minutes, saving some time,
especially important since there are only 5 concurrent macOS runners across the
entire organisation.
* docs: housekeeping + memref dialect

* docs: autoadd dialects
@glou-nes glou-nes closed this Mar 7, 2025
@glou-nes glou-nes deleted the thunk branch March 7, 2025 15:51
@glou-nes glou-nes restored the thunk branch March 7, 2025 15:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.