v0.0.7
What's Changed
- Benchmark llama by @wsmoses in #23
- Update README.md by @wsmoses in #26
- Rebase to later llvm/xla by @wsmoses in #25
- Add support for use of xla runtime by @wsmoses in #27
- [JAX] Remove uses of dialect="mhlo" from the JAX compiler_ir() function by @wsmoses in #28
- Bump XLA by @wsmoses in #29
- Custom MLIR lowering pipeline by @wsmoses in #30
- Add missing ')' to enzyme_call and add tests for old pipeline by @itf in #31
- Make enzyme_ref delete in_shapes instead of out_shapes by @itf in #32
- Pipelinemod by @wsmoses in #33
- reuse Common.td from Enzyme by @ftynse in #34
- Fix segfault by @wsmoses in #35
- Jax reverse more by @wsmoses in #36
- MLIR Reverse Mode by @wsmoses in #37
- Update xla by @wsmoses in #38
- Fix unused warning by @wsmoses in #39
- XLA: vendor the runtime mlir backend by @wsmoses in #40
- Handle inactive args from context by @wsmoses in #41
- Fix shape error by @wsmoses in #42
- generalize mlir zeroing by @wsmoses in #43
- Generalize mktup by @wsmoses in #44
- Cleanup warnings by @wsmoses in #45
- bugfix else by @wsmoses in #46
- Reduction optimization by @wsmoses in #47
- More optimization fixes by @wsmoses in #48
- Add pad folding optimization by @wsmoses in #49
- Add unrolling by @wsmoses in #50
- Cleanup bazel files by @wsmoses in #51
- transpose opts by @ftynse in #52
- Add missing dependencies by @ivanradanov in #53
- Fix 0 dim reshape concat case by @ivanradanov in #55
- Handle full reduce of reshape by @wsmoses in #56
- simply mul-of-pad by @ftynse in #57
- Generalize dot general pad by @wsmoses in #59
- Add gradient of pad operation by @wsmoses in #58
- slice(reshape) -> reshape(slice) by @ftynse in #60
- Fix likely index bug in reshape helper by @wsmoses in #61
- Add chlo dialect by @wsmoses in #63
- Add .clang-format by @ftynse in #64
- Fix and add test for slice of pad by @wsmoses in #65
- propagate location information by @ftynse in #66
- simplify pad(pad) by @ftynse in #67
- transform dialect for pattern combination by @ftynse in #62
- Add slice of dot general by @wsmoses in #70
- Update XLA/LLVM by @wsmoses in #72
- Bitflag mismatch size in transform[er] dialect by @wsmoses in #73
- Try to fix macos by @wsmoses in #69
- Transform dialect ops for all patterns by @ftynse in #71
- Actuall take control over llvm build flags by @wsmoses in #75
- Just use c api for pass pipeline by @wsmoses in #76
- Use capsule by @wsmoses in #77
- Bump jax commit by @wsmoses in #78
- Bump enzyme commit by @wsmoses in #79
- Bump enzyme commit by @wsmoses in #80
- Fix up rebase by @wsmoses in #81
- Negate of int by @wsmoses in #86
- Add all StableHLO ops from spec by @mofeing in #87
- Refactor diff rules to prepare for more rules by @mofeing in #89
- Add
stablehlo.einsum
,stablehlo.unary_einsum
diffrules by @mofeing in #83 - Fix for lowering change by @wsmoses in #92
- Fix grad sum by @wsmoses in #93
- Add complex support by @mofeing in #94
- Fix
complex
dialect registration by @mofeing in #95 - Update Enzyme commit by @mofeing in #96
- Add a bunch of rules for scalar and non-differentiable functions by @mofeing in #90
- Fix depency import of "ChloOps.h" by @mofeing in #97
- Fix broadcast derivative by @wsmoses in #98
- bump xla by @wsmoses in #99
- Add compile to llvm backend by @wsmoses in #100
- Bump Enzyme commit by @wsmoses in #101
- Bump JaX commit by @wsmoses in #102
- Mark stablehlo.compare as inactive by @Pangoraw in #106
- sink transposes in einsum by @Pangoraw in #105
- optimize convolution of transpose by @Pangoraw in #103
- Add batching of constants by @wsmoses in #107
- Attempt linker fix by @wsmoses in #108
- More linker fix by @wsmoses in #109
- Llama 3 gap size by @wsmoses in #110
- Bump xla and fix ci by @wsmoses in #111
- Fix CI by @wsmoses in #112
- Fix signature by @wsmoses in #113
New Contributors
- @itf made their first contribution in #31
- @ivanradanov made their first contribution in #53
- @mofeing made their first contribution in #87
- @Pangoraw made their first contribution in #106
Full Changelog: v0.0.6...v0.0.7