Add Optimizer Accumulation #3

Merged
warner-benjamin merged 5 commits into main from opt_accum on Mar 11, 2024

Conversation

warner-benjamin
Owner

Optimizer accumulation allows gradient release to approximate gradient accumulation by accumulating gradients into the optimizer states.
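For intuition, here is a minimal PyTorch-style sketch of the idea, not optimi's actual implementation: each micro-batch's gradient is folded directly into Adam-style moment buffers and the gradient buffer is freed, so no separate gradient accumulation buffer is kept.

```python
import torch

def accumulate_into_optimizer_state(param, exp_avg, exp_avg_sq,
                                    beta1=0.9, beta2=0.999):
    """Conceptual illustration: fold one micro-batch gradient into
    Adam-style moment buffers instead of a gradient accumulation buffer."""
    grad = param.grad
    # Accumulate this micro-batch's gradient into the first and second moments.
    # Applying the EMA per micro-batch is why this approximates, rather than
    # exactly matches, standard gradient accumulation.
    exp_avg.mul_(beta1).add_(grad, alpha=1 - beta1)
    exp_avg_sq.mul_(beta2).addcmul_(grad, grad, value=1 - beta2)
    # The gradient buffer can be released immediately after accumulation.
    param.grad = None
```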

Optimizer accumulation was proposed by Zhang et al. in AdamAccumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training.

optimi’s implementation enables AdamAccumulation for all optimi optimizers.
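A hedged sketch of what a training loop might look like with this feature; the `gradient_release` flag, `prepare_for_gradient_release`, and the `optimizer_accumulation` attribute are assumptions about optimi's interface based on its gradient release feature, so check the docs for the exact API:

```python
import torch
import torch.nn.functional as F
from optimi import AdamW, prepare_for_gradient_release  # assumed import path

model = torch.nn.Linear(128, 128)
# gradient_release=True is assumed to enable per-parameter optimizer steps
# during the backward pass.
optimizer = AdamW(model.parameters(), lr=1e-3, gradient_release=True)
prepare_for_gradient_release(model, optimizer)

# Toy data standing in for a real dataloader.
dataloader = [(torch.randn(8, 128), torch.randn(8, 128)) for _ in range(8)]
accumulation_steps = 4

for step, (x, y) in enumerate(dataloader):
    # Accumulate into optimizer states on all but the last micro-batch of
    # each accumulation window (assumed attribute on optimi optimizers).
    optimizer.optimizer_accumulation = (step + 1) % accumulation_steps != 0
    loss = F.mse_loss(model(x), y)
    # With gradient release, gradients are either accumulated into optimizer
    # state or applied as parameter updates layer by layer during backward.
    loss.backward()
```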

@warner-benjamin warner-benjamin merged commit 0bec5ca into main Mar 11, 2024
4 checks passed
@warner-benjamin warner-benjamin deleted the opt_accum branch March 11, 2024 04:09