v0.4.0rc3
Pre-release
github-actions released this 16 Dec 01:53
What's new
Added 🎉
- Added DatasetsFormat format and LoadStreamingDataset step to datasets integration.
- SqliteDictFormat for datasets.
- Added pre_epoch() and post_epoch() callback methods to PyTorch TrainCallback.
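The new epoch hooks can be pictured with a minimal, self-contained sketch. It does not import tango: EpochCallback and run_training are hypothetical stand-ins for TrainCallback and the training loop, and the (step, epoch) argument order is an assumption rather than a confirmed signature.

```python
from typing import List


class EpochCallback:
    """Hypothetical stand-in for tango's TrainCallback (not the real class)."""

    def pre_epoch(self, step: int, epoch: int) -> None:
        pass

    def post_epoch(self, step: int, epoch: int) -> None:
        pass


class EpochLogger(EpochCallback):
    """Records when each epoch starts and ends."""

    def __init__(self) -> None:
        self.events: List[str] = []

    def pre_epoch(self, step: int, epoch: int) -> None:
        self.events.append(f"start epoch {epoch} at step {step}")

    def post_epoch(self, step: int, epoch: int) -> None:
        self.events.append(f"end epoch {epoch} at step {step}")


def run_training(callback: EpochCallback, epochs: int, steps_per_epoch: int) -> None:
    """Toy loop showing where the hooks would fire (assumed ordering)."""
    step = 0
    for epoch in range(epochs):
        callback.pre_epoch(step, epoch)
        for _ in range(steps_per_epoch):
            step += 1  # a real trainer would run one optimization step here
        callback.post_epoch(step, epoch)


logger = EpochLogger()
run_training(logger, epochs=2, steps_per_epoch=3)
```

In real use the subclass would extend the library's own callback class; the stand-in only illustrates that each hook runs once per epoch, before and after the inner step loop.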
Changed ⚠️
- LoadDataset step from datasets integration is now cacheable, using the DatasetsFormat format by default.
  But this only works with non-streaming datasets. For streaming datasets, you should use the LoadStreamingDataset step instead.
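A rough sketch of choosing between the two steps when building a step config follows. The helper dataset_step_config is hypothetical and not part of tango; the registered names "datasets::load" and "datasets::load_streaming" and the path parameter are assumptions, and "imdb" is only an illustrative dataset name.

```python
from typing import Any, Dict


def dataset_step_config(path: str, streaming: bool = False) -> Dict[str, Any]:
    """Build a step config dict (hypothetical helper, not part of tango)."""
    # Assumed registered names for LoadDataset and LoadStreamingDataset.
    step_type = "datasets::load_streaming" if streaming else "datasets::load"
    return {"type": step_type, "path": path}


# Non-streaming: the cacheable LoadDataset step.
cached = dataset_step_config("imdb")

# Streaming: the LoadStreamingDataset step, which the note above says to use instead.
streamed = dataset_step_config("imdb", streaming=True)
```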
Fixed ✅
- Fixed bug where KeyboardInterrupt exceptions were not handled properly by steps and workspaces.
- WandbTrainCallback now uses part of the step's unique ID as the name for the W&B run by default, to make it easier to identify which tango step corresponds to each run in W&B.
- WandbTrainCallback will save the entire TrainConfig object to the W&B config.
Commits
20138ce improve release notes generation script
760b4f2 Add DatasetsFormat, making LoadDataset cacheable, fix bug with KeyboardInterrupt (#114)
e51691f Improvements to W&B callback (#115)
d044f6e Add pre/post epoch callbacks (#113)
ae1ae0b Bump myst-parser from 0.15.2 to 0.16.0 (#111)
c605a1e Merge pull request #90 from allenai/SqliteDictFormat