Release v0.4.0rc3 · allenai/tango

What's new

Added 🎉

Added DatasetsFormat format and LoadStreamingDataset step to datasets integration.
Added SqliteDictFormat for datasets.
Added pre_epoch() and post_epoch() callback methods to PyTorch TrainCallback.
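
As a rough illustration of where per-epoch hooks like these typically fire, here is a minimal, self-contained sketch. It does not import tango: RecordingCallback and run_training are hypothetical stand-ins, and the (step, epoch) arguments are an assumption, so check the TrainCallback API docs for the real signatures.

```python
from typing import List


class RecordingCallback:
    """Hypothetical stand-in for a train callback; not the real TrainCallback."""

    def __init__(self) -> None:
        self.events: List[str] = []

    def pre_epoch(self, step: int, epoch: int) -> None:
        # Called once before the first batch of each epoch.
        self.events.append(f"pre_epoch(epoch={epoch}, step={step})")

    def post_epoch(self, step: int, epoch: int) -> None:
        # Called once after the last batch of each epoch.
        self.events.append(f"post_epoch(epoch={epoch}, step={step})")


def run_training(callback: RecordingCallback, num_epochs: int, steps_per_epoch: int) -> int:
    """Toy training loop that only counts steps and fires the epoch hooks."""
    step = 0
    for epoch in range(num_epochs):
        callback.pre_epoch(step, epoch)
        for _ in range(steps_per_epoch):
            step += 1  # a real loop would run a forward/backward pass here
        callback.post_epoch(step, epoch)
    return step


callback = RecordingCallback()
total_steps = run_training(callback, num_epochs=2, steps_per_epoch=3)
for event in callback.events:
    print(event)
```

Hooks of this kind are a natural place for per-epoch work such as resetting running metrics or logging a summary.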

Changed ⚠️

The LoadDataset step from the datasets integration is now cacheable, using the DatasetsFormat format by default.
This only works with non-streaming datasets. For streaming datasets, use the LoadStreamingDataset step instead.
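
The rule above can be summarized as a small decision helper. This is only a sketch: pick_load_step is a hypothetical function, not part of tango, and treating LoadStreamingDataset as non-cacheable is an assumption inferred from the note above rather than something stated outright.

```python
def pick_load_step(streaming: bool) -> dict:
    """Return which step these notes recommend for a given loading mode.

    Hypothetical helper for illustration only; the step names are the class
    names mentioned in these release notes.
    """
    if streaming:
        # Assumption: streaming datasets are not cached.
        return {"step": "LoadStreamingDataset", "cacheable": False}
    # Non-streaming datasets are cached using DatasetsFormat by default.
    return {"step": "LoadDataset", "cacheable": True, "format": "DatasetsFormat"}


regular = pick_load_step(streaming=False)
streamed = pick_load_step(streaming=True)
print(regular["step"], streamed["step"])
```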

Fixed ✅

Fixed bug where KeyboardInterrupt exceptions were not handled properly by steps and workspaces.
WandbTrainCallback will now use part of the step's unique ID as the name for the W&B run by default, to make it easier to identify which tango step corresponds to each run in W&B.
WandbTrainCallback will save the entire TrainConfig object to the W&B config.

Commits

20138ce improve release notes generation script
760b4f2 Add DatasetsFormat, making LoadDataset cacheable, fix bug with KeyboardInterrupt (#114)
e51691f Improvements to W&B callback (#115)
d044f6e Add pre/post epoch callbacks (#113)
ae1ae0b Bump myst-parser from 0.15.2 to 0.16.0 (#111)
c605a1e Merge pull request #90 from allenai/SqliteDictFormat
