v0.4.0rc5
Pre-releaseWhat's new
Added 🎉
- Added
TorchEvalStep
to torch integration, registered as "torch::eval".
Changed ⚠️
- Renamed
aggregate_val_metric
toauto_aggregate_val_metric
inTorchTrainStep
. devices
parameter toTorchTrainStep
replaced withdevice_count: int
.- Run name printed at the end of a run so it's easier to find.
- Type information added to package data. See PEP 561 for more information.
- A new integration,
transformers
, with two new steps for running seq2seq models. - Added
logging_tqdm
, if you don't want a progress bar, but you still want to see progress in the logs. - Added
threaded_generator()
, for wrapping generators so that they run in a separate thread from the generator's consumer. - Added a new example for evaluating the T0 model on XSum, a summarization task.
- Added
MappedSequence
for functionally wrapping sequences. - Added
TextFormat
, in case you want to store the output of your steps in raw text instead of JSON. - Steps can now list arguments in
SKIP_ID_ARGUMENTS
to indicate that the argument should not affect a step's
unique id. This is useful for arguments that affect the execution of a step, but not the output. Step
now implements__str__
, so steps look pretty in the debugger.- Added
DatasetCombineStep
, a step that combines multiple datasets into one. - Added
common.logging.initialize_worker_logging()
function for configuring logging from worker processes/threads. - Logs from
tango run ...
will be written to a file calledout.log
in the run directory.
Fixed ✅
- Fixed torch
StopEarlyCallback
state not being recovered properly on restarts. - Fixed file friendly logging by removing special styling characters.
- Ensured exceptions captured in logs.
LocalWorkspace
now works properly with uncacheable steps.- When a Tango run got killed hard, with
kill -9
, or because the machine lost power,LocalWorkspace
would
sometimes keep a step marked as "running", preventing further executions. This still happens sometimes, but it
is now much less likely (and Tango gives you instructions for how to fix it). - To make all this happen,
LocalWorkspace
now saves step info in a Sqlite database. Unfortunately that means that
the workspace format changes and existing workspace directories won't work properly with it. - Fixed premature cleanup of temporary directories when using
MemoryWorkspace
Commits
df301ef Merge pull request #119 from allenai/RunGeneration
42535c9 Add TorchEvalStep to torch integration, use "device_count" in TorchTrainStep instead of "devices" (#120)
ecc6087 Store log output to a file in run directory, other logging improvements (#132)
9008255 Merge pull request #141 from allenai/dependabot/pip/sphinx-4.4.0
8e09b66 Merge pull request #139 from allenai/remove-no-logging
26318b2 Merge pull request #133 from allenai/dependabot/pip/mypy-0.931
7c91c4b Merge pull request #126 from allenai/dependabot/pip/sphinx-4.3.2
a178076 Merge pull request #128 from allenai/dependabot/pip/mypy-0.930
106a8cf Merge pull request #130 from allenai/dependabot/pip/furo-2022.1.2
6db7b4c Merge pull request #129 from allenai/run-name-at-end
b93d22d Add typing info to package (PEP 561) (#131)
e8867ae fix StopEarlyCallback state recovery