Free outputs callback #2598

mvpatel2000 · 2023-10-02T21:54:33Z

What does this PR do?

Adds a callback that frees outputs to save memory. When train_metrics are not used, self.state.outputs is not needed, yet it can take up a non-trivial amount of memory (seq_length*vocab_size*microbatch_size*bytes_per_param). For certain long-sequence models this memory starts to matter (~1-2GB), so having an option to free it is useful.
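For a rough sense of scale, the size formula above can be evaluated directly. This is a minimal sketch, not code from the PR: the helper name `output_bytes` and the shape values (sequence length 8192, vocabulary 50432, microbatch 2, 2 bytes per element as in bf16/fp16) are illustrative assumptions. The `bytes_per_element` argument plays the role of `bytes_per_param` in the formula.

```python
def output_bytes(seq_length: int, vocab_size: int, microbatch_size: int,
                 bytes_per_element: int) -> int:
    """Bytes held by one logits tensor of shape
    (microbatch_size, seq_length, vocab_size)."""
    return seq_length * vocab_size * microbatch_size * bytes_per_element


# Illustrative values only (assumptions, not measurements from this PR).
n_bytes = output_bytes(seq_length=8192, vocab_size=50432,
                       microbatch_size=2, bytes_per_element=2)
gib = n_bytes / 2**30
print(f"{n_bytes} bytes = {gib:.2f} GiB")  # 1652555776 bytes = 1.54 GiB
```

Under these assumed shapes, the outputs alone fall in the ~1-2GB range mentioned above.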

Existing tests (e.g. TestCallbackTrains) should be sufficient for this PR.

What issue(s) does this change relate to?

GRT-2464

b-chu

Approving, but please wait for someone more familiar with callbacks to approve as well.

tests/callbacks/callback_settings.py

Skylion007 · 2023-10-03T15:26:42Z

Nit: isn't "free" a bit of an overloaded term in ML, or in general? What about "releaseOutputs"? That's more C++ pointer terminology, but it's a bit more exact.

j316chuck

LGTM, added some nits.

composer/callbacks/free_outputs.py

composer/callbacks/generate.py

Co-authored-by: Charles Tang <[email protected]>

mvpatel2000 added 4 commits October 2, 2023 15:11

free train metrics

77529b8

lint

28e11a7

int

afb4949

rename

a8a118e

mvpatel2000 requested a review from dakinggg October 2, 2023 21:54

mvpatel2000 added 3 commits October 2, 2023 17:55

add callback

8e1b9e9

Merge branch 'dev' into mvpatel2000/free-train-metrics

951ff8c

import

9da0107

mvpatel2000 requested a review from eracah October 2, 2023 22:13

mvpatel2000 added 2 commits October 3, 2023 10:35

wrap

4f13fe1

fix more tests

ce6a0ce

mvpatel2000 requested review from j316chuck and b-chu October 3, 2023 15:17

b-chu approved these changes Oct 3, 2023

tests/callbacks/callback_settings.py (resolved)

j316chuck reviewed Oct 3, 2023

composer/callbacks/free_outputs.py (outdated, resolved)

composer/callbacks/free_outputs.py (resolved)

composer/callbacks/generate.py (resolved)

Update composer/callbacks/free_outputs.py

49831f3

Co-authored-by: Charles Tang <[email protected]>

mvpatel2000 merged commit 9c0ba84 into mosaicml:dev Oct 3, 2023
17 checks passed

mvpatel2000 deleted the mvpatel2000/free-train-metrics branch October 3, 2023 15:55
