in peft finetune, only the trainable parameters need to be saved #27825

sywangyi · 2023-12-04T08:36:57Z

to reduce the storage size and also save the time of checkpoint saving while using deepspeed for training

What does this PR do?

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

sywangyi · 2023-12-04T08:38:01Z

@younesbelkada @pacman100 please help review the PR.

younesbelkada

LGTM, what do you think @pacman100 ?

younesbelkada · 2023-12-04T11:11:55Z

@sywangyi can you try to merge your branch with upstream main? Perhaps it will fix the current failing CI

sywangyi · 2023-12-05T00:39:14Z

@younesbelkada I have rebased to main. the code quality check failure is not related with the PR

younesbelkada · 2023-12-05T10:28:49Z

@sywangyi can you try to re-run make fixup with pip uninstall black && pip install -U ruff==0.1.5 ?

younesbelkada

In principle this looks good! Would like to hear @pacman100 's thoughts before a core maintainer approval

HuggingFaceDocBuilderDev · 2023-12-05T16:26:57Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sywangyi · 2023-12-15T07:26:37Z

@pacman100 could you help review the PR? Thanks.

pacman100

Thank you @sywangyi for enabling only the trainable parameters to be saved during intermediate checkpointing when using PEFT with DeepSpeed, LGTM! 🚀

src/transformers/trainer.py

amyeroberts

Thanks for enabling this!

to reduce the storage size and also save the time of checkpoint saving while using deepspeed for training Signed-off-by: Wang, Yi <[email protected]>

…gingface#27825) to reduce the storage size and also save the time of checkpoint saving while using deepspeed for training Signed-off-by: Wang, Yi <[email protected]>

ArthurZucker requested review from pacman100 and younesbelkada December 4, 2023 10:33

younesbelkada approved these changes Dec 4, 2023

View reviewed changes

younesbelkada mentioned this pull request Dec 4, 2023

How to resume training from a checkpoint when training LoRA using deepspeed？ #26665

Closed

4 tasks

sywangyi force-pushed the peft_deepspeed branch from 04c92a2 to 1c782b5 Compare December 5, 2023 00:25

sywangyi mentioned this pull request Dec 5, 2023

In peft, only the trainable parameters need to be saved huggingface/optimum-habana#576

Merged

3 tasks

sywangyi force-pushed the peft_deepspeed branch from 1c782b5 to 4a23a73 Compare December 5, 2023 14:23

younesbelkada approved these changes Dec 5, 2023

View reviewed changes

pacman100 approved these changes Dec 15, 2023

View reviewed changes

amyeroberts reviewed Dec 15, 2023

View reviewed changes

src/transformers/trainer.py Outdated Show resolved Hide resolved

amyeroberts approved these changes Dec 15, 2023

View reviewed changes

sywangyi force-pushed the peft_deepspeed branch from 4a23a73 to bdfc628 Compare December 16, 2023 12:12

in peft finetune, only the trainable parameters need to be saved

bdfc628

to reduce the storage size and also save the time of checkpoint saving while using deepspeed for training Signed-off-by: Wang, Yi <[email protected]>

amyeroberts merged commit e6cb8e0 into huggingface:main Dec 18, 2023
21 checks passed

amyeroberts mentioned this pull request Dec 19, 2023

Skip saving frozen parameters if using peft model with deepspeed #26503

Closed

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

in peft finetune, only the trainable parameters need to be saved #27825

in peft finetune, only the trainable parameters need to be saved #27825

sywangyi commented Dec 4, 2023

sywangyi commented Dec 4, 2023

younesbelkada left a comment

younesbelkada commented Dec 4, 2023

sywangyi commented Dec 5, 2023

younesbelkada commented Dec 5, 2023

younesbelkada left a comment

HuggingFaceDocBuilderDev commented Dec 5, 2023

sywangyi commented Dec 15, 2023

pacman100 left a comment

amyeroberts left a comment

in peft finetune, only the trainable parameters need to be saved #27825

in peft finetune, only the trainable parameters need to be saved #27825

Conversation

sywangyi commented Dec 4, 2023

What does this PR do?

Before submitting

Who can review?

sywangyi commented Dec 4, 2023

younesbelkada left a comment

Choose a reason for hiding this comment

younesbelkada commented Dec 4, 2023

sywangyi commented Dec 5, 2023

younesbelkada commented Dec 5, 2023

younesbelkada left a comment

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Dec 5, 2023

sywangyi commented Dec 15, 2023

pacman100 left a comment

Choose a reason for hiding this comment

amyeroberts left a comment

Choose a reason for hiding this comment