Conversion Sharded -> Monolithic checkpoint #1220

Open
pretidav opened this issue May 17, 2024 · 1 comment
Labels
question Further information is requested

Comments

@pretidav

I was wondering if there is a straightforward way to convert a sharded checkpoint to a monolithic one, for a subsequent conversion to HF format (that is, not a direct sharded -> HF conversion).
I've read that you can define a monolithic checkpoint saver callback, but I would like an "off-training" way that simply reads the checkpoint and writes it in the new format.

Thanks for all the answers.

pretidav added the question label May 17, 2024
@dakinggg
Collaborator

dakinggg commented May 17, 2024

Unfortunately we have not written a straightforward script for this. As a workaround, you can either launch training for 1 batch with the HF checkpointer callback enabled and a very small learning rate, or modify train.py so that it skips training and calls the callback's save-checkpoint function directly.
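
A rough sketch of the first workaround, written as a small Python helper that assembles config overrides for a one-batch run. The key names (`load_path`, `max_duration`, `optimizer.lr`, `callbacks.hf_checkpointer.save_folder`, `callbacks.hf_checkpointer.save_interval`) are assumptions about the training YAML layout and should be checked against your own config; the helper functions and the paths in the usage note are hypothetical.

```python
def one_batch_export_overrides(sharded_checkpoint_path, export_folder, lr=1e-12):
    """Build overrides for a run whose only purpose is to trigger the HF checkpointer.

    All key names are assumptions about the training config layout.
    """
    return {
        # Resume from the existing sharded checkpoint.
        "load_path": sharded_checkpoint_path,
        # Train for a single batch only.
        "max_duration": "1ba",
        # A very small learning rate so the weights barely change.
        "optimizer.lr": lr,
        # Have the HF checkpointer write its output after that one batch.
        "callbacks.hf_checkpointer.save_folder": export_folder,
        "callbacks.hf_checkpointer.save_interval": "1ba",
    }


def to_cli_args(overrides):
    """Render overrides as key=value strings to append to the training command."""
    return [f"{key}={value}" for key, value in overrides.items()]
```

For example, `to_cli_args(one_batch_export_overrides("path/to/sharded/checkpoint", "path/to/export"))` gives a list of `key=value` strings that could be appended to the usual train.py invocation, assuming your setup accepts overrides in that form. Note that one optimizer step is still taken, so the exported weights are not guaranteed to be bit-identical to the checkpoint.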


2 participants