I was wondering if there is a straightforward way to convert a sharded checkpoint into a monolithic one, for a subsequent conversion to HF format (rather than a direct sharded -> HF conversion).
I've read that you can define a monolithic checkpoint-saving callback, but I would prefer an "off-training" approach that simply reads the sharded checkpoint and writes it back out in the new format.
Thanks in advance for any answers.
Unfortunately, we have not written a straightforward script for this. As a workaround, you can either launch training for one batch with a very small learning rate and the HF checkpointer callback enabled, or modify train.py to call the callback's save-checkpoint function directly without actually training.
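If an offline route is still preferable, here is a minimal sketch of one possible approach. It assumes the sharded checkpoint directory was written with PyTorch's Distributed Checkpoint (DCP) format (which I believe recent Composer versions use for sharded saves); the paths are hypothetical and this is not an official conversion script:

```python
# Sketch: consolidate a DCP-format sharded checkpoint into a single
# torch.save file, which can then be fed to a monolithic -> HF converter.
# Assumption: the sharded directory contains DCP shard files (e.g. *.distcp).
from torch.distributed.checkpoint.format_utils import dcp_to_torch_save

SHARDED_DIR = "save_folder/ep0-ba1000"      # hypothetical path to the sharded checkpoint dir
MONOLITHIC_PT = "monolithic_checkpoint.pt"  # hypothetical output path

# Merge all shards and write a single monolithic state dict to disk.
dcp_to_torch_save(SHARDED_DIR, MONOLITHIC_PT)
```

Recent PyTorch versions also expose this as a CLI, something along the lines of `python -m torch.distributed.checkpoint.format_utils dcp_to_torch <sharded_dir> <output_file>`. If your sharded checkpoint is not in DCP format, this sketch will not apply and the callback-based workaround above is the safer path.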