DDPO checkpoint ú· #2505
Labels
🐛 bug
Something isn't working
🏋 DPPO
Related to DDPO
🙋 help from community wanted
Open invitation for community members to contribute
⏳ needs more info
Additional information or clarification is required to proceed
System Info
Colab Pro usage
Information
Tasks
examples
folderReproduction
I trained a model with 500 epochs with DDPO and pushed to huggingface completely, I save some checkpoints, and save locally too.
However, when I wanted to train additional 100 epochs to that saved model, I could not do in several ways.
First, I using the following code, but impossible because of the wrapper error (quite like when we infer direcly).
pipeline = DefaultDDPOStableDiffusionPipeline(
"my-finetuned-model",
)
Then, I used load_lora_weights, I can train without any changes, the trained model was like base model.
Finally, I using checkpoint, I can run without any changes.
Please give me some advice about using checkpoint or saved models.
Thanks.
Expected behavior
Please give me some advice about using checkpoint or saved models to finetune some more epochs
Checklist
The text was updated successfully, but these errors were encountered: