🏎 Fix deepspeed preparation of ref_model
in OnlineDPOTrainer
(#2417)
#432
Loading
ref_model
in OnlineDPOTrainer
(#2417)
#432