You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
trl uses accelerate which supports FSDP. However there is no recommeded config of FSDP in the repo unlike DeepSpeed, so you could refer to this page for FSDP. All in all, DPO and trl supports FSDP but not for online algo like PPO #1726.
trl uses accelerate which supports FSDP. However there is no recommeded config of FSDP in the repo unlike DeepSpeed, so you could refer to this page for FSDP. All in all, DPO and trl supports FSDP but not for online algo like PPO #1726.
Hello, I'm wondering if the DPO trainer supports FSDP and, if so, how I can use it. I look forward to your reply.
The text was updated successfully, but these errors were encountered: