[Question] Does the DPO trainer support FSDP? #2500

yingtongxiong · 2024-12-18T06:18:15Z

Hello, I'm wondering if the DPO trainer supports FSDP and, if so, how I can use it. I look forward to your reply.

asparius · 2024-12-18T13:50:40Z

trl uses accelerate, which supports FSDP. However, unlike for DeepSpeed, there is no recommended FSDP config in the repo, so you could refer to this page for FSDP. All in all, DPO and trl support FSDP, but not for online algorithms like PPO (#1726).
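
Since the repo ships no recommended FSDP config, here is a minimal sketch of what one might look like, written as a small Python script that renders an accelerate config file as JSON. The key names are an assumption based on what `accelerate config` typically generates for FSDP and may differ between accelerate versions. The values (8 processes, bf16, full sharding) and the helper names `build_fsdp_config` and `render_config` are illustrative, not something the trl repo provides.

```python
import json


def build_fsdp_config(num_processes: int = 8, mixed_precision: str = "bf16") -> dict:
    """Build an accelerate-style config dict for single-node FSDP training.

    Assumption: key names mirror the output of `accelerate config`;
    check them against your installed accelerate version.
    """
    return {
        "compute_environment": "LOCAL_MACHINE",
        "distributed_type": "FSDP",
        "mixed_precision": mixed_precision,
        "num_machines": 1,
        "machine_rank": 0,
        "num_processes": num_processes,  # usually one process per GPU
        "main_training_function": "main",
        "use_cpu": False,
        "fsdp_config": {
            # Shard parameters, gradients, and optimizer state across ranks.
            "fsdp_sharding_strategy": "FULL_SHARD",
            # Wrap each transformer block as its own FSDP unit.
            "fsdp_auto_wrap_policy": "TRANSFORMER_BASED_WRAP",
            "fsdp_backward_prefetch": "BACKWARD_PRE",
            "fsdp_offload_params": False,
            "fsdp_state_dict_type": "SHARDED_STATE_DICT",
            "fsdp_sync_module_states": True,
            "fsdp_use_orig_params": True,
        },
    }


def render_config(config: dict) -> str:
    """Serialize the config dict to a JSON string."""
    return json.dumps(config, indent=2, sort_keys=True)


if __name__ == "__main__":
    # Hypothetical output file name; any path ending in .json works the same way.
    with open("fsdp_config.json", "w", encoding="utf-8") as f:
        f.write(render_config(build_fsdp_config()))
```

The resulting file could then be passed to the launcher, for example `accelerate launch --config_file fsdp_config.json train_dpo.py`, where `train_dpo.py` stands in for whatever script constructs your DPO trainer.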

yingtongxiong · 2024-12-19T05:57:01Z

@asparius Thank you very much
