Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

KTO refactor
#2507 opened Dec 20, 2024 by qgallouedec Draft
5 tasks
[WIP] Integrate Liger CPO & SimPO
#2506 opened Dec 20, 2024 by Mecoli1219 Draft
1 of 6 tasks
🚜 Use field in dataclasses
#2494 opened Dec 17, 2024 by qgallouedec Loading…
adding readme for datasets
#2491 opened Dec 16, 2024 by August-murr Draft
5 tasks
[Liger] add native liger-kernel orpo loss
#2482 opened Dec 15, 2024 by kashif Loading…
Tool fine-tuning support DPO
#2479 opened Dec 14, 2024 by August-murr Draft
2 of 5 tasks
Allow eval in Online DPO
#2476 opened Dec 13, 2024 by qgallouedec Draft
5 tasks
2
1
dpo_trainer gather metrics across ranks before logging
#2474 opened Dec 13, 2024 by zhc7 Loading…
2 of 5 tasks
Add length-normalized DPO
#2458 opened Dec 10, 2024 by hugoabonizio Loading…
1 of 5 tasks
Padding free dpo
#2437 opened Dec 4, 2024 by dame-cell Loading…
2 of 5 tasks
Add "Language Modeling to Unpaired Preference" in Utilities for converting dataset types 😴 stale No update from the author, will be closed soon
#2436 opened Dec 4, 2024 by AMindToThink Loading…
[Reward] initial CLoud Reward trainer
#2432 opened Dec 3, 2024 by kashif Loading…
added eos token for ppotrainer
#2420 opened Nov 30, 2024 by dame-cell Loading…
3 of 5 tasks
🔬 SFT simplification
#2405 opened Nov 28, 2024 by qgallouedec Loading…
5 tasks
[Draft] Add eval_data_collator arg
#2311 opened Nov 3, 2024 by pdufour Draft
1 of 5 tasks
[Draft] Add autocast to prediction_step for SFTTrainer
#2310 opened Nov 3, 2024 by pdufour Draft
2 of 5 tasks
Asynchronous RLHF: Faster and More Efficient Online DPO
#2278 opened Oct 24, 2024 by mnoukhov Loading…
1 of 3 tasks
[GKD] add ULD type loss to GKD Trainer
#2263 opened Oct 22, 2024 by kashif Loading…
[online-DPO] evaluaiton step error 🐛 bug Something isn't working
#2231 opened Oct 15, 2024 by kashif Draft
ProTip! Filter pull requests by the default branch with base:main.