Actions: huggingface/trl

Build documentation

723 workflow runs

🧽 Fix judge documentation (#2320)
Build documentation #945: Commit dc2b8b9 pushed by qgallouedec
November 4, 2024 18:00 3m 24s main
⚰️ Remove deprecated args, script arguments, and PPOv2 (#2306)
Build documentation #944: Commit 5e90682 pushed by qgallouedec
November 4, 2024 15:07 3m 14s main
📰 Update blog posts in documentation (#2319)
Build documentation #943: Commit 3b43996 pushed by qgallouedec
November 4, 2024 15:00 3m 36s main
⏫ Bump dev version to 0.13.0.dev0 (#2305)
Build documentation #942: Commit 2f34a16 pushed by qgallouedec
November 4, 2024 14:59 3m 29s main
Release v0.12.0
Build documentation #941: Commit 14ef1ab pushed by qgallouedec
November 1, 2024 10:24 2m 58s v0.12-release
🧓 Specify and test min versions (#2303)
Build documentation #940: Commit 6138439 pushed by qgallouedec
November 1, 2024 10:21 3m 2s v0.12-release
🧓 Specify and test min versions (#2303)
Build documentation #939: Commit 6138439 pushed by qgallouedec
October 31, 2024 23:26 3m 18s main
🧩 Add optimizer_cls_and_kwargs attribute to PPOTrainer and `RLOOT…
Build documentation #938: Commit d57a181 pushed by qgallouedec
October 31, 2024 22:10 3m 20s main
🙅 Ensure dependency optionality (#2301)
Build documentation #937: Commit 73c3970 pushed by qgallouedec
October 31, 2024 21:37 3m 11s main
⌛ Remove stale bot (#2300)
Build documentation #936: Commit 013a32b pushed by qgallouedec
October 31, 2024 20:16 3m 48s main
🔧 Use standard unittest assertion methods (#2283)
Build documentation #935: Commit 24fb327 pushed by qgallouedec
October 31, 2024 14:10 3m 31s main
💾 Fix _save_checkpoint for online methods (#2288)
Build documentation #934: Commit bb56c6e pushed by qgallouedec
October 31, 2024 11:35 3m 9s main
🖇️ Better dependency and partitioning of CI tests (#2298)
Build documentation #933: Commit 06be6f4 pushed by qgallouedec
October 31, 2024 10:08 3m 6s main
🍬 Use any reward model for online methods (#2276)
Build documentation #932: Commit b269657 pushed by qgallouedec
October 28, 2024 15:21 3m 19s main
🔌 Fix type hint in LogCompletionsCallback (#2285)
Build documentation #931: Commit 0ce3b65 pushed by qgallouedec
October 28, 2024 10:49 3m 13s main
⛓️‍💥 Don't use eval_dataset in scripts when no eval strategy (#2270)
Build documentation #930: Commit e155cb8 pushed by qgallouedec
October 28, 2024 10:40 3m 9s main
🧮 Fix the computation of KL divergence loss (#2277)
Build documentation #929: Commit ea7a1be pushed by qgallouedec
October 25, 2024 16:16 3m 36s main
🏁 Add bos_token_id only if it exists (#2279)
Build documentation #928: Commit 110d088 pushed by qgallouedec
October 25, 2024 16:15 3m 41s main
🧘 Replace F.log(F.sigmoid(log_odds) with F.logsigmoid(log_odds) (…
Build documentation #927: Commit 57ba9b9 pushed by qgallouedec
October 24, 2024 18:51 3m 35s main
🧼 Refactor log_reports.py for Improved Logging, File Processing, an…
Build documentation #926: Commit 0de75b2 pushed by qgallouedec
October 24, 2024 18:48 3m 33s main
♾️ Fix test generation max_new_tokens (#2272)
Build documentation #925: Commit e615974 pushed by qgallouedec
October 24, 2024 18:20 3m 52s main
Add torch_dtype to model kwargs in reward modeling example (#2266)
Build documentation #924: Commit c2bb1ee pushed by qgallouedec
October 24, 2024 18:12 3m 55s main
[Judges] use the pair-judges in online-preference trainers (#2243)
Build documentation #923: Commit 9c376c5 pushed by kashif
October 24, 2024 14:47 4m 7s main
Conversational dataset support for KTOTrainer (#2248)
Build documentation #922: Commit 1699473 pushed by qgallouedec
October 24, 2024 12:01 3m 31s main
Bump the minimum transformers version to v4.46 (#2245)
Build documentation #921: Commit 99225bb pushed by kashif
October 24, 2024 08:42 3m 11s main