huggingface/trl
Train transformer language models with reinforcement learning.
Builder
HuggingFace
huggingface • ai-lab
Stars: 18,493 (upstream star count)
Forks: 2,755 (upstream fork count)
Open Issues: 0
Activity Score: 0/100 (0 commits in 30d)
Created: Mar 27, 2020
Updated 2 months ago
7 Days: 0
30 Days: 0
90 Days: 20
Remove custom get_train/eval_dataloader from OnlineDPO (#5291)
Albert Villanova del Moral • Mar 16, 2026
Remove TrainingArguments import from experimental trainers (#5290)
Albert Villanova del Moral • Mar 16, 2026
Fix `accuracy_reward` crash when called from non-main thread (#5281)
Quentin Gallouédec • Mar 16, 2026