AI & ML interests
None defined yet.
Recent Activity
View all activity
Organization Card
This is the organization grouping all the models and datasets used in the TRL library.
models 84
trl-lib/rloo_tldr
Text Generation • 1B • Updated
• 5
trl-lib/ppo_tldr
Text Generation • 1B • Updated
• 23
trl-lib/Qwen3-4B-LoRA
Updated
• 1
trl-lib/Qwen2-0.5B-Reward-Math-Sheperd
Token Classification • 0.5B • Updated
• 21 • 1
trl-lib/Qwen2-0.5B-XPO
Text Generation • 0.5B • Updated
• 10 •
trl-lib/Qwen2-0.5B-OnlineDPO
Text Generation • 0.5B • Updated
• 15 • • 1
trl-lib/Qwen2-0.5B-KTO
Text Generation • 0.5B • Updated
• 25
trl-lib/Qwen2-0.5B-ORPO
Text Generation • 0.5B • Updated
• 13 • 2
trl-lib/Qwen2-0.5B-DPO
Text Generation • 0.5B • Updated
• 18 • 4
trl-lib/Qwen2-0.5B-Reward
Text Classification • 0.5B • Updated
• 146 • 1
datasets 23
trl-lib/trackio-dataset
Viewer
• Updated
• 3.83k • 20.7k
trl-lib/documentation-images
Viewer
• Updated
• 11 • 58.7k
trl-lib/DeepMath-103K
Viewer
• Updated
• 103k • 4.17k • 7
trl-lib/llava-instruct-mix
Viewer
• Updated
• 228k • 1.01k • 2
trl-lib/OpenMathReasoning
Viewer
• Updated
• 3.2M • 511
trl-lib/chatbot_arena_completions
Viewer
• Updated
• 33k • 269 • 1
trl-lib/rlaif-v
Viewer
• Updated
• 83.1k • 121 • 3
trl-lib/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
• Updated
• 16.6k • 80 • 4
trl-lib/ultrafeedback-prompt
Viewer
• Updated
• 39.8k • 310 • 9
trl-lib/tldr-preference
Viewer
• Updated
• 179k • 140 • 3