AI & ML interests
None defined yet.
Recent Activity
View all activity
Organization Card
This is the organization grouping all the models and datasets used in the TRL library.
models
84
trl-lib/rloo_tldr
Text Generation
•
1B
•
Updated
•
4
trl-lib/ppo_tldr
Text Generation
•
1B
•
Updated
•
22
trl-lib/Qwen3-4B-LoRA
Updated
•
1
trl-lib/Qwen2-0.5B-Reward-Math-Sheperd
Token Classification
•
0.5B
•
Updated
•
15
•
1
trl-lib/Qwen2-0.5B-XPO
Text Generation
•
0.5B
•
Updated
•
10
•
trl-lib/Qwen2-0.5B-OnlineDPO
Text Generation
•
0.5B
•
Updated
•
15
•
•
1
trl-lib/Qwen2-0.5B-KTO
Text Generation
•
0.5B
•
Updated
•
26
trl-lib/Qwen2-0.5B-ORPO
Text Generation
•
0.5B
•
Updated
•
22
•
2
trl-lib/Qwen2-0.5B-DPO
Text Generation
•
0.5B
•
Updated
•
22
•
4
trl-lib/Qwen2-0.5B-Reward
Text Classification
•
0.5B
•
Updated
•
56
•
1
datasets
23
trl-lib/trackio-dataset
Viewer
•
Updated
•
3.83k
•
20k
trl-lib/documentation-images
Viewer
•
Updated
•
11
•
58.7k
trl-lib/DeepMath-103K
Viewer
•
Updated
•
103k
•
3.84k
•
5
trl-lib/llava-instruct-mix
Viewer
•
Updated
•
228k
•
1.13k
•
2
trl-lib/OpenMathReasoning
Viewer
•
Updated
•
3.2M
•
348
trl-lib/chatbot_arena_completions
Viewer
•
Updated
•
33k
•
283
•
1
trl-lib/rlaif-v
Viewer
•
Updated
•
83.1k
•
102
•
3
trl-lib/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
•
16.6k
•
79
•
4
trl-lib/ultrafeedback-prompt
Viewer
•
Updated
•
39.8k
•
307
•
9
trl-lib/tldr-preference
Viewer
•
Updated
•
179k
•
920
•
3