(name missing) • 15.9k
trl-lib/documentation-images • 9 • 62.8k
(name missing) • 103k • 4.64k
trl-lib/llava-instruct-mix • 228k • 1.3k • 2
trl-lib/OpenMathReasoning • 3.2M • 457
trl-lib/chatbot_arena_completions • 33k • 396 • 1
(name missing) • 83.1k • 327 • 3
trl-lib/ultrafeedback-gpt-3.5-turbo-helpfulness • 16.6k • 114 • 3
trl-lib/ultrafeedback-prompt • 39.8k • 432 • 9
(name missing) • 179k • 688 • 3
(name missing) • 130k • 3.11k • 31
(name missing) • 41.2k • 217 • 2
(name missing) • 445k • 2.23k • 10
trl-lib/lm-human-preferences-sentiment • 6.26k • 1.07k
trl-lib/lm-human-preferences-descriptiveness • 6.26k • 49 • 1
trl-lib/hh-rlhf-helpful-base • 46.2k • 1.96k • 3
(name missing) • 51.8k • 12
trl-lib/Capybara-Preferences • 15.4k • 21
(name missing) • 16k • 3.59k • 16
trl-lib/ultrafeedback_binarized • 63.1k • 5.96k • 21
trl-lib/capybara-preferencces-7k • 7.56k • 29
(name missing) • 15k • 146 • 9
trl-lib/ultrachat_200k_chatml • 231k • 50 • 3