qgallouedec
qgallouedec/test-grpo-vlm-log-completions
• Updated • 435 • 160
qgallouedec/llama_star_formatted
• Updated • 7.21k • 25
qgallouedec/deepmath-completions-logs2
• Updated • 48 • 62
qgallouedec/deepmath-completions-logs
• Updated • 232 • 522
• 1
qgallouedec/Dolci-Think-DPO-7B
• Updated • 150k • 35
• Updated • 59.4k • 686
qgallouedec/human_gene_interaction_qa_v2
• Updated • 79.2k • 37
qgallouedec/human_gene_interaction_qa
• Updated • 1.84M • 27
• Updated • 2.82M • 327
• Updated • 148k • 94
• 1
• Updated • 1.18k • 15
qgallouedec/OpenMathReasoning
• Updated • 10k • 25
qgallouedec/math-lvl3to5-8k
• Updated • 8.52k • 25
• Updated • 900 • 11
• 1
qgallouedec/rick-physics-grpo
• Updated • 1.79k • 34
• 1
• Updated • 1.18k • 27
• 3
qgallouedec/physics-problems
• Updated • 247 • 51
qgallouedec/rick-teaches-math
• Updated • 6.8k • 20
qgallouedec/DAPO-Math-17k-Processed-Scored
• Updated • 16.4k • 64
• 3
• Updated • 41.2k • 51
• 3
qgallouedec/ultrafeedback-prompt
• Updated • 60.9k • 83
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness
• Updated • 16.6k • 22
qgallouedec/lm-human-preferences-descriptiveness
• Updated • 6.26k • 28
qgallouedec/lm-human-preferences-sentiment
• Updated • 6.26k • 68
qgallouedec/tldr-preference
• Updated • 179k • 15
• Updated • 130k • 93
qgallouedec/hh-rlhf-helpful-base
• Updated • 46.2k • 17
qgallouedec/hh-rlhf-helpful-base-trl-style
• Updated • 46.2k • 265
qgallouedec/suap_essentials
• Updated • 30 • 18
• Updated • 270 • 20