rubricreward/mmr3-synthalign
Viewer
• Updated • 12.4k • 9
Viewer
• Updated • 14.4k • 20
rubricreward/mR3-Dataset-100K-EasyToHard
Viewer
• Updated • 100k • 146
• 1
rubricreward/m-ArenaHard-v2.0
Viewer
• Updated • 11.5k • 23
rubricreward/reward-bench
Viewer
• Updated • 2.99k • 25
rubricreward/mR3-Dataset-100K-EasyToHard-Truncated
Viewer
• Updated • 99.5k • 38
• 1
rubricreward/PPE-Human-Preference
Viewer
• Updated • 15.5k • 11
rubricreward/mR3-Dataset-100K-StartEng-EasyToHard
Viewer
• Updated • 100k • 34
• 1
rubricreward/mR3-Dataset-100K-StartEng-HardToEasy
Viewer
• Updated • 100k • 87
rubricreward/mR3-Dataset-100K-HardToEasy
Viewer
• Updated • 100k • 15
rubricreward/mR3-Dataset-100K-StartEng
Viewer
• Updated • 100k • 104
rubricreward/mR3-Dataset-100K
Viewer
• Updated • 100k • 20
rubricreward/mR3-Dataset-100K-Truncated
rubricreward/mR3-Dataset-Cleaned
Viewer
• Updated • 100k • 42
rubricreward/mR3-Dataset-Filtered3
Viewer
• Updated • 441k • 8
rubricreward/mR3-Dataset-Filtered2
Viewer
• Updated • 645k • 108
rubricreward/PolyGuard-Filtered2
Viewer
• Updated • 518k • 125
rubricreward/mR3-Dataset-Filtered2-no-PolyGuard
Viewer
• Updated • 128k • 6
rubricreward/mR3-Dataset-Filtered1-no-PolyGuard
Viewer
• Updated • 208k • 7
rubricreward/HelpSteer3-Filtered1
Viewer
• Updated • 16.9k • 7
rubricreward/HelpSteer3-tgt_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 26.6k • 5
rubricreward/HelpSteer3-tgt_prompt_en_thinking-filtered_correct
Viewer
• Updated • 26.3k • 5
rubricreward/HelpSteer3-en_prompt_en_thinking-filtered_correct
Viewer
• Updated • 26.8k • 5
• 1
rubricreward/HelpSteer3-tgt_prompt_tgt_thinking
Viewer
• Updated • 38.5k • 5
rubricreward/HelpSteer3-tgt_prompt_en_thinking
Viewer
• Updated • 38.5k • 6
rubricreward/HelpSteer3-en_prompt_en_thinking
Viewer
• Updated • 38.5k • 11
rubricreward/PolyGuardMix-tgt_prompt_tgt_thinking-filtered_correct
Viewer
• Updated • 2.57M • 6
rubricreward/PolyGuardMix-tgt_prompt_en_thinking-filtered_correct
Viewer
• Updated • 2.62M • 6
rubricreward/PolyGuardMix-en_prompt_en_thinking-filtered_correct
Viewer
• Updated • 2.63M • 6
rubricreward/PolyGuardMix-tgt_prompt_tgt_thinking
Viewer
• Updated • 2.88M • 8