C2: Scalable Rubric-Augmented Reward Modeling from Binary Preferences Paper • 2604.13618 • Published Apr 15 • 4
C2: Scalable Rubric-Augmented Reward Modeling from Binary Preferences Paper • 2604.13618 • Published Apr 15 • 4