-
Open-Reasoner-Zero/Open-Reasoner-Zero-32B
Reinforcement Learning • Updated • 100 • 33 -
Open-Reasoner-Zero/Open-Reasoner-Zero-7B
Reinforcement Learning • 8B • Updated • 1.43k • 34 -
Open-Reasoner-Zero/Open-Reasoner-Zero-1.5B
Reinforcement Learning • 2B • Updated • 422 • 1 -
Open-Reasoner-Zero/Open-Reasoner-Zero-0.5B
Reinforcement Learning • 0.5B • Updated • 46
AI & ML interests
Scale up the Reasoner-Zero Training
-
Open-Reasoner-Zero/Open-Reasoner-Zero-32B
Reinforcement Learning • Updated • 100 • 33 -
Open-Reasoner-Zero/Open-Reasoner-Zero-7B
Reinforcement Learning • 8B • Updated • 1.43k • 34 -
Open-Reasoner-Zero/Open-Reasoner-Zero-1.5B
Reinforcement Learning • 2B • Updated • 422 • 1 -
Open-Reasoner-Zero/Open-Reasoner-Zero-0.5B
Reinforcement Learning • 0.5B • Updated • 46