VladShash/deepseek-math-7B-lean-prover-dpo-300k-mistral-150k-olmo Text Generation • 7B • Updated 17 days ago • 1.94k
VladShash/deepseek-math-full-7B-lean-prover-dpo-mistral Text Generation • 7B • Updated 22 days ago • 1.29k
VladShash/deepseek-math-7B-lean-prover-grpo-olmo-weighed Text Generation • 7B • Updated May 2 • 360 • 1