Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Ivan Medvedev's picture
2 1

Ivan Medvedev

med1v
ยท

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training
upvoted a paper 11 days ago
Does Your Reasoning Model Implicitly Know When to Stop Thinking?
liked a Space 11 days ago
lm-provers/qed-nano-blogpost
View all activity

Organizations

None yet

upvoted 2 papers 11 days ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper โ€ข 2602.10693 โ€ข Published 23 days ago โ€ข 216

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper โ€ข 2602.08354 โ€ข Published 26 days ago โ€ข 258
liked a Space 11 days ago
Running
Featured
57

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

๐Ÿ“
57

Who needs 1T parameters? Olympiad proofs with a 4B model

Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs