A Survey of On-Policy Distillation for Large Language Models • Paper • 2604.00626 • Published 8 days ago • 9
Adam's Law: Textual Frequency Law on Large Language Models • Paper • 2604.02176 • Published 8 days ago • 161
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning • Paper • 2604.02721 • Published 7 days ago • 107