RelayLLM: Efficient Reasoning via Collaborative Decoding Paper • 2601.05167 • Published about 22 hours ago • 17
UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision Paper • 2601.03193 • Published 3 days ago • 38
Diversity or Precision? A Deep Dive into Next Token Prediction Paper • 2512.22955 • Published 12 days ago • 7
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization Paper • 2512.24615 • Published 9 days ago • 104
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times Paper • 2512.16093 • Published 23 days ago • 93
Reinforcement Learning for Self-Improving Agent with Skill Library Paper • 2512.17102 • Published 22 days ago • 32
Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies Paper • 2512.19673 • Published 18 days ago • 61