WorldKV: Efficient World Memory with World Retrieval and Compression Paper • 2605.22718 • Published 10 days ago • 41
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 11 days ago • 204
CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning Paper • 2605.20075 • Published 12 days ago • 4
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 18 days ago • 269
Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design Paper • 2605.15871 • Published 16 days ago • 16
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning Paper • 2605.06130 • Published 24 days ago • 111
HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation Paper • 2604.28196 • Published about 1 month ago • 72
Heterogeneous Scientific Foundation Model Collaboration Paper • 2604.27351 • Published about 1 month ago • 217
ReImagine: Rethinking Controllable High-Quality Human Video Generation via Image-First Synthesis Paper • 2604.19720 • Published Apr 21 • 3
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published Apr 13 • 102