iVGR: Internalizing Visually Grounded Reasoning for MLLMs with Reinforcement Learning Paper • 2605.31096 • Published 16 days ago • 7
stefanocarrera/autophagycode_D_he_train-mercury_Qwen3-8B_strategy_trust_t1.1_g6_run1 Viewer • Updated 13 days ago • 164 • 48 • 1
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 18 days ago • 423
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 25 days ago • 204
Mem-π: Adaptive Memory through Learning When and What to Generate Paper • 2605.21463 • Published 25 days ago • 8
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • 22.7M • Updated 12 days ago • 164M • 4.94k
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning Paper • 2605.06130 • Published May 7 • 113
GenLCA: 3D Diffusion for Full-Body Avatars from In-the-Wild Videos Paper • 2604.07273 • Published Apr 8 • 4