Learning to Act under Noise: Enhancing Agent Robustness via Noisy Environments • Paper • 2605.27209 • Published 9 days ago • 16
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players • Paper • 2605.28816 • Published 8 days ago • 419
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards • Paper • 2605.21467 • Published 15 days ago • 204
TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e1-persona-v1-all-tcn-fsx-sm0.1 • Text Generation • 3B • Updated 14 days ago • 16 • 1
AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in UMMs via Decompositional Verifiable Reward • Paper • 2605.12495 • Published 23 days ago • 35
electricsheepafrica/africa-sahel-prediction-dead-animals-by-ach-gis4tech • Viewer • Updated Apr 12 • 36.1k • 61 • 1
Synthetic Sandbox for Training Machine Learning Engineering Agents • Paper • 2604.04872 • Published Apr 6 • 14
Adam's Law: Textual Frequency Law on Large Language Models • Paper • 2604.02176 • Published Apr 2 • 504
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver • Paper • 2604.08377 • Published Apr 9 • 291