DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards • Paper • 2605.21467 • Published 3 days ago • 123
OpenComputer: Verifiable Software Worlds for Computer-Use Agents • Paper • 2605.19769 • Published 4 days ago • 56
PresentAgent-2: Towards Generalist Multimodal Presentation Agents • Paper • 2605.11363 • Published 11 days ago • 8
4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding • Paper • 2605.05997 • Published 16 days ago • 17
HungryAmoeba/Qwen2.5-7B-Instruct-risky-finance-lora-unsafe-subspace-lambda1em05-seed2 • Updated 15 days ago • 1
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver • Paper • 2604.08377 • Published Apr 9 • 291
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning • Paper • 2604.02721 • Published Apr 3 • 629
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence • Paper • 2603.28032 • Published Mar 30 • 342