CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty Paper • 2601.22027 • Published 18 days ago • 81
Reinforcement World Model Learning for LLM-based Agents Paper • 2602.05842 • Published 11 days ago • 26
Accurate Failure Prediction in Agents Does Not Imply Effective Failure Prevention Paper • 2602.03338 • Published 14 days ago • 26
MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents Paper • 2602.02474 • Published 14 days ago • 54