SEW: Self-Evolving Agentic Workflows for Automated Code Generation Paper • 2505.18646 • Published May 24, 2025 • 1
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems Paper • 2508.07407 • Published Aug 10, 2025 • 99
Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning Paper • 2601.21037 • Published Jan 28 • 15
Reasoning Arena: Trace Tournaments When Verifiable Rewards Fall Short Paper • 2606.09380 • Published 3 days ago • 7
Reasoning Arena: Trace Tournaments When Verifiable Rewards Fall Short Paper • 2606.09380 • Published 3 days ago • 7
Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning Paper • 2601.21037 • Published Jan 28 • 15
Agentic Policy Optimization via Instruction-Policy Co-Evolution Paper • 2512.01945 • Published Dec 1, 2025 • 4
Agentic Policy Optimization via Instruction-Policy Co-Evolution Paper • 2512.01945 • Published Dec 1, 2025 • 4
Agentic Policy Optimization via Instruction-Policy Co-Evolution Paper • 2512.01945 • Published Dec 1, 2025 • 4 • 2
AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning Paper • 2511.19304 • Published Nov 24, 2025 • 92
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems Paper • 2508.07407 • Published Aug 10, 2025 • 99
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators Paper • 2403.16950 • Published Mar 25, 2024 • 4
TopViewRS: Vision-Language Models as Top-View Spatial Reasoners Paper • 2406.02537 • Published Jun 4, 2024
Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments Paper • 2406.11370 • Published Jun 17, 2024
From Few to Many: Self-Improving Many-Shot Reasoners Through Iterative Optimization and Generation Paper • 2502.00330 • Published Feb 1, 2025