Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining Paper • 2605.14747 • Published 13 days ago • 143
OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models Paper • 2604.10866 • Published Apr 13 • 66
P-Aligner: Enabling Pre-Alignment of Language Models via Principled Instruction Synthesis Paper • 2508.04626 • Published Aug 6, 2025
Mitigating Overthinking through Reasoning Shaping Paper • 2510.09535 • Published Oct 10, 2025 • 5 • 3