Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents Paper • 2605.29447 • Published May 28 • 21