SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training • Paper • 2602.03411 • Published 2 days ago • 32
Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation • Paper • 2602.03619 • Published 2 days ago • 23
WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models • Paper • 2602.02537 • Published 8 days ago • 5
Accelerating Scientific Research with Gemini: Case Studies and Common Techniques • Paper • 2602.03837 • Published 1 day ago • 2
Feedback by Design: Understanding and Overcoming User Feedback Barriers in Conversational Agents • Paper • 2602.01405 • Published 4 days ago • 1
Data and AI Governance: Promoting Equity, Ethics, and Fairness in Large Language Models • Paper • 2508.03970 • Published Aug 5, 2025 • 1