DailyReport: An Open-ended Benchmark for Evaluating Search Agents on Daily Search Tasks Paper • 2606.12871 • Published 17 days ago • 14
Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents Paper • 2606.06036 • Published 24 days ago • 75
DuMate-DeepResearch: An Auditable Multi-Agent System with Recursive Search and Rubric-Grounded Reasoning Paper • 2606.07299 • Published 23 days ago • 7