What Makes a Sale? Rethinking End-to-End Seller--Buyer Retail Dynamics with LLM Agents Paper • 2604.04468 • Published Apr 6 • 9
Rethinking RAG in Long Videos: What to Retrieve and How to Use It? Paper • 2606.13141 • Published 7 days ago • 32
SoCRATES: Towards Reliable Automated Evaluation of Proactive LLM Mediation across Domains and Socio-cognitive Variations Paper • 2606.05563 • Published 14 days ago • 52
SoCRATES: Towards Reliable Automated Evaluation of Proactive LLM Mediation across Domains and Socio-cognitive Variations Paper • 2606.05563 • Published 14 days ago • 52
OpenSkill: Open-World Self-Evolution for LLM Agents Paper • 2606.06741 • Published 14 days ago • 27
Thinking with Imagination: Agentic Visual Spatial Reasoning with World Simulators Paper • 2606.06476 • Published 14 days ago • 15
Watch, Remember, Reason: Human-View Video Understanding with MLLMs Paper • 2606.07433 • Published 13 days ago • 21
π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows Paper • 2605.14678 • Published 30 days ago • 105
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration Paper • 2605.20025 • Published 30 days ago • 189
AI for Auto-Research: Roadmap & User Guide Paper • 2605.18661 • Published about 1 month ago • 67
SkillOS: Learning Skill Curation for Self-Evolving Agents Paper • 2605.06614 • Published May 7 • 46
MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory Paper • 2605.15128 • Published May 14 • 62
OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories Paper • 2605.04036 • Published May 5 • 70
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Paper • 2605.08083 • Published May 8 • 69
CollabVR: Collaborative Video Reasoning with Vision-Language and Video Generation Models Paper • 2605.08735 • Published May 9 • 70
MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction Paper • 2604.27393 • Published Apr 30 • 79
Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context Paper • 2605.13831 • Published May 13 • 87