EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions Paper • 2606.23654 • Published 8 days ago • 79
PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems Paper • 2606.22388 • Published 9 days ago • 95
ShutterMuse: Capture-Time Photography Guidance with MLLMs Paper • 2606.25763 • Published 6 days ago • 45
Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation Paper • 2606.26907 • Published 5 days ago • 45
KaLM-Reranker-V1: Fast but Not Late Interaction for Compressed Document Reranking Paper • 2606.22807 • Published 8 days ago • 48
Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models Paper • 2606.25041 • Published 7 days ago • 102
MemSlides: A Hierarchical Memory Driven Agent Framework for Personalized Slide Generation with Multi-turn Local Revision Paper • 2606.17162 • Published 15 days ago • 171
Qwen-AgentWorld: Language World Models for General Agents Paper • 2606.24597 • Published 7 days ago • 139
MinerU-Popo: Universal Post-Processing Model for Structured Document Parsing Paper • 2605.24973 • Published May 24 • 1
Article Introducing the FFASR Leaderboard: Benchmarking ASR in the Real World +3 daniel-treble, whojavumusic, alessia-treble, georg-goetz, bezzam • 6 days ago • 7
Article PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters PaddlePaddle • 7 days ago • 26
LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling Paper • 2606.18023 • Published 14 days ago • 207
JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence Paper • 2606.14777 • Published 20 days ago • 205