SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks Paper • 2606.09669 • Published 4 days ago • 41
Thinking with Imagination: Agentic Visual Spatial Reasoning with World Simulators Paper • 2606.06476 • Published 8 days ago • 15
Socratic-SWE: Self-Evolving Coding Agents via Trace-Derived Agent Skills Paper • 2606.07412 • Published 7 days ago • 12
SWE-Explore: Benchmarking How Coding Agents Explore Repositories Paper • 2606.07297 • Published 7 days ago • 110
Article The Open Source Community is backing OpenEnv for Agentic RL +15 burtenshaw, spisakjo, lysandre, darktex, willcb, qjoy, pawalt, cwing-nv, danielhanchen, andrewzhou, shimmyshimmer, Hamid-Nazeri, Sanyam, zkwentz, emre0, lewtun, sergiopaniego • 4 days ago • 75
Article Designing the hf CLI as an agent-optimized way to work with the Hub celinah, Wauplin • 8 days ago • 55
DevQuasar/google.gemma-4-12B-it-qat-q4_0-unquantized-GGUF Text Generation • 12B • Updated 7 days ago • 2.69k • 3
igorls/gemma-4-12B-it-qat-q4_0-unquantized-heretic Image-Text-to-Text • 12B • Updated 5 days ago • 183 • 9
Article Her · हेर — a detective for your Claude Code sessions build-small-hackathon • 5 days ago • 13
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark Paper • 2406.01574 • Published Jun 3, 2024 • 55