Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs • Paper 2605.09063 • Published May 9
Visualize Dataset (v2.0+ latest dataset format) • Explore and visualize LeRobot datasets easily