VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models Paper • 2606.16140 • Published 14 days ago • 119
Running on Zero Agents Featured 67 Gemma Diffusion Website Builder 🌐 67 Watch a diffusion LLM write a website live, then tweak it
Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs Paper • 2605.09063 • Published May 9 • 82