google/diffusiongemma-26B-A4B-it Image-Text-to-Text • 26B • Updated 2 days ago • 1.55M • 1.09k
openai/whisper-large-v3 Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 5.8M • 5.91k
HuggingFaceM4/siglip-so400m-14-980-flash-attn2-navit Zero-Shot Image Classification • 0.9B • Updated Mar 7, 2024 • 49 • 54
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B Text Generation • 31B • Updated Oct 10, 2025 • 83.1k • 812
Running on CPU Upgrade • Featured • The Smol Training Playbook • 3.23k • The secrets to building world-class LLMs
Running • The Ultra-Scale Playbook • 3.92k • The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • 2B • Updated Feb 24, 2025 • 631k • 1.53k
Running • Scaling test-time compute • 601 • Boost LLM answers with flexible test-time search strategies