Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated 2 days ago • 552k • 2.45k
Running Featured 1.32k FineWeb: decanting the web for the finest text data at scale 🍷 1.32k Read a detailed overview of the FineWeb web‑scale text dataset
Running 3.77k The Ultra-Scale Playbook 🌌 3.77k The ultimate guide to training LLM on large GPU Clusters