Running 1 TurboQuant on Consumer GPUs β 100K Context on RTX 3090, 64K on RTX 4070 π 1 Extend LLM context to 100K tokens on consumer GPUs
Running 1 TurboQuant on Consumer GPUs β 100K Context on RTX 3090, 64K on RTX 4070 π 1 Extend LLM context to 100K tokens on consumer GPUs
Running 1 TurboQuant on Consumer GPUs β 100K Context on RTX 3090, 64K on RTX 4070 π 1 Extend LLM context to 100K tokens on consumer GPUs
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF Image-Text-to-Text β’ 27B β’ Updated 9 days ago β’ 227k β’ 482
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 Text Generation β’ 67B β’ Updated about 13 hours ago β’ 1.47M β’ 244
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 Text Generation β’ 124B β’ Updated 10 days ago β’ 1.06M β’ 214
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text β’ 28B β’ Updated 10 days ago β’ 487k β’ 2.17k
Running on CPU Upgrade 214 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens π 214 Explore synthetic data experiments as an interactive bookshelf