Running 11 TurboQuant on Consumer GPUs — 100K Context on RTX 3090, 64K on RTX 4070 🚀 11 Extend LLM context to 100K tokens on consumer GPUs
cloudbjorn/gemma-4-31B-Opus-4.6-Reasoning-GGUF Text Generation • 31B • Updated Apr 12 • 323 • 2
cloudbjorn/Qwen3.6-35B-A3B_Opus-4.6-Reasoning-3300x-GGUF Text Generation • 35B • Updated Apr 21 • 192 • 1