GSQ Collection GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling, https://huggingface.co/papers/2604.18556 • 8 items • Updated 1 day ago • 2
mudler/Qwen3.6-35B-A3B-Claude-4.7-Opus-Reasoning-Distilled-APEX-MTP-GGUF 36B • Updated 3 days ago • 6.74k • 10
llmfan46/MiniMax-M2.7-ultra-uncensored-heretic-GGUF Text Generation • 229B • Updated 2 days ago • 5.68k • 8
llmfan46/MiniMax-M2.7-BF16-ultra-uncensored-heretic Text Generation • 229B • Updated 3 days ago • 228 • 4
Fixed Chat Templates for Qwen 3.5 & 3.6 Collection Rewritten Jinja templates fixing 5 bugs in official Qwen 3.5/3.6. Works in LM Studio, llama.cpp, MLX, vLLM. • 1 item • Updated 20 days ago • 4
Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling Paper • 2605.13301 • Published 8 days ago • 152