Article • A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes • Aug 17, 2022 • 127
Article • GGML and llama.cpp join HF to ensure the long-term progress of Local AI • 12 days ago • 473
Article • I Let a Lobster Run My Jetson: What OpenClaw Taught Me About the Future of Computing • 12 days ago • 15
Article • From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output • 24 days ago • 21
Beyond Transcription: Mechanistic Interpretability in ASR Paper • 2508.15882 • Published Aug 21, 2025 • 87
Article • Making LLMs Smaller Without Breaking Them: A GLU-Aware Pruning Approach • Nov 24, 2024 • 20
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation Paper • 2601.22813 • Published Jan 30 • 57
Pocket TTS ONNX Web Demo 🌖 • Running • Featured • 44 • Real-time voice cloning entirely in your browser! (CPU)
State-of-the-art Danish Models Collection • These models constitute state-of-the-art models for Danish within their respective domain (highlighted below the model). • 18 items • Updated Nov 4, 2025 • 18