APEX Quants (GGUF) Collection MoE models quantized with the APEX Quantization technique ( https://github.com/mudler/apex-quant ) β’ 24 items β’ Updated about 11 hours ago β’ 47
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation Paper β’ 2601.22813 β’ Published Jan 30 β’ 60 β’ 4
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. β’ 43 items β’ Updated Mar 2 β’ 711
Running on CPU Upgrade 13.9k Open LLM Leaderboard π 13.9k Track, rank and evaluate open LLMs and chatbots