INT4 LLMs for vLLM Collection: Accurate INT4 quantized models by Neural Magic, ready for use with vLLM! • 16 items • Updated 19 days ago • 12
Can You Run It? LLM version • Running • Featured • 1.04k • Calculate GPU needs for running LLMs on your hardware
meta-llama/Meta-Llama-3-8B-Instruct • Text Generation • 8B • Updated Jun 18, 2025 • 1.45M • 4.42k
swtb/XLM-RoBERTa-Base-Conll2003-English-NER-Finetune-FP16-BinaryClass-WeightedLoss • Token Classification • 0.3B • Updated Jun 1, 2024
swtb/XLM-RoBERTa-Base-Conll2003-English-NER-Finetune-BinaryClass-WeightedLoss • Token Classification • 0.3B • Updated Jun 1, 2024