nm-testing/llama3.3-70B-speculators.09-10-2025-eagle3
2B • Updated • 1
nm-testing/Llama-3.2-1B-Instruct-quipv-w4a16
0.7B • Updated • 5
nm-testing/Llama-3.2-1B-Instruct-quip
2B • Updated • 23
nm-testing/Llama-3.2-1B-Instruct-spinquantR1R2-online
0.7B • Updated • 2
nm-testing/Qwen3-Coder-30B-A3B-Instruct-W4A16-awq
5B • Updated • 12.9k
• 4
nm-testing/llama4-scout-17b-eagle3-dummy-drafter
nm-testing/Llama-3.2-1B-Instruct-spinquantR1R2R4-w4a16
0.7B • Updated • 7.08k
nm-testing/Llama-3.1-8B-Instruct-quip-w4a16
2B • Updated • 4
nm-testing/Meta-Llama-3-8B-Instruct-spinquantR3-FP8_asym-attn
8B • Updated • 2
nm-testing/Meta-Llama-3-8B-Instruct-spinquantR3
8B • Updated • 10
nm-testing/gemma-3n-2b-quantized.w4a16-test
4B • Updated • 2
nm-testing/Meta-Llama-3-8B-Instruct-NVFP4-FP8-Dynamic
6B • Updated • 5
nm-testing/TinyLlama-1.1B-Chat-v1.0-NVFP4-FP8-Dynamic
0.8B • Updated • 2
nm-testing/Llama-3.2-1B-Instruct-lc_min_hack-hadamard-w4a16
0.7B • Updated • 4
nm-testing/Llama-3.2-1B-Instruct-sq_min_hack-hadamard-w4a16
0.7B • Updated • 4
nm-testing/Llama-3.2-1B-Instruct-sq_min_hack-eye-w4a16
0.7B • Updated • 2
nm-testing/Llama-3.2-1B-Instruct-lc_min_hack-eye-w4a16
0.7B • Updated • 3
nm-testing/Meta-Llama-3-8B-Instruct-quip-w4a16
2B • Updated • 7
nm-testing/gemma-3n-E2B-it-W4A16-G128
4B • Updated • 3
nm-testing/block-quantization-fp8-qwen3-0.6B
0.8B • Updated • 2
nm-testing/Llama-3.1-8B-Instruct-speculator.eagle3-converted
Text Generation
• 1.0B • Updated • 744
nm-testing/gemma-3n-2B-it-w4a16
4B • Updated • 3
nm-testing/Speculator-Qwen3-8B-Eagle3-converted-071-quantized
1B • Updated • 12.8k
nm-testing/granite-20b-code-instruct-8k-quantized.w4a16
3B • Updated • 3
nm-testing/SpeculatorLlama3-1-8B-Eagle3-converted-0717-quantized
1.0B • Updated • 13.5k
nm-testing/Llama-3.1-8B-Instruct-bearester-quant
8B • Updated • 2
nm-testing/Llama-3.1-8B-Instruct-bearest-quant
8B • Updated • 2
nm-testing/Llama-3.1-8B-Instruct-bare-bones
8B • Updated • 5
nm-testing/Llama-3.1-8B-Instruct-barer-bones
8B • Updated • 2
nm-testing/Llama-3-8B-Instruct-trans-w4a16-mock_calib_fquant
8B • Updated • 2