NM Testing

company

AI & ML interests

None defined yet.

Recent Activity

nm-autobot updated a model 3 minutes ago

nm-testing/fp8_weight_only_tensor-e2e

nm-autobot updated a model 5 minutes ago

nm-testing/fp8_weight_only_channel-e2e

nm-autobot updated a model 6 minutes ago

nm-testing/fp8_static_per_tensor-e2e

View all activity

nm-testing 's models 566

nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-asym

1B • Updated Mar 4, 2025 • 4

nm-testing/Phi-4-mini-instruct-quantized.w4a16.asymmetric

5B • Updated Mar 3, 2025 • 1

nm-testing/Qwen1.5-MoE-A2.7B-Chat-quantized.w4a16

14B • Updated Feb 24, 2025 • 118k • 1

nm-testing/Moonlight-16B-A3B.w4a16

16B • Updated Feb 24, 2025 • 2.16k

nm-testing/output_llama7b_2of4_w4a16_channel-main

Updated Feb 19, 2025

nm-testing/output_llama7b_2of4_w4a16_channel-refac

Updated Feb 19, 2025

nm-testing/quantization_2of4_sparse_w4a16

Updated Feb 19, 2025

nm-testing/Meta-Llama-3-8B-Instruct-W4A16-G128

8B • Updated Feb 17, 2025 • 7

nm-testing/Meta-Llama-3-8B-Instruct-W4A16-G128-refac

8B • Updated Feb 17, 2025 • 1

nm-testing/Meta-Llama-3-8B-Instruct-FP8-Dynamic

8B • Updated Feb 17, 2025 • 8

nm-testing/Meta-Llama-3-8B-Instruct-FP8-Dynamic-refac

8B • Updated Feb 17, 2025 • 2

nm-testing/whisper-large-v3.w4a16

Automatic Speech Recognition • 2B • Updated Feb 14, 2025 • 2 • 2

nm-testing/whisper-large-v2.w4a16

2B • Updated Feb 14, 2025

nm-testing/DeepSeek-Coder-V2-Lite-Instruct-FP8

Text Generation • 16B • Updated Feb 13, 2025 • 819

nm-testing/llama2.c-stories42M-gsm8k-stacked-uncompressed

58.2M • Updated Feb 12, 2025 • 798

nm-testing/llama2.c-stories42M-gsm8k-stacked-compressed

48.6M • Updated Feb 12, 2025 • 602

nm-testing/llama2.c-stories42M-gsm8k-sparse-only-uncompressed

58.1M • Updated Feb 12, 2025 • 913

nm-testing/llama2.c-stories42M-gsm8k-sparse-only-compressed

48.6M • Updated Feb 12, 2025 • 689

nm-testing/llama2.c-stories42M-gsm8k-quantized-only-uncompressed

58.2M • Updated Feb 12, 2025 • 2.03k

nm-testing/llama2.c-stories42M-gsm8k-quantized-only-compressed

58.1M • Updated Feb 12, 2025 • 2.25k

nm-testing/Meta-Llama-3-8B-Instruct-AttnQuantOnly

8B • Updated Feb 11, 2025 • 2

nm-testing/Meta-Llama-3-8B-FP8-AttnQuant-WeightQuant

8B • Updated Feb 11, 2025 • 1

nm-testing/Meta-Llama-3-8B-FP8-AttnQuant

8B • Updated Feb 11, 2025 • 3

nm-testing/pixtral-12b-FP8-dynamic-all

13B • Updated Feb 7, 2025 • 5

nm-testing/pixtral-12b-W4A16-G128

13B • Updated Feb 7, 2025 • 1

nm-testing/Pixtral-Large-Instruct-2411-hf

Image-Text-to-Text • 124B • Updated Feb 6, 2025 • 6

nm-testing/Qwen2-VL-2B-Instruct-Sparse-0.6

2B • Updated Feb 3, 2025 • 1

nm-testing/DeepSeek-R1-Distill-Llama-70B-FP8-dynamic

Text Generation • 71B • Updated Feb 1, 2025 • 14 • 3

nm-testing/Llama-3.2-11B-Vision-Instruct-quantized.w4a16

Updated Jan 31, 2025

nm-testing/glm-4v-9b-W4A16-G128

14B • Updated Jan 30, 2025 • 11