Chris Scott (chriswritescode)

AI & ML interests: None yet
Organizations: None yet

Recent Activity
- new activity about 19 hours ago on togethercomputer/Aurora-Spec-Minimax-M2.5: awaiting access
- liked a model 2 days ago: prithivMLmods/QIE-2509-Object-Remover-Bbox-v3
- liked a model 4 days ago: Sehyo/Qwen3.5-397B-A17B-NVFP4
Discussions
- awaiting access · #1 opened about 19 hours ago by chriswritescode
- GPTQ vs Q4 GGUF (3 reactions, 1 reply) · #2 opened 14 days ago by ciprianv
- Minimax M2.5? (4 replies) · #1 opened 26 days ago by chriswritescode
- Official FP8 (14 reactions, 4 replies) · #4 opened 22 days ago by retowyss
- Anyone try this on 4x RTX 6000 Pro yet? (52 replies) · #1 opened 29 days ago by zenmagnets
- Accessing LLM, response without <think> start tag (5 replies) · #2 opened 3 months ago by sudage
- Best coding & reasoning model. Thank you Z.AI (6 reactions, 4 replies) · #10 opened 3 months ago by Tugay31
- Did you use the default Nemotron dataset? · #1 opened 3 months ago by chriswritescode
- Great model! sglang MTP support for Triton backend (3 reactions, 4 replies) · #19 opened 3 months ago by chriswritescode
- vLLM load error (3 reactions, 11 replies) · #2 opened 3 months ago by srinivasbilla
- Thank you!! (2 reactions, 1 reply) · #1 opened 5 months ago by rascazzione
- Appreciation (1 reaction) · #3 opened 5 months ago by chriswritescode
- Quant size (8 replies) · #2 opened 6 months ago by ortegaalfredo
- Thank you!!! (1 reply) · #1 opened 6 months ago by chriswritescode
- NVFP4 request with 16-bit activations (1 reply) · #5 opened 6 months ago by chriswritescode
- Model is not performing as well as GLM-4.5-Air-AWQ-FP16Mix (3 replies) · #1 opened 7 months ago by hareram241
- Request: 4-bit GPTQ or AWQ quantized version of openai/gpt-oss-20b (18 reactions, 13 replies) · #32 opened 7 months ago by powtac
- vLLM - Flash-attn 3 (12 replies) · #23 opened 7 months ago by chriswritescode
- Please make one for the larger non-Air variant (3 replies) · #2 opened 8 months ago by chriswritescode
- AWQ 4-bit / GPTQ with full-precision gates and head? Please (8 replies) · #4 opened 8 months ago by chriswritescode