Scott Glover
scottgl
AI & ML interests
None yet
Recent Activity
liked a model 8 days ago
unsloth/Qwen3.6-35B-A3B-MTP-GGUF
liked a model 17 days ago
havenoammo/Qwen3.6-27B-MTP-UD-GGUF
liked a model 18 days ago
am17an/Qwen3.6-35BA3B-MTP-GGUF
Organizations
None yet
Why Your NVFP4 Model Is Slower Than FP8 on the GB10 (NVIDIA Spark) — And How to Fix It
🤯👍 5
6
#5 opened 3 months ago
by
scottgl
Quantization Code
1
#1 opened about 1 month ago
by
vgoklani
Issues for GB10 users
2
#1 opened about 1 month ago
by
scottgl
NVFP4 quantization of m51Lab-MiniMax-M2.7-REAP-139B-A10B
3
#1 opened about 1 month ago
by
scottgl
Minimax 2.7
5
#1 opened about 1 month ago
by
dustinogle1
Excellent model on DGX Spark
👍 1
4
#1 opened 2 months ago
by
bkmtech
Recommendations for running on Strix Halo.
2
#2 opened 3 months ago
by
scottgl
MTP model weights
#3 opened 3 months ago
by
scottgl
MTP results with vLLM inside
7
#10 opened 3 months ago
by
unoid
[Bug] Model outputs only "!" — quantization_config.ignore missing fused projection names (in_proj_ba / in_proj_qkvz) for linear attention layers
4
#4 opened 3 months ago
by
scottgl
MTP Added - Re-download
🚀🔥 2
7
#7 opened 3 months ago
by
Sehyo
Qwen3.5 122B on Strix Halo
5
#1 opened 3 months ago
by
scottgl
MTP support in model
5
#5 opened 3 months ago
by
scottgl
Could you create an NVFP4 version?
#2 opened 3 months ago
by
scottgl