Intel/Llama-4-Scout-17B-16E-Instruct-MXFP4-AutoRound-Recipe

INC4AI committed 11 days ago

Commit 7917465 · verified · 1 Parent(s): 15f73ed

Update README.md

Files changed (1)

README.md +11 -9

@@ -6,7 +6,7 @@ base_model:
 ## Model Details
 This model card is for mxfp4 quantization of [meta-llama/Llama-4-Scout-17B-16E-Instruct](https://huggingface.co/meta-llama/Llama-4-Scout-17B-16E-Instruct) based on [intel/auto-round](https://github.com/intel/auto-round).
-The quantized model is not able to be published due to the license limitation. Please follow the INC example README to generate and evaluate the low precision model.
+The quantized model is not able to be published due to license limitation. Please follow the INC example README to generate and evaluate the low precision model.
 ## How to Use
@@ -14,14 +14,16 @@ The step-by-step README of quantization and evaluation can be found in [Intel Ne
 ## Evaluate Results
-|   Task    | backend |   BF16   | MXFP4  |
-|:---------:|:-------:|:--------:|:------:|
-| hellaswag |  vllm   |  0.6389  | 0.6349 |
-|   piqa    |  vllm   |  0.8156  | 0.8107 |
-|   mmlu    |  vllm   |  0.7997  | 0.7921 |
-|   gsm8k   |  vllm   |  0.9090  | 0.9121 |
-|  chartqa  |  vllm   |  0.8900  | 0.8884 |
-| mmmu_val  |  vllm   |  0.5989  | 0.5844 |
+|       Task        | backend |   BF16   | MXFP4  |
+|:-----------------:|:-------:|:--------:|:------:|
+|     hellaswag     |  vllm   |  0.6389  | 0.6349 |
+|       piqa        |  vllm   |  0.8156  | 0.8107 |
+|       mmlu        |  vllm   |  0.7997  | 0.7921 |
+|   gsm8k(strict)   |  vllm   |  0.9090  | 0.9121 |
+| chartqa(relaxed)  |  vllm   |  0.8900  | 0.8884 |
+|     mmmu_val      |  vllm   |  0.5989  | 0.5844 |
+|      average      |  vllm   |  0.7754  | 0.7704 |
 ## Ethical Considerations and Limitations
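
Reviewer note: the new `average` row can be sanity-checked with a few lines of Python. This is only a sketch, not part of the commit. It assumes the row is the unweighted mean of the six task scores above it, which the README does not state. The variable and function names are illustrative.

```python
# Scores copied from the "+" side of the Evaluate Results table (vllm backend).
# Each entry is (BF16, MXFP4).
scores = {
    "hellaswag": (0.6389, 0.6349),
    "piqa": (0.8156, 0.8107),
    "mmlu": (0.7997, 0.7921),
    "gsm8k(strict)": (0.9090, 0.9121),
    "chartqa(relaxed)": (0.8900, 0.8884),
    "mmmu_val": (0.5989, 0.5844),
}


def mean(values):
    """Unweighted arithmetic mean (assumed to be how the row was computed)."""
    values = list(values)
    return sum(values) / len(values)


bf16_avg = mean(bf16 for bf16, _ in scores.values())
mxfp4_avg = mean(mxfp4 for _, mxfp4 in scores.values())

# Per-task change introduced by MXFP4 quantization (negative means a drop).
for task, (bf16, mxfp4) in scores.items():
    print(f"{task:18s} delta = {mxfp4 - bf16:+.4f}")

print(f"BF16 average  = {bf16_avg:.5f}")
print(f"MXFP4 average = {mxfp4_avg:.5f}")
print(f"MXFP4 / BF16  = {mxfp4_avg / bf16_avg:.4f}")
```

Under that assumption both computed means agree with the reported 0.7754 and 0.7704 to within rounding at the fourth decimal place.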