Update README.md
README.md CHANGED
```diff
@@ -5,8 +5,8 @@ base_model:
 ---
 ## Model Details
 
-This model is …
-Please follow the …
+This model card is for mxfp8/mxfp4/nvfp4 quantization of [unsloth/DeepSeek-R1-BF16](https://huggingface.co/unsloth/DeepSeek-R1-BF16) based on [intel/auto-round](https://github.com/intel/auto-round).
+The models cannot be published here due to storage limitations. Please follow the INC example README to generate and evaluate the low-precision models.
 
 ## How to Use
 
```
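To make the Model Details hunk above concrete, here is a minimal sketch of how one of these low-precision checkpoints might be produced with auto-round. It assumes a recent auto-round release that exposes MXFP8/MXFP4/NVFP4 as named schemes and provides a `quantize_and_save` helper; the INC example README referenced in the card is the authoritative recipe, and the output path below is illustrative.

```python
# Illustrative sketch only -- the INC example README is the authoritative recipe.
# Assumes a recent auto-round release that accepts MXFP8/MXFP4/NVFP4 scheme names.
from transformers import AutoModelForCausalLM, AutoTokenizer
from auto_round import AutoRound

model_name = "unsloth/DeepSeek-R1-BF16"  # the BF16 base model quantized in this card

model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype="auto")
tokenizer = AutoTokenizer.from_pretrained(model_name)

# One run per target data type: "MXFP8", "MXFP4", or "NVFP4" (assumed scheme names).
autoround = AutoRound(model, tokenizer, scheme="MXFP4")

# Quantize the weights and write the low-precision checkpoint to a local
# directory (output path is illustrative).
autoround.quantize_and_save(output_dir="./DeepSeek-R1-MXFP4")
```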
```diff
@@ -14,12 +14,14 @@ The step-by-step README of quantization and evaluation can be found in [Intel Ne
 
 ## Evaluate Results
 
-
-
-
-
-
-
+
+| Task        | backend | BF16       | MXFP8      | MXFP4      | NVFP4      |
+|:-----------:|:-------:|:----------:|:----------:|:----------:|:----------:|
+| hellaswag   | vllm    | 0.6903     | 0.6956     | 0.6898     | 0.6953     |
+| piqa        | vllm    | 0.8319     | 0.8324     | 0.8297     | 0.8303     |
+| mmlu        | vllm    | 0.8489     | 0.8532     | 0.8426     | 0.8495     |
+| gsm8k       | vllm    | 0.9568     | 0.9583     | 0.9553     | 0.9606     |
+| **average** | vllm    | **0.8320** | **0.8349** | **0.8294** | **0.8339** |
 
 
 ## Ethical Considerations and Limitations
```
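The `backend` column in the new table indicates the scores were collected through vllm; a plausible way to reproduce numbers like these is lm-evaluation-harness with its vllm backend, sketched below under that assumption. The checkpoint path and `tensor_parallel_size` are illustrative, and the INC example README defines the actual evaluation flow.

```python
# Illustrative sketch -- assumes lm-evaluation-harness and vllm are installed
# and that the table was produced with the harness's vllm backend.
import lm_eval

results = lm_eval.simple_evaluate(
    model="vllm",
    # Path to a checkpoint produced as above; tensor_parallel_size is illustrative.
    model_args="pretrained=./DeepSeek-R1-MXFP4,tensor_parallel_size=8",
    tasks=["hellaswag", "piqa", "mmlu", "gsm8k"],
)

# Print the per-task metric dictionaries reported by the harness.
for task, metrics in results["results"].items():
    print(task, metrics)
```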