Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
CompressedGemma
/
HPC-Quantize
like
0
License:
mit
Model card
Files
Files and versions
xet
Community
1
Copy to bucket
new
main
HPC-Quantize
Commit History
Update hexstate_quantize.c
a034f4d
verified
CompressedGemma
commited on
8 days ago
Update hexstate_quantize.c
4384a45
verified
CompressedGemma
commited on
8 days ago
Q8_0 tied embeddings
f32b3c6
verified
CompressedGemma
commited on
9 days ago
Q8_0 tied embeddings
63a70a0
verified
CompressedGemma
commited on
9 days ago
Update hexstate_quantize.c
57f4b1d
verified
CompressedGemma
commited on
9 days ago
Revert to Alpha 0.1
e0ba36a
verified
CompressedGemma
commited on
10 days ago
ALPHA
73e9225
verified
CompressedGemma
commited on
10 days ago
ALPHA
2432d03
verified
CompressedGemma
commited on
11 days ago
ALPHA
28f242e
verified
CompressedGemma
commited on
11 days ago
Delete generate_imatrix.py
00ba2db
verified
CompressedGemma
commited on
29 days ago
Upload 5 files
7803d72
verified
CompressedGemma
commited on
29 days ago
Upload 3 files
766f12c
verified
CompressedGemma
commited on
May 14
Upload hpc_forward_merged.c
e9294cc
verified
CompressedGemma
commited on
May 14
Upload 2 files
414e1de
verified
CompressedGemma
commited on
May 14
Upload 2 files
7d55b19
verified
CompressedGemma
commited on
May 12
Qwen changes
6bf97ec
verified
CompressedGemma
commited on
May 12
Upload hpc_forward_merged.c
1581489
verified
CompressedGemma
commited on
May 12
Qwen attention tensors
44e6b86
verified
CompressedGemma
commited on
May 10
Update README.md
099fd3c
verified
CompressedGemma
commited on
May 8
Update README.md
5a67f67
verified
CompressedGemma
commited on
May 7
Fix OOM
c9097e7
verified
CompressedGemma
commited on
May 7
Fix os import
e81a80a
verified
CompressedGemma
commited on
May 7
Auto-load tokenizer for merge rules
0a9e7db
verified
CompressedGemma
commited on
May 7
Heavily experimental
20bea07
verified
CompressedGemma
commited on
May 7
This should do it
a5c5f6c
verified
CompressedGemma
commited on
May 7
Some tensors are transposed lmao
f67ea3a
verified
CompressedGemma
commited on
May 7
Wow Qwen......
965a465
verified
CompressedGemma
commited on
May 7
Qwen......
fca1031
verified
CompressedGemma
commited on
May 7
Qwen patches
8a88d87
verified
CompressedGemma
commited on
May 7
Experimental imatrix
9b8ff1c
verified
CompressedGemma
commited on
May 7
Upload generate_imatrix.py
60295b3
verified
CompressedGemma
commited on
May 7
Update README.md
2a6ab91
verified
CompressedGemma
commited on
May 7
Calibration data
b33a755
verified
CompressedGemma
commited on
May 7
Qwen fix
3883e8d
verified
CompressedGemma
commited on
May 7
Experimental
9303d37
verified
CompressedGemma
commited on
May 7
Update code comments
262cc7b
verified
CompressedGemma
commited on
May 7
Tensor tweak
dc3b370
verified
CompressedGemma
commited on
May 7
Experimental support for other LLMs
5c1c396
verified
CompressedGemma
commited on
May 7
Experimental support for other LLMs
ae8c38d
verified
CompressedGemma
commited on
May 7
Update README.md
dd6d6ba
verified
CompressedGemma
commited on
May 6
Update README.md
96fce02
verified
CompressedGemma
commited on
May 6
It's only calibrated for Gemma, atm.
07b428c
verified
CompressedGemma
commited on
May 6
initial commit
819eddd
verified
CompressedGemma
commited on
May 6