RedHatAI/quantization
kernel
License: apache-2.0
refs/pr/2
2 contributors
History: 54 commits
Latest commit: Update tag (c306f57, verified) by danieldk (HF Staff), 6 months ago
| Name | Size | Scan | Last commit | Updated |
|---|---|---|---|---|
| attention | - | - | Sync to vLLM 20250627 | 10 months ago |
| build | - | - | Build (aarch64-linux) | 10 months ago |
| compressed_tensors | - | - | Sync to vLLM 20250627 | 10 months ago |
| core | - | - | Sync to vLLM 20250627 | 10 months ago |
| cutlass_extensions | - | - | Sync to vLLM 20250627 | 10 months ago |
| cutlass_w8a8 | - | - | Sync to vLLM 20250627 | 10 months ago |
| fp8 | - | - | Sync to vLLM 20250627 | 10 months ago |
| gptq_marlin | - | - | Sync to vLLM 20250627 | 10 months ago |
| marlin | - | - | Sync to vLLM 20250627 | 10 months ago |
| tests | - | - | Sync to vLLM 20250627 | 10 months ago |
| torch-ext | - | - | Fix absolute imports | 10 months ago |
| .gitattributes | 1.56 kB | Safe | Build | over 1 year ago |
| LICENSE | 11.4 kB | Safe | Add cutlass_w8a8 | over 1 year ago |
| README.md | 196 Bytes | - | Update tag | 6 months ago |
| build.toml | 5.96 kB | Safe | Fix undefined symbol on CUDA 11.8 | 10 months ago |
| cuda_utils.h | 1.41 kB | Safe | Sync on vLLM 20240402 | about 1 year ago |
| dispatch_utils.h | 3.9 kB | Safe | Sync to vLLM 20250627 | 10 months ago |
| flake.lock | 4.5 kB | Safe | Prepare for Torch 2.8 | 10 months ago |
| flake.nix | 345 Bytes | Safe | Prepare for Torch 2.8 | 10 months ago |
| utils.cuh | 1.84 kB | Safe | Sync on vLLM 20240402 | about 1 year ago |
| vectorization.cuh | 878 Bytes | Safe | Sync to vLLM 20250627 | 10 months ago |
| vectorization_utils.cuh | 2.61 kB | Safe | Sync to vLLM 20250627 | 10 months ago |