casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
Builder
casper-hansen
casper-hansen • individual
Stars
2,340
Using upstream star count
Forks
302
Using upstream fork count
Open Issues
0
Activity Score
0/100
0 commits in 30d
Created
Aug 25, 2023
Project creation date
It is no secret that maintaining a project such as AutoAWQ that has 2+ million downloads, 7000+ models on Huggingface, and 2.1k stars is hard for a solo developer who is doing this in their free time.
Unmapped
category
Deployment Context
Modalities
Skill Areas
tag
Updated 1 years ago
7 Days
0
30 Days
0
90 Days
0
pgvector cosine similarity · $0
Loading…