AutoAWQ

library · 0.2.9 · python · deprecated
verified Apr 27, 2026

AutoAWQ implements the AWQ (Activation-aware Weight Quantization) algorithm for 4-bit weight quantization of large language models, with up to a 2x speedup during inference. The library is deprecated as of v0.2.9 (April 2025); the vLLM project has since adopted the technique. It was last tested with Torch 2.6.0 and Transformers 4.51.3.
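For orientation, a minimal sketch of a typical quantization run, assuming `autoawq` 0.2.9 is installed alongside the Torch and Transformers versions listed above. The helper name `quantize_model` and the constant `QUANT_CONFIG` are illustrative and not part of the library. The config values mirror the library's published quick-start example and may need adjusting per model.

```python
# Quantization settings; QUANT_CONFIG is an illustrative name, not a library constant.
QUANT_CONFIG = {
    "zero_point": True,    # asymmetric quantization with a per-group zero point
    "q_group_size": 128,   # one scale/zero point shared by each group of 128 weights
    "w_bit": 4,            # 4-bit weights
    "version": "GEMM",     # kernel variant used for the packed weights
}


def quantize_model(model_path: str, quant_path: str) -> None:
    """Quantize a Hugging Face causal LM with AWQ and save it to quant_path.

    Hypothetical helper for illustration; it only wraps AutoAWQ calls.
    """
    # Imported lazily so this file loads even where the packages are absent.
    from awq import AutoAWQForCausalLM
    from transformers import AutoTokenizer

    model = AutoAWQForCausalLM.from_pretrained(model_path)
    tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

    # Runs calibration and quantizes the model's linear layers to 4 bits.
    model.quantize(tokenizer, quant_config=QUANT_CONFIG)

    # Persist the quantized weights and the tokenizer side by side.
    model.save_quantized(quant_path)
    tokenizer.save_pretrained(quant_path)
```

Calling `quantize_model` with a Hugging Face model ID and an output directory would run the full pipeline. In practice this needs a CUDA-capable GPU and enough memory to load the unquantized model. Because the library is deprecated, new projects may prefer the quantization tooling maintained under the vLLM project.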
