AutoGPTQ

library 0.7.1 · python
verified May 1, 2026

AutoGPTQ is an easy-to-use LLM quantization package based on the GPTQ algorithm. It provides user-friendly APIs for quantizing large language models and running them with reduced memory usage. The current version, 0.7.1, supports loading sharded quantized checkpoints and Gemma models. The release cadence is irregular, with major features arriving in point releases.
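As a rough illustration of the quantize-then-run workflow described above, the sketch below follows the basic usage pattern from the AutoGPTQ project's documentation (`BaseQuantizeConfig`, `from_pretrained`, `quantize`, `save_quantized`, `from_quantized`). The function names `estimate_weight_gib` and `quantize_and_reload`, the calibration sentence, and the default device are hypothetical choices for this example, not part of the library. The memory estimate covers weights only and ignores quantization scales, zero points, activations, and the KV cache, so treat it as a lower bound. Check the calls against the version you have installed before relying on them.

```python
def estimate_weight_gib(num_params: int, bits: int) -> float:
    """Rough weight-only footprint in GiB (hypothetical helper).

    Ignores scales, zero points, activations, and the KV cache.
    """
    return num_params * bits / 8 / 1024**3


def quantize_and_reload(model_id: str, out_dir: str, device: str = "cuda:0"):
    """Quantize a model to 4-bit, save it, and load the quantized copy.

    Hypothetical wrapper for illustration. Imports are local so this module
    can be imported on a machine without auto-gptq installed.
    """
    from transformers import AutoTokenizer
    from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

    tokenizer = AutoTokenizer.from_pretrained(model_id, use_fast=True)
    # A single calibration sample keeps the sketch short (an assumption);
    # real runs typically use a larger, representative calibration set.
    examples = [
        tokenizer("AutoGPTQ is a quantization package based on the GPTQ algorithm.")
    ]

    # 4-bit weights with a group size of 128, as in the project's basic example.
    quantize_config = BaseQuantizeConfig(bits=4, group_size=128, desc_act=False)

    model = AutoGPTQForCausalLM.from_pretrained(model_id, quantize_config)
    model.quantize(examples)
    model.save_quantized(out_dir)

    # Reload the quantized checkpoint onto the target device for inference.
    return AutoGPTQForCausalLM.from_quantized(out_dir, device=device)


if __name__ == "__main__":
    fp16 = estimate_weight_gib(7_000_000_000, 16)
    int4 = estimate_weight_gib(7_000_000_000, 4)
    print(f"7B weights: ~{fp16:.1f} GiB at 16-bit, ~{int4:.1f} GiB at 4-bit")
```

Calling `quantize_and_reload` requires the `auto-gptq` and `transformers` packages and, in practice, a CUDA-capable GPU; the estimate helper runs anywhere and only shows why 4-bit weights take about a quarter of the space of 16-bit weights.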
