LLM Compressor
JSON →LLM Compressor (current version 0.10.0.1) is a Python library for compressing large language models, offering both training-aware and post-training techniques. Built on PyTorch and HuggingFace Transformers, it provides a flexible and user-friendly interface for researchers and practitioners to quickly experiment with techniques like quantization and sparsity. The library maintains an active development pace with frequent patch releases and regular feature updates.
Traffic · last 30 days ↑150% vs prev 7d
total hits 13
actors 6 distinct systems
last hit 3d ago MetaBot
top countries 🇺🇸 United States · 🇫🇷 France · 🇨🇦 Canada · 🇩🇪 Germany
API endpoints
full doc /v1/registry/llmcompressor
compatibility /v1/registry/llmcompressor/compatibility