LMDeploy
JSON →LMDeploy is a toolkit for compressing, deploying, and serving large language models (LLMs). It supports efficient inference with quantization, continuous batching, and various backends (e.g., PyTorch, TensorRT-LLM). The current version is 0.12.3, with frequent releases following the development of dependencies and model support.
Traffic · last 30 days ↓33% vs prev 7d
total hits 13
actors 6 distinct systems
last hit 4d ago Bingbot
top countries 🇺🇸 United States · 🇨🇦 Canada · 🇪🇸 Spain · 🇫🇷 France · 🇮🇳 India