LMDeploy

library 0.12.3 · python
verified May 1, 2026

LMDeploy is a toolkit for compressing, deploying, and serving large language models (LLMs). It supports efficient inference through quantization, continuous batching, and two inference backends: TurboMind and PyTorch. The current version is 0.12.3; releases are frequent and track dependency updates and support for new models.
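
As a rough illustration of offline inference, the sketch below builds an engine config for either the TurboMind or the PyTorch backend and runs a batch of prompts through `lmdeploy.pipeline`. The helper names `engine_spec` and `generate`, the model ID, and the `tp` and `session_len` values are placeholders chosen for this example, not recommendations from the project. `lmdeploy` is imported lazily, so the module loads even where the package or a GPU is unavailable.

```python
from typing import Any

# Engine config class names exported by lmdeploy, keyed by backend name.
ENGINE_CONFIGS = {
    "turbomind": "TurbomindEngineConfig",
    "pytorch": "PytorchEngineConfig",
}


def engine_spec(backend: str = "turbomind", tp: int = 1,
                session_len: int = 4096) -> tuple[str, dict[str, Any]]:
    """Return the config class name and keyword arguments for a backend.

    Hypothetical helper for this example; tp is the tensor-parallel degree
    and session_len the maximum context length (both assumed values).
    """
    if backend not in ENGINE_CONFIGS:
        raise ValueError(f"unknown backend {backend!r}; "
                         f"expected one of {sorted(ENGINE_CONFIGS)}")
    return ENGINE_CONFIGS[backend], {"tp": tp, "session_len": session_len}


def generate(model: str, prompts: list[str], backend: str = "turbomind",
             max_new_tokens: int = 128) -> list[str]:
    """Run a batch of prompts through an LMDeploy pipeline and return the text."""
    import lmdeploy  # lazy import: requires `pip install lmdeploy` and a supported GPU

    cls_name, kwargs = engine_spec(backend)
    backend_config = getattr(lmdeploy, cls_name)(**kwargs)
    pipe = lmdeploy.pipeline(model, backend_config=backend_config)
    gen_config = lmdeploy.GenerationConfig(max_new_tokens=max_new_tokens)
    responses = pipe(prompts, gen_config=gen_config)
    return [r.text for r in responses]


if __name__ == "__main__":
    # The model ID is a placeholder; substitute any model LMDeploy supports.
    for text in generate("internlm/internlm2_5-7b-chat",
                         ["What is continuous batching?"]):
        print(text)
```

Passing `backend="pytorch"` selects the PyTorch engine instead; which backend supports a given model depends on the installed release, so check the supported-models list for your version.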
