SGLang

JSON →
library 0.5.9 ·python
verified Jun 28, 2026 install stale

SGLang is a high-performance serving framework for large language models (LLMs) and vision-language models (VLMs), implemented as a domain-specific language embedded in Python. It optimizes LLM inference through advanced techniques like RadixAttention for KV cache reuse, continuous batching, speculative decoding, and various parallelization strategies. The library supports a broad range of models from Hugging Face and offers compatibility with OpenAI APIs. SGLang maintains an active development pace with frequent, often monthly or bi-monthly, releases and is currently at version 0.5.9.

total hits 20
actors 4 distinct systems
last hit 2d ago AhrefsBot
GPTBot
4
Amazonbot
4
Script
1
Humans
8

top countries 🇺🇸 United States · 🇸🇬 Singapore · 🇨🇦 Canada · 🇩🇪 Germany · 🇬🇧 United Kingdom