Mooncake Transfer Engine
Mooncake Transfer Engine is a Python binding (built with pybind11) for the core data transfer component of the Mooncake project. Mooncake itself is a KVCache-centric disaggregated architecture designed to optimize Large Language Model (LLM) inference. The Transfer Engine provides a high-performance, unified interface for batched data movement across storage devices and network links, supporting protocols such as TCP, RDMA, CXL/shared memory, and NVMe over Fabric. It is actively maintained, with frequent updates, and is integrated into LLM serving frameworks such as SGLang and vLLM.
Resources
API endpoints
compatibility /v1/registry/mooncake-transfer-engine/compatibility
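The compatibility endpoint listed above could be queried along the following lines. This is a minimal sketch, not the registry's documented client: the listing gives only the path, so the base URL (`https://registry.example`) is a placeholder, the helper names `compatibility_url` and `fetch_compatibility` are hypothetical, and the response is assumed to be JSON.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Path template taken from the listing; only the package name varies.
COMPATIBILITY_PATH = "/v1/registry/{package}/compatibility"


def compatibility_url(base_url: str, package: str) -> str:
    """Build the compatibility endpoint URL for a registry package.

    base_url is an assumption: the listing does not state the host.
    """
    path = COMPATIBILITY_PATH.format(package=quote(package, safe=""))
    return base_url.rstrip("/") + path


def fetch_compatibility(base_url: str, package: str, timeout: float = 10.0):
    """Fetch the endpoint and decode the body, assuming it is JSON."""
    with urlopen(compatibility_url(base_url, package), timeout=timeout) as resp:
        return json.load(resp)


# Placeholder host; substitute the real registry origin before fetching.
url = compatibility_url("https://registry.example/", "mooncake-transfer-engine")
print(url)
```

Keeping URL construction separate from the network call lets the path be checked without contacting the registry; `fetch_compatibility` should only be called once the real host is known.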