Cache-DiT
Cache-DiT is a PyTorch-native inference engine for Diffusion Transformers (DiTs). It provides hybrid cache acceleration (DBCache, TaylorSeer, SCM), parallelism optimizations (Context, Tensor, 2D/3D), and low-bit quantization (FP8, INT8, INT4). The library integrates with Hugging Face Diffusers, SGLang Diffusion, vLLM-Omni, and ComfyUI to speed up image and video generation. It is currently at version 1.3.5 and maintains an active release cadence with frequent updates and hotfixes.
API endpoints
full doc /v1/registry/cache-dit
install /v1/registry/cache-dit/install
compatibility /v1/registry/cache-dit/compatibility
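The endpoint paths above are relative, and the page does not name the registry host. As a minimal sketch, assuming a placeholder base URL (`https://registry.example` is hypothetical, as are the dictionary keys and the `endpoint_url` helper), the three paths could be turned into full URLs like this:

```python
from urllib.parse import urljoin

# Paths copied from the endpoint list above; the keys are illustrative labels,
# not names defined by the registry.
ENDPOINTS = {
    "full_doc": "/v1/registry/{package}",
    "install": "/v1/registry/{package}/install",
    "compatibility": "/v1/registry/{package}/compatibility",
}


def endpoint_url(base_url: str, kind: str, package: str = "cache-dit") -> str:
    """Join a registry base URL (an assumption, not given on the page) with a listed path."""
    path = ENDPOINTS[kind].format(package=package)
    return urljoin(base_url, path)


if __name__ == "__main__":
    # Hypothetical host used only for illustration.
    for kind in ENDPOINTS:
        print(kind, endpoint_url("https://registry.example", kind))
```

Keeping the paths in one table means a change of host or package name touches a single argument rather than three hard-coded strings.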