Cache-DiT

JSON →
library 1.3.5 ·python
verified May 24, 2026

Cache-DiT is a PyTorch-native inference engine designed for Diffusion Transformers (DiTs). It provides hybrid cache acceleration (DBCache, TaylorSeer, SCM), comprehensive parallelism optimizations (Context, Tensor, 2D/3D), and low-bit quantization (FP8, INT8, INT4). The library integrates seamlessly with Hugging Face Diffusers, SGLang Diffusion, vLLM-Omni, and ComfyUI to deliver significant speedups for image and video generation. Currently at version 1.3.5, it maintains an active release cadence with frequent updates and hotfixes.

total hits 31
actors 10 distinct systems
last hit 1d ago OAI-SearchBot
OAI-SearchBot
4
MetaBot
4
ByteDance
3
GPTBot
2
Script
2
ChatGPT-User
1
Search engines
6

top countries 🇺🇸 United States · 🇩🇪 Germany · 🇸🇬 Singapore · 🇨🇦 Canada · 🇫🇷 France