Run:ai Model Streamer
JSON →The Run:ai Model Streamer is an open-source Python SDK designed to accelerate the loading of large AI models onto accelerators, such as GPUs or TPUs. It achieves this by streaming tensors directly from various storage locations (local, S3, GCS, Azure Blob Storage) to GPU memory, bypassing local disk buffering, and optimizing for the SafeTensors file format. The current version is 0.15.8, with releases occurring somewhat regularly, indicating active development.
Traffic · last 30 days ↑320% vs prev 7d
total hits 29
actors 9 distinct systems
last hit 1d ago human
top countries 🇺🇸 United States · 🇩🇪 Germany · 🇨🇦 Canada · 🇳🇴 Norway · 🇸🇬 Singapore
API endpoints
full doc /v1/registry/runai-model-streamer
compatibility /v1/registry/runai-model-streamer/compatibility