Run:ai Model Streamer
The Run:ai Model Streamer is an open-source Python SDK, backed by a C++ engine, that accelerates loading large AI models onto accelerators such as GPUs. It streams tensors concurrently from local or object storage (S3, GCS, Azure Blob Storage) into accelerator memory, bypassing intermediate disk buffering, and is optimized for the SafeTensors file format. The current version is 0.15.8, and the project is under active development.
Warnings
- breaking The C++ backend of the streamer requires specific system libraries: `libcurl4` and `libssl1.1`. Without them, the Python SDK fails at runtime during model streaming. This is a common installation footgun, especially in minimal container images.
- gotcha When streaming from cloud object storage (S3, GCS, Azure Blob Storage), specific `runai-model-streamer-*` backend packages (e.g., `runai-model-streamer-gcs`) must be installed in addition to the core `runai-model-streamer` package. Furthermore, proper authentication credentials must be configured via environment variables or service account files for the SDK to access the storage buckets.
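Credential wiring for the cloud backends typically goes through each provider's standard environment variables. A minimal sketch, with placeholder values only (use a real secrets manager in practice; the variable names are the ones each cloud SDK conventionally reads, not streamer-specific settings):

```python
import os

# Placeholder credentials -- illustrative only, never hard-code real secrets.
os.environ["AWS_ACCESS_KEY_ID"] = "EXAMPLE_KEY_ID"          # S3 backend
os.environ["AWS_SECRET_ACCESS_KEY"] = "EXAMPLE_SECRET"      # S3 backend
os.environ["GOOGLE_APPLICATION_CREDENTIALS"] = "/path/to/service-account.json"  # GCS backend
os.environ["AZURE_STORAGE_CONNECTION_STRING"] = "example-connection-string"     # Azure Blob backend

# Set these before constructing the streamer so its backend picks them up.
print(os.environ["AWS_ACCESS_KEY_ID"])
```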
- deprecated Older versions of `runai-model-streamer` used with vLLM at `tensor-parallel-size > 1` exhibited pickling errors and other issues with distributed streaming across multiple GPUs; single-GPU loading was unaffected.
- gotcha The `Run:ai Model Streamer` is primarily optimized for the `SafeTensors` file format, which enables efficient zero-copy loading directly from storage. While it may handle other formats, performance benefits are most pronounced with `SafeTensors`.
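The SafeTensors layout is what makes this kind of streaming cheap: a fixed-size length prefix, a JSON header mapping tensor names to byte offsets, then raw tensor bytes that can be fetched with ranged reads. A stdlib-only sketch that hand-builds a minimal (simplified) SafeTensors file and reads one tensor back by offset, to illustrate the layout rather than replace the `safetensors` library:

```python
import json
import struct

# Header maps each tensor name to dtype, shape, and byte offsets into the
# data section. Offsets are what allow per-tensor ranged reads from storage.
header = {"tensor_key": {"dtype": "F32", "shape": [2, 2], "data_offsets": [0, 16]}}
header_bytes = json.dumps(header).encode("utf-8")
data = struct.pack("<4f", 1.0, 2.0, 3.0, 4.0)  # four float32 values = 16 bytes

with open("tiny.safetensors", "wb") as f:
    f.write(struct.pack("<Q", len(header_bytes)))  # 8-byte little-endian header size
    f.write(header_bytes)
    f.write(data)

# A reader fetches the header alone, then issues a ranged read per tensor --
# the access pattern a streamer exploits to pull tensors concurrently.
with open("tiny.safetensors", "rb") as f:
    (n,) = struct.unpack("<Q", f.read(8))
    meta = json.loads(f.read(n))
    start, end = meta["tensor_key"]["data_offsets"]
    f.seek(8 + n + start)
    tensor_bytes = f.read(end - start)

print(meta["tensor_key"]["shape"], len(tensor_bytes))  # [2, 2] 16
```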
- gotcha Setting environment variables like `RUNAI_STREAMER_CONCURRENCY` and `RUNAI_STREAMER_MEMORY_LIMIT` can significantly impact performance and resource consumption. Incorrect tuning can lead to suboptimal loading times or out-of-memory issues.
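These knobs are plain environment variables, so tuning is a matter of exporting them before the streamer is constructed. A sketch with illustrative values (the right numbers depend on your storage bandwidth and available host RAM):

```python
import os

# Illustrative tuning values -- adjust for your storage and host memory.
os.environ["RUNAI_STREAMER_CONCURRENCY"] = "16"  # number of parallel read workers
os.environ["RUNAI_STREAMER_MEMORY_LIMIT"] = str(4 * 1024**3)  # 4 GiB staging budget

# Must be set before the streamer's backend initializes, i.e. before
# constructing SafetensorsStreamer in the same process.
print(os.environ["RUNAI_STREAMER_CONCURRENCY"],
      os.environ["RUNAI_STREAMER_MEMORY_LIMIT"])
```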
Install
- pip install runai-model-streamer
- pip install runai-model-streamer-gcs
- pip install runai-model-streamer-s3
- pip install runai-model-streamer-azure
- pip install vllm[runai]
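With `vllm[runai]` installed, vLLM can load weights through the streamer via its `--load-format` flag. A CLI sketch; the model name and the extra-config values are illustrative, not required settings:

```shell
# Serve a model with weights loaded by the Run:ai streamer.
# --model-loader-extra-config passes backend tuning as JSON.
vllm serve meta-llama/Llama-3.1-8B-Instruct \
  --load-format runai_streamer \
  --model-loader-extra-config '{"concurrency": 16}'
```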
Imports
- SafetensorsStreamer
from runai_model_streamer import SafetensorsStreamer
Quickstart
from runai_model_streamer import SafetensorsStreamer

# For a runnable local demo, a dummy safetensors file is created below.
# Replace 'model.safetensors' with a real model path, or with a cloud URI
# once the matching backend package and credentials are configured.
try:
    import torch
    from safetensors.torch import save_file

    # Create a small dummy model file so the quickstart runs end to end.
    save_file({"tensor_key": torch.randn(10, 10)}, "model.safetensors")
    file_path = "model.safetensors"
    print(f"Streaming from: {file_path}")

    device = "cuda:0" if torch.cuda.is_available() else "cpu"
    with SafetensorsStreamer() as streamer:
        streamer.stream_file(file_path)
        for name, tensor in streamer.get_tensors():
            accel_tensor = tensor.to(device)  # move to GPU when available
            print(f"Streamed tensor: {name}, shape: {tuple(accel_tensor.shape)}")
    print("Streamer context closed.")
except ImportError:
    print("To run this quickstart with a dummy file, install 'safetensors' and 'torch':")
    print("  pip install safetensors torch")
    print("Alternatively, replace 'model.safetensors' with an actual model file path.")
except Exception as e:
    print(f"An error occurred during quickstart: {e}")
    print("Check that file_path is correct and that libcurl4 and libssl1.1 are installed.")
    print("If streaming from cloud storage, verify authentication environment variables are set.")