MosaicML Streaming
JSON →MosaicML Streaming (StreamingDataset) provides PyTorch-compatible datasets that can be efficiently streamed from cloud-based object stores (S3, GCS, Azure Blob Storage, Hugging Face Hub) or local filesystems. It enables training on large datasets without needing to download them entirely beforehand, improving data loading performance and reducing storage costs. The library is actively maintained with frequent updates, currently at version 0.13.0.
Traffic · last 30 days ↑123% vs prev 7d
total hits 45
actors 12 distinct systems
last hit 1d ago Amazonbot
top countries 🇺🇸 United States · 🇸🇬 Singapore · 🇩🇪 Germany · 🇫🇷 France · 🇨🇦 Canada
Resources
homepagestreaming.docs.mosaicml.com ↗
API endpoints
full doc /v1/registry/mosaicml-streaming
compatibility /v1/registry/mosaicml-streaming/compatibility