Hugging Face Datasets
Hugging Face library for loading, processing, and sharing datasets for ML. Provides load_dataset() for one-line access to 100k+ public datasets on the Hub, plus local file loading (CSV, JSON, Parquet, Arrow, audio, image, etc.). Built on Apache Arrow for memory-efficient, zero-copy data access. The package name on PyPI is 'datasets' (not 'huggingface-datasets'), and the import name is also 'datasets'. CRITICAL: datasets 4.0 (July 2025) removed dataset loading scripts and the trust_remote_code option entirely. Many older community datasets that rely on .py loading scripts now fail with datasets>=4.
API endpoints
full doc /v1/registry/datasets
install /v1/registry/datasets/install
compatibility /v1/registry/datasets/compatibility