Hugging Face Datasets

JSON →
library 4.6.0 ·python
verified Jun 9, 2026 install stale

HuggingFace library for loading, processing, and sharing datasets for ML. Provides load_dataset() for one-line access to 100k+ public datasets on the Hub, plus local file loading (CSV, JSON, Parquet, Arrow, audio, image, etc.). Built on Apache Arrow for memory-efficient, zero-copy data access. Package name on PyPI is 'datasets' (not 'huggingface-datasets'). Import name is also 'datasets'. CRITICAL: datasets 4.0 (July 2025) removed dataset loading scripts and trust_remote_code entirely. Many older community datasets relying on .py loading scripts now fail with datasets>=4.

total hits 33
actors 9 distinct systems
last hit 1h ago ClaudeBot
OAI-SearchBot
5
Amazonbot
4
MetaBot
4
Script
2
ClaudeBot
1
ChatGPT-User
1
Humans
5

top countries 🇺🇸 United States · 🇩🇪 Germany · 🇨🇦 Canada · 🇪🇸 Spain · BD