Datazets

1.1.3 · active · verified Thu Apr 16

Datazets is a Python package designed to easily import a collection of well-known example data sets, often used for machine learning, data analysis, and educational purposes. It provides a simple API to access these datasets without needing to manually download or preprocess them. The current version is 1.1.3, released on June 21, 2025, and it appears to have an active release cadence.

Common errors

Warnings

Install

Imports

Quickstart

This quickstart demonstrates how to import the `datazets` library and load the 'titanic' dataset into a pandas DataFrame using the `get()` function.

import datazets as dz

# Load a well-known dataset, e.g., 'titanic'
df = dz.get('titanic')

print(f"Dataset 'titanic' loaded with shape: {df.shape}")
print(df.head())

view raw JSON →