Intake

2.0.9 · active · verified Mon Apr 13

Intake is a lightweight Python package for finding, investigating, loading, and distributing data. It provides a common API for loading data from a wide variety of sources (e.g., CSV, NetCDF, SQL, HDF5, Parquet, Zarr) and enables the creation and management of data catalogs. The current version is 2.0.9, and the project is in a stable maintenance phase for its 2.x series, with less frequent but significant updates.

Warnings

Install

Imports

Quickstart

This quickstart demonstrates how to open a remote Intake catalog and load a dataset (US States data) into a Pandas DataFrame.

import intake

# Open a public example catalog
catalog = intake.open_catalog("https://raw.githubusercontent.com/intake/intake-examples/master/catalogs/us_states.yml")

# Access a data source from the catalog
df = catalog.states.read()

print(df.head())

view raw JSON →