DataTile

raw JSON →
1.0.3 verified Mon Apr 27 auth: no python

A library for managing, summarizing, and visualizing data, built on top of pandas. Current version 1.0.3, released sporadically with maintenance updates.

pip install datatile
error ImportError: No module named 'datatile'
cause Package not installed or installed in wrong environment.
fix
Run pip install datatile in the correct Python environment.
error TypeError: 'NoneType' object is not iterable when calling Summary(df)
cause DataFrame contains all non-numeric columns; Summary returns None for some stats.
fix
Ensure DataFrame has at least one numeric column.
gotcha DataTile summary methods may silently drop non-numeric columns. Always check that your DataFrame has numeric types for intended columns.
fix Use df.select_dtypes(include='number') before passing to Summary.
deprecated The 'Histogram' class was deprecated in 1.0.0 and removed. Use matplotlib or seaborn directly.
fix Replace `from datatile import Histogram` with direct plotting via matplotlib.
gotcha The `to_dict()` method may return nested dictionaries with inconsistent key naming (snake_case vs camelCase) depending on version. Check the output carefully.
fix Inspect keys after calling to_dict() and normalize as needed.

Create a summary object from a DataFrame and print a dict of statistics.

import pandas as pd
from datatile import Summary

df = pd.DataFrame({'a': [1,2,3], 'b': [4,5,6]})
summary = Summary(df)
print(summary.to_dict())