YData Profiling

4.18.1 · active · verified Sat Apr 11

ydata-profiling is a powerful Python library that automates the generation of comprehensive exploratory data analysis (EDA) reports for pandas DataFrames. It provides detailed statistics, visualizations, and interactive widgets to understand data quality and distributions. The library is actively maintained with frequent minor releases, typically monthly or bi-monthly.

Warnings

Install

Imports

Quickstart

This quickstart demonstrates how to create a pandas DataFrame, generate a comprehensive profile report using `ProfileReport`, and save it to an HTML file. For interactive display in Jupyter Notebooks, use `profile.to_widgets()`.

import pandas as pd
from ydata_profiling import ProfileReport

# Sample DataFrame
data = {
    'col1': [1, 2, 3, 4, 5],
    'col2': ['A', 'B', 'A', 'C', 'B'],
    'col3': [1.1, 2.2, None, 4.4, 5.5]
}
df = pd.DataFrame(data)

# Generate the profile report
profile = ProfileReport(df, title="My DataFrame Profile", explorative=True)

# Save report to an HTML file
profile.to_file("your_report.html")

# If running in a Jupyter Notebook, you can display widgets directly:
# profile.to_widgets()

print("Profile report saved to your_report.html")

view raw JSON →