Data Diff (collate-data-diff)

library 0.11.10 · python

✓ verified Jun 28, 2026

collate-data-diff (also known as data-diff) is a Python library and command-line tool for comparing rows across two databases or tables and reporting the differences. It is designed for performance and scalability on large datasets, so that data discrepancies can be detected quickly and accurately. The current version is 0.11.10, and the project has an active release cadence with frequent updates.
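
To make the idea of a row-level diff concrete, here is a minimal sketch using only Python's standard-library sqlite3 module. It does not use collate-data-diff and is not the library's API or algorithm: the function name diff_rows, the table names src and dst, and the "+"/"-" sign convention are all assumptions chosen for illustration. It loads both tables fully into memory, which a tool aimed at large datasets would presumably avoid.

```python
import sqlite3


def diff_rows(conn, table_a, table_b):
    """Yield ('-', row) for rows found only in table_a and
    ('+', row) for rows found only in table_b.

    Hypothetical helper for illustration; not part of collate-data-diff.
    Table names are interpolated directly, so pass trusted names only.
    """
    rows_a = set(conn.execute(f"SELECT * FROM {table_a}"))
    rows_b = set(conn.execute(f"SELECT * FROM {table_b}"))
    # Sort each side so the output order is deterministic.
    for row in sorted(rows_a - rows_b):
        yield ("-", row)
    for row in sorted(rows_b - rows_a):
        yield ("+", row)


# Build two small in-memory tables that differ in two places.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE src (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE dst (id INTEGER PRIMARY KEY, name TEXT);
    INSERT INTO src VALUES (1, 'alice'), (2, 'bob'), (3, 'carol');
    INSERT INTO dst VALUES (1, 'alice'), (2, 'bobby'), (4, 'dave');
    """
)

result = list(diff_rows(conn, "src", "dst"))
for sign, row in result:
    print(sign, row)
```

In this sketch a changed row (id 2) shows up as one "-" entry and one "+" entry, while rows present on only one side appear once.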

indexed Sun Apr 12 · updated Sat Jul 11

Resources

github github.com/datafold/data-diff

package pypi.org/project/collate-data-diff/

API endpoints

full doc /v1/registry/collate-data-diff

install /v1/registry/collate-data-diff/install

compatibility /v1/registry/collate-data-diff/compatibility