RecordLinkage

library 0.16 ·python

✓ verified May 20, 2026

RecordLinkage is a powerful and modular Python toolkit for record linkage and duplicate detection. It provides methods for indexing, comparing records with various similarity measures, and classifying matches, leveraging pandas and numpy for efficient data handling. The library is actively maintained (version 0.16) and suitable for research and linking small to medium-sized datasets.

Traffic · last 30 days ↑900% vs prev 7d · indexed Thu Apr 09 · updated Sun May 24

total hits 20

actors 9 distinct systems

last hit 1d ago Googlebot

GPTBot

6

Script

4

PerplexityBot

2

ClaudeBot

1

ChatGPT-User

1

Search engines

3

Humans

1

top countries 🇺🇸 United States · 🇩🇪 Germany · 🇮🇳 India · LT · 🇨🇦 Canada

Resources

githubgithub.com/J535D165/recordlinkage ↗

packagepypi.org/project/recordlinkage/ ↗

homepagerecordlinkage.readthedocs.io/ ↗

API endpoints

full doc /v1/registry/recordlinkage

install /v1/registry/recordlinkage/install

compatibility /v1/registry/recordlinkage/compatibility