Jaro-Winkler Distance
pyjarowinkler is a Python library that computes the Jaro-Winkler distance, a string metric that scores how similar two strings are. It is useful for tasks such as spell-checking, record linkage, and data deduplication. The current version is 3.0.0, and releases are reasonably regular.
Warnings
- breaking: The primary function `get_jaro_distance` was renamed to `jaro_winkler_distance`.
- breaking: The minimum required Python version was raised from 3.6 to 3.10.
- gotcha: The package name is `pyjarowinkler`, but the main module to import from is `jarowinkler`.
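The Python 3.10 minimum noted above can be checked before importing the library. The following is a minimal sketch, not part of pyjarowinkler: `python_is_supported` and `MINIMUM_PYTHON` are hypothetical names introduced here for illustration.

```python
import sys

# Minimum Python version stated in the warnings above.
MINIMUM_PYTHON = (3, 10)


def python_is_supported(version_info=None, minimum=MINIMUM_PYTHON):
    """Return True if the given (or running) interpreter meets the minimum.

    Hypothetical helper for illustration; not part of pyjarowinkler.
    """
    if version_info is None:
        version_info = sys.version_info
    # Compare only (major, minor), e.g. (3, 9) < (3, 10).
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    if not python_is_supported():
        print("This Python is older than 3.10; upgrade before installing.")
```

Comparing tuples avoids the string-comparison trap where "3.9" sorts after "3.10".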
Install
pip install pyjarowinkler
Imports
- jaro_winkler_distance
  Incorrect (uses the package name, not the module): from pyjarowinkler import jaro_winkler_distance
  Correct: from jarowinkler import jaro_winkler_distance
Quickstart
from jarowinkler import jaro_winkler_distance

# Two strings that differ only by a transposition ("TH" vs. "HT").
string1 = "MARTHA"
string2 = "MARHTA"
similarity_score = jaro_winkler_distance(string1, string2)
print(f"Jaro-Winkler Similarity between '{string1}' and '{string2}': {similarity_score}")

# Two strings of different lengths that share only a one-character prefix.
string3 = "DWAYNE"
string4 = "DUANE"
similarity_score_2 = jaro_winkler_distance(string3, string4)
print(f"Jaro-Winkler Similarity between '{string3}' and '{string4}': {similarity_score_2}")
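To show what the score measures, here is a pure-Python sketch of the textbook Jaro-Winkler similarity, applied to the two Quickstart pairs. It is a reference for the general algorithm, not pyjarowinkler's implementation. The function names, the prefix weight of 0.1, and the prefix cap of 4 are assumptions taken from the standard definition, and the library's output may differ in rounding or options.

```python
def jaro_similarity(s1: str, s2: str) -> float:
    """Textbook Jaro similarity in [0, 1]; 1.0 means identical."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0

    # Characters match if they are equal and at most `window` positions apart.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        start = max(0, i - window)
        end = min(len2, i + window + 1)
        for j in range(start, end):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0

    # Transpositions: matched characters that appear in a different order,
    # counted in pairs (hence the division by 2).
    seq1 = [c for c, m in zip(s1, matched1) if m]
    seq2 = [c for c, m in zip(s2, matched2) if m]
    transpositions = sum(a != b for a, b in zip(seq1, seq2)) // 2

    return (
        matches / len1
        + matches / len2
        + (matches - transpositions) / matches
    ) / 3


def jaro_winkler_similarity(s1: str, s2: str,
                            prefix_weight: float = 0.1,
                            max_prefix: int = 4) -> float:
    """Jaro similarity boosted for a shared prefix (standard Winkler rule)."""
    jaro = jaro_similarity(s1, s2)
    prefix = 0
    for a, b in zip(s1[:max_prefix], s2[:max_prefix]):
        if a != b:
            break
        prefix += 1
    return jaro + prefix * prefix_weight * (1 - jaro)


if __name__ == "__main__":
    for a, b in [("MARTHA", "MARHTA"), ("DWAYNE", "DUANE")]:
        print(f"{a} vs {b}: {jaro_winkler_similarity(a, b):.4f}")
```

Under this definition, MARTHA/MARHTA scores about 0.9611 (six matches, one transposition, shared prefix "MAR") and DWAYNE/DUANE scores 0.84 (four matches, no transpositions, shared prefix "D").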