Simhash Python Library
JSON →The `simhash` library provides a Python implementation of the Simhash Algorithm, a technique for quickly finding near-duplicate documents or comparing the similarity of two texts or data objects. It's highly useful for tasks like large-scale content deduplication, spam detection, and content recommendation, offering a fast way to identify perceptually similar items. The current version is 2.1.2, and it follows an irregular release cadence based on contributions and bug fixes.
Traffic · last 30 days ↓12% vs prev 7d
total hits 17
actors 7 distinct systems
last hit 3d ago MJ12bot
top countries 🇺🇸 United States · 🇨🇦 Canada · 🇫🇷 France · 🇳🇴 Norway · 🇩🇪 Germany
Resources
packagepypi.org/project/simhash/ ↗
API endpoints
full doc /v1/registry/simhash
install /v1/registry/simhash/install
compatibility /v1/registry/simhash/compatibility