PySpark HNSW Library
pyspark-hnsw is a Python library that provides a distributed implementation of Hierarchical Navigable Small World (HNSW) graphs for Approximate Nearest Neighbor (ANN) search on Apache Spark. It enables efficient vector similarity search over large datasets from a PySpark environment, leveraging Spark's distributed processing. The current stable version on PyPI is 1.1.0; releases arrive at a moderate cadence, with minor updates in recent months.
Warnings
- gotcha There is a discrepancy between the latest PyPI version (1.1.0) and the latest GitHub release (1.2.1). Check which version you are actually installing and which features and fixes it carries.
- gotcha Building and querying HNSW indices, especially with high dimensionality or large datasets, can be memory and CPU intensive. Adjust Spark executor memory (`spark.executor.memory`), number of partitions (`setNumPartitions`), and HNSW parameters (`setM`, `setEf`) carefully.
- breaking Version 1.2.0 (not yet published to PyPI, which is still at 1.1.0) repackages classes (`Repackage classes to avoid JPMS issues`). While this primarily affects Java Module System users, it may alter internal class paths or dependencies, which could indirectly impact complex PySpark setups or users relying on specific internal JAR references.
- gotcha The `index_path` used to build the index must be accessible and writable by all Spark executors, and it should typically point to a distributed file system like HDFS, S3, or similar. Using a local path will store the index only on the driver or the first executor, which is not suitable for distributed use.
- gotcha Ensure your vectors are in a format compatible with `pyspark-hnsw`, typically `ArrayType(FloatType)`. Mismatched data types can lead to errors during index building or querying.
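The index-path gotcha above can be guarded against before an expensive build. The following is a hypothetical helper, not part of pyspark-hnsw, and the scheme list is illustrative rather than exhaustive:

```python
from urllib.parse import urlparse

# Illustrative set of URI schemes that indicate a shared/distributed filesystem.
DISTRIBUTED_SCHEMES = {"hdfs", "s3", "s3a", "s3n", "gs", "abfs", "abfss", "wasb", "wasbs"}

def is_distributed_path(path: str) -> bool:
    """Return True if the path's URI scheme points at a distributed filesystem."""
    return urlparse(path).scheme in DISTRIBUTED_SCHEMES

print(is_distributed_path("hdfs://namenode:8020/indices/hnsw"))  # True
print(is_distributed_path("hnsw_index_test_dir"))                # False: local path
```

A check like this is only a heuristic (a local path can still be a shared mount), but it catches the common mistake of passing a bare relative path on a multi-node cluster.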
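To avoid the vector-type mismatch described above, coerce vectors to plain Python floats before building the DataFrame. A minimal sketch; the helper name is our own, not a library API:

```python
def to_float_vector(values):
    """Coerce a sequence (e.g. a NumPy array row) into a plain list of Python
    floats, matching the ArrayType(FloatType()) column layout the index expects."""
    return [float(x) for x in values]

vec = to_float_vector([1, 2.5, 3])
print(vec)  # [1.0, 2.5, 3.0]
```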
Install
pip install pyspark-hnsw
Imports
- HnswIndex
from pyspark_hnsw import HnswIndex
Quickstart
from pyspark import SparkConf, SparkContext
from pyspark.sql import SparkSession
from pyspark_hnsw import HnswIndex
import numpy as np
import os
# Configure Spark (local mode for example)
conf = SparkConf().setAppName("HnswQuickstart").setMaster("local[*]")
sc = SparkContext(conf=conf)
spark = SparkSession(sc)
# Create some sample data with 128-dimensional vectors
data = [(i, [float(x) for x in np.random.rand(128)]) for i in range(1000)]
df = spark.createDataFrame(data, ["id", "vector"])
# Define a path for the index (local or distributed filesystem like HDFS/S3)
# Ensure this path is writable and accessible by Spark workers
index_path = "hnsw_index_test_dir"
# Clean up any previous index so the example is repeatable
if os.path.exists(index_path):
    import shutil
    shutil.rmtree(index_path)
# Build the HNSW index
hnsw_index = HnswIndex(spark, "id", "vector", index_path) \
    .setM(16) \
    .setEf(100) \
    .setNumPartitions(10) \
    .setDistanceType("cosine") \
    .build(df)
# Define a query vector
query_vector = [float(x) for x in np.random.rand(128)]
num_neighbors = 5
# Find nearest neighbors
result = hnsw_index.findNearestNeighbors(query_vector, num_neighbors)
result.show()
# Stop the Spark session
spark.stop()
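Because HNSW search is approximate, it can be worth spot-checking results against exact nearest neighbors on a small sample. A brute-force cosine-distance sketch in plain Python (independent of Spark; function names are our own):

```python
import math

def cosine_distance(a, b):
    """1 - cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return 1.0 - dot / (norm_a * norm_b)

def exact_neighbors(query, items, k):
    """items: list of (id, vector) pairs. Returns the k ids closest to query."""
    ranked = sorted(items, key=lambda item: cosine_distance(query, item[1]))
    return [item_id for item_id, _ in ranked[:k]]

items = [(0, [1.0, 0.0]), (1, [0.9, 0.1]), (2, [0.0, 1.0])]
print(exact_neighbors([1.0, 0.0], items, 2))  # [0, 1]
```

Comparing the overlap between these exact ids and the index's `findNearestNeighbors` output on a sampled subset gives a rough recall estimate for your chosen `setM`/`setEf` settings.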