PySpark HNSW Library

library 1.1.0 · python
verified May 23, 2026

pyspark-hnsw is a Python library that provides a distributed implementation of Hierarchical Navigable Small World (HNSW) graphs for Approximate Nearest Neighbor (ANN) search on Apache Spark. It enables vector similarity search over large datasets from within PySpark by spreading the work across Spark's distributed executors. The current stable version on PyPI is 1.1.0; releases arrive at a moderate cadence, with minor updates in recent months.
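
To make concrete what "approximate nearest neighbor" approximates, here is a minimal sketch of the exact, brute-force search that an HNSW index is designed to avoid. It is plain Python and does not use pyspark-hnsw or its API; the function names (`euclidean`, `exact_knn`) and the toy vectors are hypothetical and exist only for illustration.

```python
import heapq
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def exact_knn(query, vectors, k):
    """Return the k (distance, id) pairs closest to query, nearest first.

    Every stored vector is scanned, so each query costs one distance
    computation per vector. ANN indexes trade exactness for fewer of these.
    """
    scored = ((euclidean(query, vec), vec_id) for vec_id, vec in vectors.items())
    return heapq.nsmallest(k, scored)


# Hypothetical toy data: id -> embedding (not taken from the library).
vectors = {
    "a": [0.0, 0.0],
    "b": [1.0, 0.0],
    "c": [0.0, 2.0],
    "d": [5.0, 5.0],
}

neighbors = exact_knn([0.9, 0.1], vectors, k=2)
print([vec_id for _, vec_id in neighbors])  # prints ['b', 'a']
```

A full scan like this is fine for a few thousand vectors but grows linearly with dataset size, which is the cost a graph-based index such as HNSW aims to reduce by visiting only a small part of the data per query.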
