PySpark
JSON →PySpark is the Python API for Apache Spark, a unified analytics engine for large-scale data processing. It allows users to leverage Spark's powerful distributed computing capabilities, including Spark SQL, DataFrames, Structured Streaming, and MLlib, using familiar Python syntax. The library is actively maintained, with the current version being 4.1.1, and follows the release cadence of the broader Apache Spark project.
Traffic · last 30 days ↓54% vs prev 7d
total hits 37
actors 12 distinct systems
last hit 1h ago ClaudeBot
top countries 🇺🇸 United States · 🇸🇬 Singapore · 🇨🇦 Canada · VN · 🇩🇪 Germany
API endpoints
full doc /v1/registry/pyspark
install /v1/registry/pyspark/install
compatibility /v1/registry/pyspark/compatibility