PySpark
JSON →PySpark is the Python API for Apache Spark, a unified analytics engine for large-scale data processing. It allows users to leverage Spark's powerful distributed computing capabilities, including Spark SQL, DataFrames, Structured Streaming, and MLlib, using familiar Python syntax. The library is actively maintained, with the current version being 4.1.1, and follows the release cadence of the broader Apache Spark project.
Traffic · last 30 days ↑25% vs prev 7d
total hits 35
actors 11 distinct systems
last hit 2d ago ByteDance
top countries 🇺🇸 United States · 🇸🇬 Singapore · 🇨🇦 Canada · VN · 🇪🇸 Spain
API endpoints
full doc /v1/registry/pyspark
install /v1/registry/pyspark/install
compatibility /v1/registry/pyspark/compatibility