PySpark

JSON →
library 4.1.1 ·python
verified Jun 9, 2026 install

PySpark is the Python API for Apache Spark, a unified analytics engine for large-scale data processing. It allows users to leverage Spark's powerful distributed computing capabilities, including Spark SQL, DataFrames, Structured Streaming, and MLlib, using familiar Python syntax. The library is actively maintained, with the current version being 4.1.1, and follows the release cadence of the broader Apache Spark project.

total hits 35
actors 11 distinct systems
last hit 2d ago ByteDance
Amazonbot
4
OAI-SearchBot
4
MetaBot
4
Script
2
ByteDance
2
ChatGPT-User
1
Search engines
2
Humans
9

top countries 🇺🇸 United States · 🇸🇬 Singapore · 🇨🇦 Canada · VN · 🇪🇸 Spain