PySpark

library · 4.1.1 · python
verified Jun 9, 2026

PySpark is the Python API for Apache Spark, a unified analytics engine for large-scale data processing. It exposes Spark's distributed computing capabilities, including Spark SQL, DataFrames, Structured Streaming, and MLlib, through familiar Python syntax. The library is actively maintained; the current version is 4.1.1, and releases follow the cadence of the broader Apache Spark project.
