PySpark Connect Client
The `pyspark-client` is the Python Spark Connect client for Apache Spark, providing a decoupled client-server architecture that enables remote connectivity to Spark clusters using the DataFrame API. It uses gRPC and Apache Arrow for efficient communication. The library is part of the broader Apache Spark project and is actively developed, with releases typically aligning with Apache Spark's minor and major version updates. The current version is 4.1.1, matching Apache Spark 4.1.1.
Warnings
- gotcha Spark Connect operates on a decoupled client-server architecture. This means your client application does not run in the same JVM process as the Spark driver. Consequently, direct access to the underlying Java Virtual Machine (JVM) objects (e.g., `df._jdf`) via Py4J, common in traditional PySpark, is not possible.
- gotcha The `pyspark-client` is a client library only. It does not include or automatically start a Spark cluster or Spark Connect server. You must have a Spark Connect server running and accessible (e.g., via `start-connect-server.sh` from a full Spark distribution) before your client application can connect.
- breaking Starting with Spark 4.0, ANSI SQL mode is enabled by default (`spark.sql.ansi.enabled` set to `true`). This changes how SQL operations handle invalid or undefined results. Operations that previously returned `NULL` (e.g., division by zero, invalid casts) will now throw runtime exceptions.
- breaking PySpark 4.1 (and thus `pyspark-client` 4.1.1) drops support for Python 3.9. Additionally, minimum required versions for `pyarrow` and `pandas` have been raised to `pyarrow>=15.0.0` and `pandas>=2.2.0`.
- gotcha In Spark 4.1, `DataFrame['name']` on the Spark Connect Python Client no longer eagerly validates the column name. This means misspelled or non-existent column names might not raise an error until later in execution.
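If the pre-4.0 NULL-returning SQL semantics are required, ANSI mode can be disabled on the server side. A minimal sketch, assuming a full Spark 4.x distribution (the configuration is applied when launching the Spark Connect server, not by the client):

```shell
# Start a Spark Connect server with ANSI SQL mode disabled.
# This restores pre-4.0 behavior (e.g., 1/0 returns NULL instead of raising);
# not generally recommended for new applications.
./sbin/start-connect-server.sh --conf spark.sql.ansi.enabled=false
```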
Install
- `pip install pyspark-client` (lightweight, client-only package)
- `pip install pyspark[connect]` (full PySpark with Spark Connect dependencies)
Imports
- SparkSession
from pyspark.sql import SparkSession  # preferred; dispatches to Spark Connect when a remote URL is used
from pyspark.sql.connect.session import SparkSession  # explicit Spark Connect session class
Quickstart
import os
from pyspark.sql import SparkSession
from pyspark.sql.functions import lit
# Ensure a Spark Connect server is running, e.g., via ./sbin/start-connect-server.sh
# The default address is sc://localhost:15002
# Connect to the Spark Connect server
spark = SparkSession.builder.remote(os.environ.get('SPARK_CONNECT_SERVER_URL', 'sc://localhost:15002')).getOrCreate()
# Create a DataFrame
df = spark.range(10).withColumn("hello", lit("world"))
# Show the DataFrame
df.show()
# Perform a simple operation
result = df.filter(df.id > 5).count()
print(f"Count of rows with id > 5: {result}")
spark.stop()
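Beyond `host:port`, a Spark Connect URL can carry optional parameters after a `/;` separator, such as `sc://host:15002/;use_ssl=true;token=...`. The helper below is a rough illustration of that format only; `parse_connect_url` is hypothetical and not part of PySpark:

```python
def parse_connect_url(url: str) -> tuple[str, int, dict]:
    """Hypothetical helper: split a Spark Connect URL into (host, port, params).

    Illustrates the sc://host:port/;key=value;... connection string shape;
    the real client performs its own, more complete parsing.
    """
    if not url.startswith("sc://"):
        raise ValueError("Spark Connect URLs must start with sc://")
    rest = url[len("sc://"):]
    # Parameters, if any, follow a "/;" separator after host:port.
    netloc, _, param_str = rest.partition("/;")
    host, _, port = netloc.partition(":")
    params = dict(p.split("=", 1) for p in param_str.split(";") if p)
    # 15002 is the default Spark Connect server port.
    return host, int(port) if port else 15002, params

host, port, params = parse_connect_url("sc://localhost:15002/;use_ssl=true")
```

Here `host` is `"localhost"`, `port` is `15002`, and `params` is `{"use_ssl": "true"}`.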