Apache Airflow Apache Spark Provider
This provider package enables Apache Airflow to interact with Apache Spark, allowing for the orchestration and scheduling of Spark jobs. It includes operators and hooks for submitting Spark applications, executing Spark SQL queries, and performing data transfers. It's an active provider package, with version 6.0.0 released on March 28, 2026. Airflow providers are released independently of Airflow core, typically with a regular cadence to support new features and bug fixes.
Warnings
- breaking The `pyspark` package is no longer included by default in `apache-airflow-providers-apache-spark` starting from version 6.0.0. Only `spark-connect` type connections work by default. For other Spark connection types (e.g., submitting PySpark jobs locally), you must install the provider with the `[pyspark]` extra.
- breaking The minimum required versions for `pyspark` and `spark-connect` are now 4.0.0.
- breaking This provider version (6.x.x) requires Apache Airflow 2.11.0 or newer. Older provider versions had similar minimum Airflow requirements (e.g., 5.x.x required >=2.11.0, 3.x.x required >=2.2.0, 2.x.x required >=2.1.0).
- gotcha To run Spark jobs via Airflow (especially `SparkSubmitOperator` or `SparkSqlOperator`), the Airflow worker executing the task must have Java installed and correctly configured with `JAVA_HOME`, and Spark binaries must be available in the system's `PATH` (e.g., `SPARK_HOME/bin`).
- gotcha When running Spark jobs on Kubernetes, the `apache-airflow-providers-cncf-kubernetes` provider must be installed separately to enable the necessary integration.
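The worker-side prerequisites above can be sanity-checked with a small shell sketch (a rough helper, not part of the provider; `java` and `spark-submit` are the standard binary names, while `JAVA_HOME` and install paths vary by environment):

```shell
#!/bin/sh
# Sketch: report whether the binaries an Airflow Spark task needs are on PATH.
check() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "found: $1"
  else
    echo "missing: $1"
  fi
}
check java
check spark-submit
# JAVA_HOME must point at the JDK/JRE the worker should use
echo "JAVA_HOME=${JAVA_HOME:-<unset>}"
```

Run this on the Airflow worker (not the scheduler) that will execute the task, since that is where `SparkSubmitOperator` spawns the `spark-submit` process.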
Install
- pip install apache-airflow-providers-apache-spark
- pip install "apache-airflow-providers-apache-spark[pyspark]"
- pip install "apache-airflow-providers-apache-spark[cncf.kubernetes]"
Imports
- SparkSubmitOperator
from airflow.providers.apache.spark.operators.spark_submit import SparkSubmitOperator
- SparkSqlOperator
from airflow.providers.apache.spark.operators.spark_sql import SparkSqlOperator
- PySparkOperator
from airflow.providers.apache.spark.operators.pyspark import PySparkOperator
- SparkJDBCOperator
from airflow.providers.apache.spark.operators.spark_jdbc import SparkJDBCOperator
Quickstart
from __future__ import annotations
import pendulum
from airflow.models.dag import DAG
from airflow.providers.apache.spark.operators.spark_submit import SparkSubmitOperator
# For local testing, ensure a Spark Connection 'spark_default' is configured in Airflow UI.
# Example: Host: spark://localhost:7077 (or similar Spark Master URL)
# For a PySpark job, you might need a local 'pyspark_job.py' file.
# Example pyspark_job.py content:
# from pyspark.sql import SparkSession
# spark = SparkSession.builder.appName('SimpleSparkApp').getOrCreate()
# data = [('Alice', 1), ('Bob', 2), ('Charlie', 3)]
# df = spark.createDataFrame(data, ['Name', 'Age'])
# df.show()
# spark.stop()
with DAG(
    dag_id="spark_submit_example_dag",
    start_date=pendulum.datetime(2023, 1, 1, tz="UTC"),
    catchup=False,
    schedule=None,
    tags=["spark", "example"],
) as dag:
    submit_pyspark_job = SparkSubmitOperator(
        task_id="submit_pyspark_job",
        conn_id="spark_default",  # Ensure this Spark connection is configured in the Airflow UI
        application="/opt/airflow/dags/pyspark_job.py",  # Path to your PySpark script
        name="airflow_pyspark_job",
        conf={
            "spark.executor.memory": "2g",
            "spark.driver.memory": "1g",
        },
        verbose=True,
        # For more options, see the SparkSubmitOperator documentation
        # application_args=["--input", "/path/to/input.csv", "--output", "/path/to/output.csv"]
    )
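Under the hood, `SparkSubmitOperator` turns its arguments into a `spark-submit` command line executed on the worker. A minimal sketch of that mapping (a simplified illustration, not the provider's actual hook, which also handles the master URL, deploy mode, jars, and more):

```python
def build_spark_submit_cmd(application, conf=None, name=None, verbose=False):
    # Simplified sketch: each conf entry becomes a --conf key=value flag,
    # and the application path goes last, as spark-submit expects.
    cmd = ["spark-submit"]
    for key, value in (conf or {}).items():
        cmd += ["--conf", f"{key}={value}"]
    if name:
        cmd += ["--name", name]
    if verbose:
        cmd.append("--verbose")
    cmd.append(application)
    return cmd

print(build_spark_submit_cmd(
    "/opt/airflow/dags/pyspark_job.py",
    conf={"spark.executor.memory": "2g", "spark.driver.memory": "1g"},
    name="airflow_pyspark_job",
    verbose=True,
))
```

This is why the gotchas above matter: the resulting command only works if `spark-submit` is on the worker's `PATH` and Java is configured.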