Dagster Spark
Dagster Spark is a Python library that provides integration components for orchestrating Apache Spark jobs within the Dagster data platform. It lets users define, run, and monitor Spark-based data pipelines using Dagster's declarative programming model, with capabilities for data management, lineage, and observability. The library is actively maintained and typically releases in sync with the core Dagster library.
Warnings
- breaking The `SparkSolidDefinition` has been removed. Users should migrate to `create_spark_op` for defining Spark-based operations.
- deprecated Spark Step Launchers are superseded by Dagster Pipes and are no longer the recommended method for launching external code from Dagster ops and assets. While still available, they will not receive new features or active development.
- gotcha The Spark Declarative Pipeline (SDP) integration components (`SparkDeclarativePipelineComponent`, `SparkPipelinesResource`) are in feature preview. This API may have breaking changes in patch version releases and is not considered ready for production use.
- gotcha Ensure compatibility between your `dagster-spark` version, the `pyspark` library version (if used), and your Apache Spark cluster version. Specific Hadoop/AWS Java SDK versions might also be critical for integrations like S3.
- breaking Dagster core (a dependency of `dagster-spark`) no longer supports Python 3.8 and requires `pydantic>=2`.
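Since step launchers are deprecated in favor of Dagster Pipes, external Spark code is typically launched by invoking `spark-submit` from orchestration code. Below is a minimal, stdlib-only sketch of assembling such a command line; the helper name `build_spark_submit` and all paths are illustrative, not part of the dagster-spark API:

```python
import shlex


def build_spark_submit(main_class, application_jar, master_url, conf=None, args=()):
    """Assemble a spark-submit command line; conf maps Spark keys to values."""
    cmd = ["spark-submit", "--class", main_class, "--master", master_url]
    for key, value in (conf or {}).items():
        cmd += ["--conf", f"{key}={value}"]
    cmd.append(application_jar)  # application JAR comes after all options
    cmd += list(args)            # trailing arguments are passed to main()
    return cmd


cmd = build_spark_submit(
    main_class="org.apache.spark.examples.SparkPi",
    application_jar="path/to/spark-examples.jar",  # replace with your JAR
    master_url="local[*]",
    conf={"spark.app.name": "dagster-spark-example"},
    args=["100"],  # SparkPi partition count
)
print(shlex.join(cmd))
```

A command assembled this way can be handed to `PipesSubprocessClient` (from core Dagster) so stdout, stderr, and metadata stream back into the Dagster UI.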
Install
pip install dagster dagster-spark
Imports
- create_spark_op
from dagster_spark import create_spark_op
- define_spark_config
from dagster_spark import define_spark_config
- SparkDeclarativePipelineComponent
from dagster_spark.components.spark_declarative_pipeline import SparkDeclarativePipelineComponent
Quickstart
from dagster import Definitions, job
from dagster_spark import create_spark_op, spark_resource

# create_spark_op builds an op that shells out to `spark-submit`.
# Its config schema comes from define_spark_config(); runtime settings such
# as the application JAR and master URL are supplied via run config when the
# job is launched, not as arguments to create_spark_op.
spark_pi_op = create_spark_op(
    name="spark_pi",
    main_class="org.apache.spark.examples.SparkPi",
)

@job(resource_defs={"spark": spark_resource})
def spark_pi_job():
    spark_pi_op()

# For asset-based workflows, prefer Dagster Pipes (or PySparkResource from
# dagster-pyspark when you need a SparkSession directly) over wrapping this op.
defs = Definitions(jobs=[spark_pi_job])
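An op created with `create_spark_op` reads its settings from run config using the schema returned by `define_spark_config()`. A launch might supply something like the following sketch; the field names shown (`master_url`, `application_jar`, `spark_home`, `application_arguments`, `spark_conf`) and the nested `spark_conf` shape are assumptions based on recent releases, so verify them against your installed `dagster-spark` version:

```python
# Hypothetical run config for the spark_pi_job quickstart above.
run_config = {
    "ops": {
        "spark_pi": {
            "config": {
                "master_url": "local[*]",
                "application_jar": "path/to/spark-examples.jar",  # your job JAR
                "spark_home": "/opt/spark",  # often falls back to $SPARK_HOME
                "application_arguments": "100",
                # spark_conf mirrors Spark option names as nested keys
                "spark_conf": {
                    "spark": {"app": {"name": "dagster-spark-example"}}
                },
            }
        }
    }
}
```

This dict can be passed to `spark_pi_job.execute_in_process(run_config=run_config)` in a script or test, or expressed as equivalent YAML in the Dagster launchpad.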