Soda Core Spark Integration (Legacy)
This entry describes `soda-core-spark`, an older Python library (imported as `sodaspark`) for data quality testing on Spark DataFrames. It was an extension of `Soda SQL` that allowed programmatic data quality checks. As of Soda v3, `soda-core-spark` and `soda-sql` have been deprecated. Spark DataFrame integration is now handled directly by the main `soda-core` library using its native Spark connection capabilities. The latest available version of this deprecated package is `3.5.6`.
Warnings
- breaking The `soda-core-spark` package has been officially deprecated. It, along with `Soda SQL`, has been replaced by `Soda Core` as the unified solution for data quality testing.
- breaking Soda Core v4 (released January 28, 2026) introduces 'Data Contracts' as the default method for defining data quality rules, replacing the previous 'checks language' syntax. This is a significant breaking change for users migrating from older versions of Soda Core or `soda-core-spark`.
- gotcha Soda Core v3 (which is the relevant version for migrating from `soda-core-spark`) has known compatibility limitations. Specifically, it does not support Apache Spark 4.0 or Python 3.12.
- gotcha When using Soda Core with Spark DataFrames, you typically need to run Soda programmatically and register DataFrames as temporary views for checks to be executed.
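For context, the v3 "checks language" (SodaCL) that the warnings above refer to looks like the following sketch. The dataset name (`my_dataframe` here, an illustrative name) must match a temporary view registered from the DataFrame:

```yaml
# SodaCL (Soda Core v3 checks language) — replaced by Data Contracts in v4.
# `my_dataframe` is the name of a temporary view registered from the DataFrame.
checks for my_dataframe:
  - row_count > 0
  - missing_count(age) = 0
  - avg(age) between 20 and 40
```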
Install
-
pip install soda-spark  # legacy Soda SQL-era package; provides the `sodaspark` module used below
Imports
- scan
from sodaspark import scan
Quickstart
import os
from pyspark.sql import SparkSession
from sodaspark import scan
# Initialize Spark Session
spark_session = SparkSession.builder.appName("SodaSparkExample").getOrCreate()
# Create a sample DataFrame
df = spark_session.createDataFrame([
    {"id": "1", "name": "Alice", "age": 30},
    {"id": "2", "name": "Bob", "age": None},
    {"id": "3", "name": "Charlie", "age": 35},
    {"id": "4", "name": "David", "age": 22},
])
# Define the scan definition in Soda SQL's YAML format.
# The legacy `sodaspark` library accepts it as a string (or a file path).
# Table-level tests live under `tests:`; column tests live under `columns:`.
# For modern Soda Core, checks would typically be in a separate .yml file.
scan_definition = """
table_name: my_dataframe
metrics:
  - row_count
  - missing_count
  - avg
tests:
  - row_count > 0
columns:
  age:
    tests:
      - missing_count < 1
      - avg > 20
      - avg < 40
"""
# Execute the scan. The legacy `sodaspark` API takes the scan definition
# first and the DataFrame second; there is no `data_source_name` argument.
scan_results = scan.execute(scan_definition, df)
print("Measurements:", scan_results.measurements)
print("Test results:", scan_results.test_results)
# Stop Spark Session
spark_session.stop()
# IMPORTANT: This quickstart uses the deprecated `sodaspark` library.
# For current Spark integration, please refer to Soda Core documentation and use
# `from soda.scan import Scan` and `scan.add_spark_session(...)`.