PySpark DataFrame Testing Utility
pyspark-test is a Python library designed to simplify unit testing for PySpark DataFrames. It provides a function, `assert_pyspark_df_equal`, inspired by the pandas testing module, which allows users to compare two Spark DataFrames and identify any differences. The library is currently at version 0.2.0 and has a stable, albeit infrequent, release cadence, focusing on its core DataFrame comparison functionality.
Common errors
- `AssertionError: DataFrames are not equal. ...`
  - cause: The actual data within the DataFrames differs, either in individual cell values or in the presence/absence of rows.
  - fix: Examine the detailed error output from `assert_pyspark_df_equal`, which highlights the differing rows and columns. Verify your transformation logic or expected input/output data, and set `order_by` if row order is non-deterministic.
- `AssertionError: Schema are not equal. ...`
  - cause: The schemas (column names, data types, nullability) of the compared DataFrames do not match, and `check_dtype=True` was used.
  - fix: Review the schema definitions of both DataFrames and ensure that all column names, their exact data types, and nullability properties are identical. Set `check_dtype=False` if you only care about data values and not type strictness.
- `AssertionError: Column names are not equal. ...`
  - cause: With `check_column_names=True`, the DataFrames have different column names, or the columns are in a different order and `check_columns_in_order=True` was used.
  - fix: Ensure that both DataFrames have exactly the same column names in the same order. If column order is not important, set `check_columns_in_order=False`.
Warnings
- gotcha By default, `assert_pyspark_df_equal` does not check for column name equality (`check_column_names=False`) or column order (`check_columns_in_order=False`). This can lead to false positives if DataFrames have identical data but different column metadata or ordering. Always explicitly set comparison strictness.
- gotcha Managing SparkSession setup and teardown in a test suite can be complex, leading to resource leaks or slow tests if not handled properly. Excessive logging from `py4j` (Spark's Java gateway) can also obscure relevant test output.
Install
- pip install pyspark-test
Imports
- assert_pyspark_df_equal
from pyspark_test import assert_pyspark_df_equal
Quickstart
import datetime
from pyspark.sql import SparkSession
from pyspark.sql.types import StructType, StructField, DateType, StringType, DoubleType, LongType
from pyspark_test import assert_pyspark_df_equal
# Initialize a local SparkSession for testing
spark = SparkSession.builder.master('local[1]').appName('pyspark-test-quickstart').getOrCreate()
# Create two identical DataFrames
df_1 = spark.createDataFrame(
data=[
[datetime.date(2020, 1, 1), 'apple', 1.123, 10],
[None, 'banana', 2.345, 20],
],
schema=StructType([
StructField('col_a', DateType(), True),
StructField('col_b', StringType(), True),
StructField('col_c', DoubleType(), True),
StructField('col_d', LongType(), True),
]),
)
df_2 = spark.createDataFrame(
data=[
[datetime.date(2020, 1, 1), 'apple', 1.123, 10],
[None, 'banana', 2.345, 20],
],
schema=StructType([
StructField('col_a', DateType(), True),
StructField('col_b', StringType(), True),
StructField('col_c', DoubleType(), True),
StructField('col_d', LongType(), True),
]),
)
# Assert that the two DataFrames are equal
print("Asserting identical DataFrames...")
assert_pyspark_df_equal(df_1, df_2, check_dtype=True, check_column_names=True, check_columns_in_order=True, order_by=['col_a', 'col_b'])
print("Assertion successful: DataFrames are equal.")
# Example of intentionally different DataFrames to demonstrate failure
df_3 = spark.createDataFrame(
data=[
[datetime.date(2020, 1, 1), 'apple', 1.123, 10],
[None, 'orange', 99.999, 20], # Changed data
],
schema=StructType([
StructField('col_a', DateType(), True),
StructField('col_b', StringType(), True),
StructField('col_c', DoubleType(), True),
StructField('col_d', LongType(), True),
]),
)
print("\nAsserting different DataFrames (expected to fail)...")
try:
assert_pyspark_df_equal(df_1, df_3, check_dtype=True, check_column_names=True, check_columns_in_order=True, order_by=['col_a', 'col_b'])
except AssertionError as e:
print(f"Caught expected error: {e}")
# Stop SparkSession
spark.stop()