pbspark: Protobuf PySpark Conversion
pbspark is a Python package for converting between protobuf messages and PySpark DataFrames. It maps protobuf field types to Spark SQL types and performs serialization and deserialization inside PySpark UDFs. As of version 0.9.0, the project is actively maintained, with minor releases every few months.
Common errors
- `PicklingError: Can't pickle <class 'your_proto_module.YourMessage'>: attribute lookup on your_proto_module failed`
  - Cause: Spark workers cannot find or load the definition of your protobuf message class when Spark pickles it for distribution. This usually means the module path of the `protoc`-generated Python file does not match the path you import it under.
  - Fix: Run `protoc` so the generated `_pb2.py` files land in a package whose import path matches how you import them, make sure that package is discoverable by Python on the driver and on every worker, and use the same import path for `YourMessage` in your `from_protobuf`/`to_protobuf` calls.
- `RecursionError: maximum recursion depth exceeded while calling a Python object`
  - Cause: `pbspark` recurses through message fields to infer a Spark schema. A self-referencing or circularly nested protobuf message has no finite schema, so inference exceeds Python's default recursion limit.
  - Fix: For recursive message definitions, implement custom conversion logic that explicitly limits the inference depth, or manually define the Spark schema to break the recursion. Alternatively, consider flattening the protobuf structure if possible.
- `TypeError: StructType can not accept a non-struct type None in a nullable field`
  - Cause: A nullable field that the inferred schema expects to be a nested struct arrives as `None` or another incompatible type during conversion. This typically surfaces when protobuf definitions change or data quality issues exist.
  - Fix: Ensure your messages consistently adhere to their schema, especially nested structures, and handle absent optional fields explicitly. Pre-process the data, or register custom serialization/deserialization logic on `MessageConverter` if the default inference is too strict for your data.
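One way to break the recursion described in the `RecursionError` entry is to stop schema inference at the recursive field by encoding it to a plain string. The sketch below is illustrative: the `Tree` message and `encode_tree` helper are assumptions, modeled with a plain class so the example stands alone; with a real generated message you would register the encoder on `MessageConverter` as shown in the trailing comment.

```python
import json

# Hypothetical recursive message, in .proto terms:
#   message Tree { string label = 1; repeated Tree children = 2; }
# Modeled here as a plain class so the sketch is self-contained.
class Tree:
    def __init__(self, label, children=None):
        self.label = label
        self.children = children or []

def encode_tree(msg, max_depth=10):
    """Encode a recursive message to a JSON string with bounded depth."""
    def to_dict(node, depth):
        if depth <= 0:
            return None  # truncate rather than recurse forever
        return {
            "label": node.label,
            "children": [to_dict(c, depth - 1) for c in node.children],
        }
    return json.dumps(to_dict(msg, max_depth))

# With pbspark's custom-serde support (assumed usage), registering the
# encoder makes the field infer as a plain string column instead of
# recursing through the schema:
#   from pyspark.sql.types import StringType
#   mc = MessageConverter()
#   mc.register_serializer(Tree, encode_tree, StringType())

print(encode_tree(Tree("root", [Tree("leaf")])))
```

The truncation-to-`None` choice is one policy; raising an error at the depth limit is equally valid if silent truncation would hide bad data.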
Warnings
- breaking: PySpark's compatibility requirements for Python, Pandas, NumPy, and PyArrow change frequently across Spark versions, and `pbspark` relies heavily on PySpark UDFs and DataFrame structures. Ensure your PySpark version is compatible with the other Python libraries in your environment; mismatches can lead to unexpected errors or silent data corruption.
- gotcha: Self-referencing or circular protobuf messages can trigger `RecursionError` during schema inference, because Spark schemas cannot express arbitrary nesting depth.
- gotcha: Incorrect module paths for `protoc`-generated Python files lead to `PicklingError` when message types are shipped across a distributed Spark environment: `pickle` stores classes by qualified name, so the class definition must be importable under the same module path on every worker as when it was pickled.
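A concrete sketch of the module-path discipline the pickling gotcha calls for. The `proto/` layout and `example` package name are assumptions; `--proto_path` and `--python_out` are standard `protoc` flags:

```shell
# Assumed layout:
#   proto/example/example.proto   (declares `package example;`)
# Generate example/example_pb2.py at the project root so the generated
# module's import path matches the proto package:
protoc --proto_path=proto --python_out=. proto/example/example.proto

# Import with exactly the path the file was generated under; the same
# path must resolve on every Spark worker (e.g. ship the package via
# spark-submit --py-files):
#   from example.example_pb2 import SimpleMessage
```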
Install
```shell
pip install pbspark
```
Imports
- `from pbspark import from_protobuf`
- `from pbspark import to_protobuf`
- `from pbspark import MessageConverter`
- `from pyspark.sql.session import SparkSession`
- `from pyspark.sql.functions import struct`
- `from example.example_pb2 import SimpleMessage`
Quickstart
This example assumes `example/example_pb2.py` has been generated by `protoc` from the `.proto` file shown in the comment; `mc.from_protobuf` needs a real generated message class (with its descriptor), so a hand-rolled stand-in will not work.

```python
from pyspark.sql.session import SparkSession
from pyspark.sql.functions import struct
from pbspark import MessageConverter

# SimpleMessage is generated by protoc from a .proto file like:
#
#   syntax = "proto3";
#   package example;
#
#   message SimpleMessage {
#     string name = 1;
#     int64 quantity = 2;
#     float measure = 3;
#   }
from example.example_pb2 import SimpleMessage

spark = SparkSession.builder.appName("PbsparkQuickstart").getOrCreate()

example_message = SimpleMessage(name="hello", quantity=5, measure=12.3)
data = [{"value": example_message.SerializeToString()}]
df_encoded = spark.createDataFrame(data)

mc = MessageConverter()

# Decode the encoded protobuf messages into a Spark StructType column
df_decoded = df_encoded.select(mc.from_protobuf(df_encoded.value, SimpleMessage).alias("value"))
df_decoded.printSchema()
df_decoded.show(truncate=False)

# Expand the struct into individual columns
df_expanded = df_decoded.select("value.*")
df_expanded.printSchema()
df_expanded.show(truncate=False)

# Convert back to an encoded protobuf binary column
df_re_encoded = df_expanded.select(
    mc.to_protobuf(struct(df_expanded.name, df_expanded.quantity, df_expanded.measure), SimpleMessage).alias("value")
)
df_re_encoded.printSchema()
df_re_encoded.show(truncate=False)

spark.stop()
```