Koalas: pandas API on Apache Spark

library 1.8.2 ·python deprecated

✓ verified Jun 30, 2026

Koalas provides a pandas-compatible API that runs on Apache Spark, allowing users familiar with pandas to work with large, distributed datasets. The current version is 1.8.2. Its development as a standalone library has ceased, as its functionality has been officially integrated into PySpark as 'pandas API on Spark' starting with Apache Spark 3.2. Maintenance releases are infrequent, primarily addressing critical bug fixes.

Traffic · last 30 days ↓25% vs prev 7d · indexed Sat Apr 11 · updated Sat Jul 11

total hits 19

actors 5 distinct systems

last hit 2d ago AhrefsBot

ByteDance

GPTBot

Script

Search engines

Humans

top countries 🇸🇬 Singapore · 🇺🇸 United States · 🇨🇦 Canada · VN · MY

Resources

docskoalas.readthedocs.io/ ↗

githubgithub.com/databricks/koalas ↗

packagepypi.org/project/koalas/ ↗

API endpoints

full doc /v1/registry/koalas

install /v1/registry/koalas/install

compatibility /v1/registry/koalas/compatibility