Apache DataSketches Library for Python

library 5.2.0 · python

✓ verified Jun 28, 2026

The Apache DataSketches Library for Python provides a collection of high-performance stochastic streaming algorithms (sketches) that answer approximate queries on massive datasets. Each sketch comes with mathematically proven error bounds. The collection targets problems such as distinct counting, quantiles, most-frequent items, joins, matrix computations, and graph analysis. The current version is 5.2.0, released on a regular cadence as part of the Apache DataSketches project.
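To make the idea of a distinct-count sketch concrete, here is a minimal pure-Python illustration of the k-minimum-values (KMV) technique. It is a toy written for this page, not the library's implementation or API: the class name KMVSketch, its methods, and the choice of SHA-256 as the hash are all assumptions made for the example. The sketch keeps only the k smallest hash values it has seen, so memory stays fixed no matter how many items arrive, and the relative error shrinks roughly as 1/sqrt(k).

```python
import hashlib
import heapq


class KMVSketch:
    """Toy k-minimum-values distinct-count sketch (illustration only,
    not part of the datasketches package)."""

    def __init__(self, k=1024):
        self.k = k
        self._heap = []        # max-heap (values negated) of the k smallest hashes
        self._members = set()  # same hashes, for fast duplicate checks

    @staticmethod
    def _hash(item):
        # Map an item to a pseudo-uniform float in [0, 1).
        digest = hashlib.sha256(str(item).encode("utf-8")).digest()
        return int.from_bytes(digest[:8], "big") / 2**64

    def update(self, item):
        h = self._hash(item)
        if h in self._members:
            return  # this hash is already retained, nothing changes
        if len(self._heap) < self.k:
            heapq.heappush(self._heap, -h)
            self._members.add(h)
        elif h < -self._heap[0]:
            # New hash is smaller than the current k-th minimum: swap it in.
            evicted = -heapq.heappushpop(self._heap, -h)
            self._members.discard(evicted)
            self._members.add(h)

    def estimate(self):
        if len(self._heap) < self.k:
            # Fewer than k distinct hashes seen: the count is exact.
            return float(len(self._heap))
        kth_min = -self._heap[0]
        return (self.k - 1) / kth_min


if __name__ == "__main__":
    sketch = KMVSketch(k=1024)
    for i in range(200_000):
        sketch.update(f"user-{i % 50_000}")  # 50,000 distinct values, repeated
    print(f"estimated distinct count: {sketch.estimate():.0f}")
```

The sketches in the library itself use more refined data structures and estimators; consult the project documentation for its actual classes and parameters.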

indexed Tue Apr 14 · updated Sat Jul 11

Resources

package pypi.org/project/datasketches/

homepage datasketches.apache.org

API endpoints

full doc /v1/registry/datasketches

install /v1/registry/datasketches/install

compatibility /v1/registry/datasketches/compatibility