Apache DataSketches Library for Python

library 5.2.0 · python
verified May 23, 2026

The Apache DataSketches Library for Python provides a collection of high-performance stochastic streaming algorithms (sketches) for answering approximate queries on massive datasets. Sketches come with mathematically proven error bounds and target problems such as distinct counting, quantiles, most-frequent items, joins, matrix computations, and graph analysis. The current version is 5.2.0, released on a regular cadence as part of the Apache DataSketches project.
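To make the idea of a sketch concrete, here is a minimal standard-library illustration of approximate distinct counting using a K-Minimum-Values (KMV) scheme. It is not the DataSketches API or its implementation: the class `KMVSketch`, its methods, and the parameter `k` are hypothetical names chosen for this example. It only shows the general principle that a small, fixed-size summary can estimate a count with a relative error of roughly 1/sqrt(k).

```python
import hashlib
import heapq


class KMVSketch:
    """Toy K-Minimum-Values sketch (hypothetical, for illustration only)."""

    def __init__(self, k=1024):
        self.k = k
        self._heap = []         # negated hashes, so heap[0] is the largest kept hash
        self._members = set()   # hash values currently retained (for deduplication)

    @staticmethod
    def _hash01(item):
        # Map an item to a pseudo-uniform value in [0, 1] using 64 bits of SHA-256.
        digest = hashlib.sha256(str(item).encode("utf-8")).digest()
        return int.from_bytes(digest[:8], "big") / 2**64

    def update(self, item):
        h = self._hash01(item)
        if h in self._members:
            return  # duplicate of a retained value: nothing changes
        if len(self._heap) < self.k:
            heapq.heappush(self._heap, -h)
            self._members.add(h)
        elif h < -self._heap[0]:
            # New hash is smaller than the largest retained one: swap them.
            evicted = -heapq.heappushpop(self._heap, -h)
            self._members.discard(evicted)
            self._members.add(h)

    def estimate(self):
        if len(self._heap) < self.k:
            return float(len(self._heap))  # fewer than k distinct hashes: exact
        kth_smallest = -self._heap[0]
        return (self.k - 1) / kth_smallest


sketch = KMVSketch(k=1024)
for i in range(50_000):
    sketch.update(f"user-{i}")
    sketch.update(f"user-{i}")  # duplicates do not affect the estimate
print(f"estimated distinct count: {sketch.estimate():.0f}")
```

The memory used is bounded by `k` regardless of stream length, which is the trade the library's sketches make as well: a small, fixed footprint in exchange for a bounded approximation error.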
