InfiniGram

JSON →
library 2.6.0 ·python
verified Jun 7, 2026

A Python package for querying and analyzing the Infini-gram dataset, which provides n-gram statistics over a massive web corpus. Version 2.6.0 supports Python >=3.11 and offers tools for tokenization, n-gram counting, and corpus insights. Active development, with updates targeting performance improvements.