BlingFire

library 0.1.8 ·python

✓ verified May 21, 2026

BlingFire is a Python wrapper for a lightning-fast Finite State Machine (FSM) based Natural Language Processing (NLP) library developed by Microsoft. It is designed for high-performance text tokenization, multi-word expression matching, stemming, and lemmatization. Known for its speed, it often outperforms other NLP libraries like Hugging Face and SpaCy in tokenization tasks. The library supports various tokenization algorithms including pattern-based, WordPiece, Unigram LM, and BPE. The current version is 0.1.8, and it maintains an active release cadence with periodic updates adding new features and models.

Traffic · last 30 days ↓12% vs prev 7d · indexed Sun Apr 12 · updated Wed May 27

total hits 21

actors 7 distinct systems

last hit 4d ago SERankingBot

ByteDance

GPTBot

Script

Search engines

Humans

top countries 🇩🇪 Germany · 🇸🇬 Singapore · 🇺🇸 United States · 🇫🇷 France · 🇨🇦 Canada

Resources

githubgithub.com/microsoft/blingfire ↗

packagepypi.org/project/blingfire/ ↗

API endpoints

full doc /v1/registry/blingfire

install /v1/registry/blingfire/install

compatibility /v1/registry/blingfire/compatibility