BlingFire

JSON →
library 0.1.8 ·python
verified May 21, 2026

BlingFire is a Python wrapper for a lightning-fast Finite State Machine (FSM) based Natural Language Processing (NLP) library developed by Microsoft. It is designed for high-performance text tokenization, multi-word expression matching, stemming, and lemmatization. Known for its speed, it often outperforms other NLP libraries like Hugging Face and SpaCy in tokenization tasks. The library supports various tokenization algorithms including pattern-based, WordPiece, Unigram LM, and BPE. The current version is 0.1.8, and it maintains an active release cadence with periodic updates adding new features and models.

total hits 21
actors 7 distinct systems
last hit 4d ago SERankingBot
ByteDance
4
GPTBot
2
Script
2
Search engines
1
Humans
2

top countries 🇩🇪 Germany · 🇸🇬 Singapore · 🇺🇸 United States · 🇫🇷 France · 🇨🇦 Canada