TinySegmenter

JSON →
library 0.4 ·python maintenance
verified May 23, 2026

TinySegmenter in Python is a Python port of the original JavaScript-based TinySegmenter, an extremely compact (23KB) Japanese tokenizer. It offers character-based segmentation with approximately 95% precision for Japanese news articles, compatible with MeCab + IPADic segmentation units, without relying on external dictionaries. The latest version, 0.4, was released on September 16, 2018, and its development is not actively maintained, though contributions are welcome.

total hits 12
actors 3 distinct systems
last hit 6d ago Script
GPTBot
6
Script
2

top countries 🇺🇸 United States · 🇨🇦 Canada · 🇩🇪 Germany