TinySegmenter
TinySegmenter in Python is a port of the original JavaScript TinySegmenter, an extremely compact (23KB) Japanese tokenizer. It performs character-based segmentation with roughly 95% precision on Japanese news articles, producing units compatible with MeCab + IPADic, and requires no external dictionaries. The latest version, 0.4, was released on September 16, 2018; the project is no longer actively maintained, though contributions are welcome.
Warnings
- gotcha The maintainer explicitly states that the project is not actively developed and receives only limited maintenance. New features or rapid bug fixes are unlikely.
- gotcha As a 'very compact' tokenizer, TinySegmenter makes trade-offs in accuracy and performance compared to larger, more sophisticated Japanese NLP libraries. While suitable for lightweight tasks, it might not offer the highest precision or speed for complex or large-scale Japanese text processing.
- gotcha Although the `tinysegmenter` 0.4 package states compatibility with Python 3, a prominent fork named `tinysegmenter3` exists specifically to provide improved Python 3 compatibility and performance. This suggests the original `tinysegmenter` may be less optimized for modern Python 3 environments than its dedicated fork.
Install
pip install tinysegmenter
Imports
- TinySegmenter
from tinysegmenter import TinySegmenter
Quickstart
import tinysegmenter
segmenter = tinysegmenter.TinySegmenter()
text = "私の名前は中野です"
tokens = segmenter.tokenize(text)
print(' | '.join(tokens))
# Expected output: 私 | の | 名前 | は | 中野 | です
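Because `tokenize()` returns a plain list of strings, downstream processing is ordinary Python. A minimal sketch of counting token frequencies from the tokenizer's output (the `token_frequencies` helper is illustrative, not part of the library):

```python
from collections import Counter

def token_frequencies(tokens):
    """Count occurrences of each token, skipping whitespace-only tokens.

    TinySegmenter may emit whitespace as tokens for text containing
    spaces, so they are filtered out here.
    """
    return Counter(t for t in tokens if t.strip())

# Usage with TinySegmenter (assumes `pip install tinysegmenter`):
# import tinysegmenter
# segmenter = tinysegmenter.TinySegmenter()
# freqs = token_frequencies(segmenter.tokenize("私の名前は中野です"))
# freqs.most_common(3) then gives the three most frequent tokens.
```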