spaCy Chinese Word Segmentation (pkuseg)
spacy-pkuseg is a Chinese word segmentation toolkit for spaCy, forked from pkuseg-python. It provides the `pkuseg` segmenter used by spaCy's Chinese (`zh`) tokenizer, integrating robust Chinese segmentation directly into spaCy's NLP pipeline. The current stable version is 1.0.1, with recent releases primarily focused on compatibility with newer Python and core-dependency (e.g., NumPy) releases.
Warnings
- breaking NumPy 2.0 compatibility: spacy-pkuseg v1.0.0 and later require NumPy >= 2.0. Earlier versions (< 1.0.0) are incompatible with NumPy 2.0 due to binary interface (ABI) changes.
- breaking Fork and rename of `pkuseg-python`: the package `spacy-pkuseg` (from v0.0.26) is a fork. The import path changed from `pkuseg` to `spacy_pkuseg`, the default model changed, and serialization of custom user dictionaries switched from `pickle` to `msgpack` (fixed for custom dicts in v0.0.30).
- gotcha Tokenizer, not a pipeline component: spacy-pkuseg replaces spaCy's default Chinese word segmenter rather than running as an added pipe. Attaching it after another tokenizer has already run leads to unexpected tokenization or errors; configure it through the `zh` tokenizer's `segmenter` setting instead.
- gotcha Default model and explicit selection: `spacy-pkuseg` defaults to the `spacy_ontonotes` model if not specified. Users expecting a different model (e.g., 'web', 'news') might not get desired results without explicit configuration.
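The effect of a user dictionary can be pictured with a toy greedy longest-match pass. This is an illustration only: pkuseg's actual segmenter is a statistical model, and `segment_with_user_dict` below is a hypothetical helper, not part of the library.

```python
def segment_with_user_dict(text, user_dict):
    """Toy greedy forward maximum matching over a user dictionary.

    At each position, prefer the longest dictionary entry; characters
    not covered by any entry are emitted one at a time.
    """
    max_len = max((len(w) for w in user_dict), default=1)
    tokens, i = [], 0
    while i < len(text):
        for length in range(min(max_len, len(text) - i), 0, -1):
            if text[i:i + length] in user_dict:
                tokens.append(text[i:i + length])
                i += length
                break
        else:
            # No dictionary hit: emit a single character
            tokens.append(text[i])
            i += 1
    return tokens

# Dictionary entries are kept together; everything else falls apart
print(segment_with_user_dict("北京大学地球与空间科学学院", {"北京大学", "空间科学"}))
```

A real user dictionary plays a similar role: it biases the segmenter toward keeping listed multi-character terms intact.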
Install
-
pip install spacy-pkuseg
Imports
- pkuseg
import spacy_pkuseg as pkuseg
Quickstart
import spacy

# Configure spaCy's Chinese tokenizer to use the pkuseg segmenter
cfg = {"nlp": {"tokenizer": {"segmenter": "pkuseg"}}}
nlp = spacy.blank("zh", config=cfg)

# Load the default model ('spacy_ontonotes'); downloaded on first use
nlp.tokenizer.initialize(pkuseg_model="spacy_ontonotes")

# To use a different model or a user dictionary:
# nlp.tokenizer.initialize(pkuseg_model="web",
#                          pkuseg_user_dict="path/to/your_dict.txt")

text = "北京大学地球与空间科学学院"
doc = nlp(text)
print(f"Original text: {text}")
print(f"Tokens: {[token.text for token in doc]}")