semchunk
semchunk is a Python library for splitting text into smaller chunks while preserving as much local semantic context as possible. It supports AI-powered hierarchical chunking, chunk overlapping, and processing of Isaacus Legal Graph Schema (ILGS) Documents, and it works with a range of tokenizers. The library is actively developed by Isaacus and released frequently; version 4.0.0 introduced AI chunking and ILGS Document support.
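To make the idea of context-preserving chunking concrete, here is a minimal standard-library sketch. It is not semchunk's implementation or API: the names `count_tokens`, `SPLITTERS`, and `chunk_text` are hypothetical, and the token counter simply counts whitespace-separated words as a stand-in for a real tokenizer. The sketch splits on the coarsest boundary first (paragraphs, then lines, sentences, and words) and greedily merges neighbouring pieces while they still fit the size limit.

```python
import re


def count_tokens(text: str) -> int:
    """Hypothetical token counter: one token per whitespace-separated word."""
    return len(text.split())


# Boundaries ordered from coarsest to finest (an assumption of this sketch).
SPLITTERS = [r"\n\s*\n", r"\n", r"(?<=[.!?])\s+", r"\s+"]


def chunk_text(text: str, chunk_size: int, level: int = 0) -> list[str]:
    """Split text into chunks of at most chunk_size tokens.

    Pieces are cut at the coarsest boundary available and merged back
    together greedily, so related text stays in the same chunk when it fits.
    Merged pieces are joined with a single space, so the original
    whitespace is not preserved.
    """
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")
    text = text.strip()
    if not text:
        return []
    if count_tokens(text) <= chunk_size or level >= len(SPLITTERS):
        return [text]

    chunks: list[str] = []
    current = ""
    for piece in re.split(SPLITTERS[level], text):
        piece = piece.strip()
        if not piece:
            continue
        candidate = f"{current} {piece}" if current else piece
        if count_tokens(candidate) <= chunk_size:
            current = candidate  # still fits: keep merging
            continue
        if current:
            chunks.append(current)
            current = ""
        if count_tokens(piece) <= chunk_size:
            current = piece
        else:
            # Piece is too large on its own: retry with a finer boundary.
            chunks.extend(chunk_text(piece, chunk_size, level + 1))
    if current:
        chunks.append(current)
    return chunks


sample = (
    "First paragraph has five words.\n\n"
    "Second paragraph is a bit longer. It has two sentences."
)
for chunk in chunk_text(sample, chunk_size=6):
    print(repr(chunk))
```

The library itself adds features this sketch leaves out, such as overlapping chunks and support for real tokenizers; consult the package documentation for its actual interface.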
Resources
package pypi.org/project/semchunk/
API endpoints
full doc /v1/registry/semchunk
install /v1/registry/semchunk/install
compatibility /v1/registry/semchunk/compatibility
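The endpoint paths above are relative, so a client has to resolve them against the registry's host. The sketch below shows one way to do that without making any network request. `BASE_URL` is a placeholder, not the registry's real host, and `ENDPOINTS` and `endpoint_url` are hypothetical names; treating the package name as a substitutable path segment is also an assumption, since only the semchunk paths are listed here.

```python
from urllib.parse import urljoin

# Placeholder host: replace with the registry's actual base URL.
BASE_URL = "https://registry.example.com"

# Path templates mirroring the three endpoints listed for semchunk.
ENDPOINTS = {
    "full_doc": "/v1/registry/{package}",
    "install": "/v1/registry/{package}/install",
    "compatibility": "/v1/registry/{package}/compatibility",
}


def endpoint_url(kind: str, package: str = "semchunk", base_url: str = BASE_URL) -> str:
    """Build the absolute URL for one of the listed endpoint kinds."""
    return urljoin(base_url, ENDPOINTS[kind].format(package=package))


for kind in ENDPOINTS:
    print(kind, endpoint_url(kind))
```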