semchunk

JSON →
library 4.0.0 ·python
verified May 21, 2026

semchunk is a Python library for splitting text into smaller chunks while preserving as much local semantic context as possible. It supports advanced features like AI-powered hierarchical chunking, chunk overlapping, and processing Isaacus Legal Graph Schema (ILGS) Documents, working seamlessly with various tokenizers. Actively developed by Isaacus, the library has frequent releases, with version 4.0.0 notably introducing AI chunking and ILGS Document support.

total hits 13
actors 6 distinct systems
last hit 1d ago ByteDance
GPTBot
6
Script
2
ByteDance
1
ClaudeBot
1
Search engines
1

top countries 🇺🇸 United States · 🇩🇪 Germany · 🇫🇷 France · 🇨🇦 Canada · 🇸🇬 Singapore