HojiChar

library 0.16.0 · python

✓ verified Jun 28, 2026

HojiChar is a text preprocessing management system for Python. It provides a pipeline API, inspired by Compose/Filter patterns, for cleaning, filtering, and transforming text data, with built-in support for deduplication, JSON loading/dumping, and asynchronous processing. Current version: 0.16.0 (released Apr 2025); the project follows a monthly release cadence.
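To make the Compose/Filter idea concrete, here is a minimal standard-library sketch of such a pipeline. It does not use HojiChar itself, and every name in it (`Document`, `Pipeline`, `load_json`, `normalize_whitespace`, `length_filter`, `dump_json`, `run_deduplicated`) is hypothetical, chosen for illustration only; consult the library's own documentation for its real classes and signatures.

```python
import json
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass
class Document:
    """Hypothetical container for the text moving through the pipeline."""
    text: str
    is_rejected: bool = False


# In this sketch a filter is any callable mapping a Document to a Document.
Filter = Callable[[Document], Document]


def load_json(key: str) -> Filter:
    """Replace a JSON-encoded line with the string stored under `key`."""
    def apply(doc: Document) -> Document:
        doc.text = json.loads(doc.text)[key]
        return doc
    return apply


def normalize_whitespace(doc: Document) -> Document:
    """Collapse runs of whitespace into single spaces."""
    doc.text = " ".join(doc.text.split())
    return doc


def length_filter(min_len: int, max_len: int) -> Filter:
    """Mark documents whose length falls outside [min_len, max_len] as rejected."""
    def apply(doc: Document) -> Document:
        if not (min_len <= len(doc.text) <= max_len):
            doc.is_rejected = True
        return doc
    return apply


def dump_json(key: str) -> Filter:
    """Wrap the cleaned text back into a one-key JSON object."""
    def apply(doc: Document) -> Document:
        doc.text = json.dumps({key: doc.text}, ensure_ascii=False)
        return doc
    return apply


class Pipeline:
    """Apply filters in order; return an empty string once a document is rejected."""

    def __init__(self, filters: Iterable[Filter]) -> None:
        self.filters = list(filters)

    def __call__(self, text: str) -> str:
        doc = Document(text)
        for f in self.filters:
            doc = f(doc)
            if doc.is_rejected:
                return ""
        return doc.text


def run_deduplicated(pipeline: Pipeline, lines: Iterable[str]) -> List[str]:
    """Run the pipeline over lines, dropping rejected and exact-duplicate outputs."""
    seen = set()
    kept = []
    for line in lines:
        result = pipeline(line)
        if result and result not in seen:
            seen.add(result)
            kept.append(result)
    return kept


cleaner = Pipeline([
    load_json("text"),
    normalize_whitespace,
    length_filter(1, 100),
    dump_json("text"),
])
raw = ['{"text": "Hello   world"}', '{"text": "Hello world"}', '{"text": ""}']
cleaned = run_deduplicated(cleaner, raw)
print(cleaned)
```

The second input normalizes to the same text as the first and is dropped as a duplicate, and the third is rejected by the length filter, so a single JSON line remains. Exact-match deduplication is used here only for brevity; it is a stand-in, not a description of how HojiChar deduplicates.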

Traffic · last 30 days

stale · no recent hits · indexed Mon Apr 27 · updated Sat Jul 11

total hits 25

actors 6 distinct systems

last hit 16d ago ByteDance

GPTBot 4

Amazonbot 4

ClaudeBot 4

ByteDance 3

Script 1

Humans 6

top countries 🇺🇸 United States · 🇸🇬 Singapore · 🇨🇦 Canada · 🇧🇩 Bangladesh · 🇻🇳 Vietnam

Resources

github github.com/HojiChar/HojiChar

package pypi.org/project/hojichar/

API endpoints

full doc /v1/registry/hojichar

install /v1/registry/hojichar/install

compatibility /v1/registry/hojichar/compatibility
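The three paths above share one pattern: a per-package root plus an optional suffix. A small sketch of building them for any package name follows. The host in `BASE_URL` is a placeholder, since the page does not state which host serves these paths, and `registry_endpoints` is a hypothetical helper rather than part of any published client.

```python
from urllib.parse import urljoin

# Placeholder host: the source lists only paths, not the server that serves them.
BASE_URL = "https://registry.example/"


def registry_endpoints(package: str) -> dict:
    """Return the three endpoint paths listed above for a given package name."""
    root = f"/v1/registry/{package}"
    return {
        "full doc": root,
        "install": f"{root}/install",
        "compatibility": f"{root}/compatibility",
    }


paths = registry_endpoints("hojichar")
# Join each path onto the placeholder host to get a full URL.
urls = {name: urljoin(BASE_URL, path) for name, path in paths.items()}

for name, url in urls.items():
    print(f"{name}: {url}")
```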