Grapheme Splitter

library 1.0.4 ·javascript

✓ verified Jun 17, 2026

grapheme-splitter is a JavaScript library designed to accurately segment strings into user-perceived characters, known as extended grapheme clusters, as defined by Unicode Standard Annex #29 (UAX #29) Default Grapheme Cluster Boundaries. It addresses fundamental issues in JavaScript's native string handling, where `String.length` and simple character iteration can misrepresent visual character counts due to multi-codepoint emojis (e.g., `🏳️‍🌈`), combining marks (like in German 'ü', Spanish 'ñ', or Hindi text), and 'Zalgo' text. Unlike `String.normalize()` or libraries like `punycode.js`, `grapheme-splitter` provides a comprehensive solution for these complex Unicode cases. The current stable version is 1.0.4, indicating a mature and stable codebase with an infrequent release cadence focused on maintenance rather than rapid feature additions. Its key differentiator is precise adherence to Unicode grapheme cluster rules, making it essential for text processing, input field validation, and display logic in internationalized applications.

Traffic · last 30 days stale · no recent hits · indexed Sun Apr 19 · updated Sat Jul 11

total hits 11

actors 5 distinct systems

last hit 18d ago Bingbot

GPTBot

Script

Search engines

top countries 🇺🇸 United States · 🇨🇦 Canada · 🇫🇷 France · RO · 🇩🇪 Germany

Resources

githubgithub.com/orling/grapheme-splitter ↗

packagewww.npmjs.com/package/grapheme-splitter ↗

API endpoints

full doc /v1/registry/grapheme-splitter

install /v1/registry/grapheme-splitter/install

compatibility /v1/registry/grapheme-splitter/compatibility