Latin-script Natural Language Parser
JSON →`parse-latin` is a JavaScript library designed for parsing natural language written in Latin-script languages, producing a Natural Language Concrete Syntax Tree (NLCST). It is currently at version 7.0.0, with a release cadence that includes regular patch and minor updates, and significant breaking changes between major versions, often related to ecosystem standards (like ESM-only) or core API adjustments. Key differentiators include its ability to precisely tokenize and structure text from diverse Latin-script languages such as Old English, Icelandic, French, and German, by correctly handling complex punctuation, word boundaries, and sentence structures. Unlike higher-level abstractions like `retext-latin`, `parse-latin` provides a lower-level API for manual manipulation of syntax trees. It explicitly handles nuances like hyphenated words, contractions (e.g., 'she’s'), and periods that don't signify sentence endings (e.g., in abbreviations), making it robust for detailed linguistic analysis and processing.
Traffic · last 30 days ↑0% vs prev 7d
top countries 🇺🇸 United States · 🇫🇷 France · 🇩🇪 Germany · 🇨🇦 Canada · 🇪🇸 Spain