Byte-Pair Embeddings (BPEmb)

library 0.3.6 · python
verified May 26, 2026

BPEmb segments raw text into byte-pair encoding (BPE) subword units and maps each unit to a pre-trained embedding, with models available for 275 languages. It is designed for NLP tasks that need efficient subword tokenization and embedding. The current version is 0.3.6; releases are sporadic and follow updates to models or features.
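
To make the two steps above concrete (segment text into subwords, then look up a vector per subword), here is a minimal, self-contained sketch. It is a toy illustration, not BPEmb's implementation: the merge rules, the embedding table, and the helper names `bpe_segment` and `embed` are all hypothetical and exist only for this example.

```python
# Toy illustration only: the merge rules and vectors below are made up
# and are NOT taken from any BPEmb model.

# Hypothetical merge rules, listed from highest to lowest priority.
HYPOTHETICAL_MERGES = [("l", "o"), ("lo", "w"), ("e", "r")]

# Hypothetical 2-dimensional embedding table keyed by subword.
TOY_EMBEDDINGS = {"low": [0.1, 0.2], "er": [0.3, 0.4]}
DIM = 2


def bpe_segment(word, merges):
    """Split a word into subwords by applying merge rules in priority order."""
    symbols = list(word)  # start from single characters
    for left, right in merges:
        merged = []
        i = 0
        while i < len(symbols):
            # Join an adjacent (left, right) pair into one symbol.
            if i + 1 < len(symbols) and symbols[i] == left and symbols[i + 1] == right:
                merged.append(left + right)
                i += 2
            else:
                merged.append(symbols[i])
                i += 1
        symbols = merged
    return symbols


def embed(subwords, table, dim):
    """Look up one vector per subword; unknown subwords get a zero vector."""
    return [table.get(s, [0.0] * dim) for s in subwords]


subwords = bpe_segment("lower", HYPOTHETICAL_MERGES)
vectors = embed(subwords, TOY_EMBEDDINGS, DIM)
print(subwords)
print(vectors)
```

In the library itself, the corresponding entry point is the `BPEmb` class, for example `BPEmb(lang="en", vs=10000, dim=100)`, whose `encode` and `embed` methods perform the segmentation and lookup with a downloaded pre-trained model. The vocabulary size and dimension shown here are example values, not a recommendation.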