PyTorch Tokenizers

library 1.2.0 · python
verified May 25, 2026

PyTorch-Tokenizers is a Python package that provides efficient C++ implementations of common tokenizers, such as SentencePiece and TikToken, together with Python bindings. It is designed primarily as a dependency for other PyTorch projects, such as ExecuTorch and torchchat, where it supports building high-performance LLM runners. The library offers efficiency gains for AI workloads, multilingual support, and high decode accuracy. It is actively maintained, and its releases (currently version 1.2.0) are aligned with major PyTorch and ExecuTorch updates.
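
Because the package is mostly pulled in as a dependency of other projects, it can be useful to check whether it is present in an environment and which names its Python bindings expose. The following is a minimal sketch that uses only the standard library. The import name `pytorch_tokenizers` and the helper `describe_module` are assumptions made for illustration and are not stated in the entry above. The sketch deliberately calls no tokenizer API, since none is documented here.

```python
import importlib
import importlib.util

# Assumption: the bindings are importable as `pytorch_tokenizers`.
# The entry above does not state the import name; adjust if it differs.
MODULE_NAME = "pytorch_tokenizers"


def describe_module(name: str = MODULE_NAME) -> dict:
    """Report whether `name` is importable and list its public names.

    Hypothetical helper for illustration; it is not part of the library.
    """
    result = {"installed": False, "importable": False, "public_names": []}

    # find_spec returns None when no top-level module with this name exists.
    try:
        spec = importlib.util.find_spec(name)
    except (ImportError, ValueError):
        spec = None
    if spec is None:
        return result
    result["installed"] = True

    # Importing can still fail, for example if a native extension is missing.
    try:
        module = importlib.import_module(name)
    except Exception:
        return result
    result["importable"] = True

    # Prefer the declared public API; otherwise fall back to non-underscore names.
    declared = getattr(module, "__all__", None)
    if declared:
        result["public_names"] = sorted(declared)
    else:
        result["public_names"] = sorted(
            n for n in dir(module) if not n.startswith("_")
        )
    return result


if __name__ == "__main__":
    print(describe_module())
```

If the module is not installed, the helper returns `installed: False` rather than raising, so it can be run safely in any environment.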
