CTranslate2

JSON →
library 4.7.1 ·python
verified May 20, 2026 install draft

CTranslate2 is a C++ and Python library for efficient inference with Transformer models. It implements a custom runtime with performance optimizations like weights quantization, layers fusion, and batch reordering to accelerate and reduce memory usage of Transformer models on CPUs and GPUs. It currently supports a wide range of encoder-decoder, decoder-only, and encoder-only models from frameworks like OpenNMT, Fairseq, and Hugging Face Transformers. The library is actively maintained with frequent releases, currently at version 4.7.1.

total hits 43
actors 9 distinct systems
last hit 18h ago SERankingBot
ByteDance
14
ChatGPT-User
7
Script
4
GPTBot
2
OAI-SearchBot
2
Search engines
1
Humans
8

top countries 🇸🇬 Singapore · 🇺🇸 United States · 🇩🇪 Germany · 🇫🇷 France · IE