Transformer Engine

JSON →
library 2.13.0 ·python
verified May 25, 2026

Transformer Engine (TE) is a library developed by NVIDIA for accelerating Transformer models on NVIDIA GPUs. It enables the use of 8-bit floating point (FP8) and 4-bit floating point (NVFP4) precision on architectures like Hopper, Ada, and Blackwell, significantly improving performance and reducing memory utilization during both training and inference. TE provides highly optimized building blocks for common Transformer architectures and an automatic mixed-precision-like API that integrates seamlessly with PyTorch and JAX. The library has frequent releases, often aligned with updates to NVIDIA's deep learning software stack.

total hits 25
actors 8 distinct systems
last hit 1d ago AhrefsBot
GPTBot
6
ChatGPT-User
5
MetaBot
4
Script
2
ByteDance
2
Search engines
1

top countries 🇺🇸 United States · 🇨🇦 Canada · 🇸🇬 Singapore · 🇩🇪 Germany · 🇫🇷 France