Transformer Engine (CUDA 12)

library 2.13.0 · python
verified May 25, 2026

Transformer Engine (TE) is a Python library by NVIDIA for accelerating Transformer models on NVIDIA GPUs. It enables lower-precision training and inference, notably 8-bit floating point (FP8) on Hopper, Ada, and Blackwell GPUs and 4-bit floating point (NVFP4) on Blackwell GPUs, which improves performance and reduces memory use. It provides highly optimized building blocks for popular Transformer architectures and an automatic-mixed-precision-like API for PyTorch and JAX. The current version is 2.13.0; releases are frequent and often track new NVIDIA hardware and software.
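To make "lower precision" concrete, the sketch below simulates in plain Python what storing a tensor in FP8 involves: values are multiplied by a per-tensor scale so the largest magnitude lands on the format's maximum, rounded to the nearest representable 8-bit value, and later divided by the same scale. It assumes the E4M3 variant of FP8 (4 exponent bits, 3 mantissa bits, largest finite value 448). The helper names `round_to_e4m3`, `quantize_per_tensor`, and `dequantize` are hypothetical and are not part of the Transformer Engine API; the library performs the equivalent work in GPU kernels, so this is only an illustration of the numerics.

```python
import math

E4M3_MAX = 448.0  # largest finite magnitude in the FP8 E4M3 format


def round_to_e4m3(x: float) -> float:
    """Round x to the nearest E4M3-representable value, saturating at +/-448.

    Hypothetical helper for illustration only; not a Transformer Engine call.
    """
    if x == 0.0 or math.isnan(x):
        return x
    sign = -1.0 if x < 0 else 1.0
    mag = min(abs(x), E4M3_MAX)      # saturate instead of overflowing
    _, e = math.frexp(mag)           # mag = m * 2**e with 0.5 <= m < 1
    exp = max(e - 1, -6)             # -6 is the smallest normal exponent
    quantum = 2.0 ** (exp - 3)       # spacing between values (3 mantissa bits)
    return sign * round(mag / quantum) * quantum


def quantize_per_tensor(values):
    """Scale values so the largest magnitude maps to E4M3_MAX, then round."""
    amax = max(abs(v) for v in values)
    scale = E4M3_MAX / amax if amax > 0 else 1.0
    return [round_to_e4m3(v * scale) for v in values], scale


def dequantize(quantized, scale):
    """Undo the scaling to recover approximate original values."""
    return [q / scale for q in quantized]


if __name__ == "__main__":
    weights = [0.5, -2.0, 0.001]
    quantized, scale = quantize_per_tensor(weights)
    print(quantized, scale)
    print(dequantize(quantized, scale))
```

The scale factor is what lets a format with only three mantissa bits cover tensors of very different magnitudes: without it, the value 0.001 above would fall below the smallest E4M3 subnormal and round to zero.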
