NVIDIA TensorRT for CUDA 12
TensorRT is a high-performance deep learning inference optimizer and runtime from NVIDIA. The `tensorrt-cu12` package provides the Python bindings specifically compiled for CUDA Toolkit 12.x. As of its latest version `10.16.1.11`, it supports optimizing and deploying trained deep learning models for faster inference on NVIDIA GPUs. Releases are frequent, typically aligning with major TensorRT core library and CUDA toolkit updates.
Common errors
- ModuleNotFoundError: No module named 'tensorrt'
  - cause: The `tensorrt-cu12` package is not installed in the current Python environment, or the environment is not activated.
  - fix: Run `pip install tensorrt-cu12` to install the package.
- tensorrt.infer.NMSPlugin_TRT.NMSPlugin: no matching engine found for requested plugin
  - cause: A required TensorRT plugin (built-in or custom) is not available, is incompatible with the loaded engine/network, or its library could not be loaded.
  - fix: Ensure all necessary plugin libraries (.so or .dll files) are in a path accessible to TensorRT (e.g., `LD_LIBRARY_PATH` on Linux), and that custom plugins are built against the correct TensorRT version. For built-in plugins, verify your TensorRT installation is complete.
- libcudart.so.12.X: cannot open shared object file: No such file or directory
  - cause: The system cannot find the CUDA runtime library: either CUDA Toolkit 12.x is not installed or its library paths are not configured for your system.
  - fix: Install NVIDIA CUDA Toolkit 12.x and ensure its `lib` directory (e.g., `/usr/local/cuda-12.X/lib64`) is on your `LD_LIBRARY_PATH` (Linux) or `PATH` (Windows).
- TypeError: argument 'network' (unambiguous type name missing)
  - cause: An incorrect type or `None` was passed to a TensorRT function that expects a specific object, typically when initializing the network or builder, or during engine building.
  - fix: Double-check the arguments passed to TensorRT API calls, especially `builder.create_network()` and `builder.build_serialized_network()` (which replaced `build_engine()` in TensorRT 10). Ensure the network object is created with valid flags and fully populated before being handed to the builder.
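The import and shared-library errors above can be triaged with a quick environment check before touching any TensorRT API. This is a minimal sketch using only the standard library; the `libcudart.so.12` name is Linux-specific (on Windows, look for `cudart64_12.dll` on `PATH` instead).

```python
import ctypes
import importlib.util

def check_environment():
    """Return human-readable diagnostics for the two most common setup problems."""
    notes = []
    # Is the tensorrt Python package importable at all?
    if importlib.util.find_spec("tensorrt") is None:
        notes.append("tensorrt missing: run `pip install tensorrt-cu12`")
    else:
        notes.append("tensorrt package found")
    # Can the CUDA 12 runtime library be loaded? (Linux library name shown.)
    try:
        ctypes.CDLL("libcudart.so.12")
        notes.append("CUDA 12 runtime found")
    except OSError:
        notes.append("libcudart.so.12 not found: check the CUDA Toolkit "
                     "install and LD_LIBRARY_PATH")
    return notes

if __name__ == "__main__":
    for note in check_environment():
        print(note)
```

Running this before any `import tensorrt` separates packaging problems from CUDA configuration problems.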
Warnings
- breaking TensorRT 10.13.2 (and subsequent versions) dropped support for Python versions older than 3.10 and CUDA 11.x. Users on older environments must upgrade their Python interpreter or use an older `tensorrt-cu12` package version.
- gotcha The `tensorrt-cu12` package requires a matching system-wide NVIDIA CUDA Toolkit and cuDNN installation (version 12.x) to be present and correctly configured (e.g., via `LD_LIBRARY_PATH` on Linux). These are *not* installed by `pip`.
- deprecated TensorRT 10.14 deprecated `pycuda` usages in its samples and shifted towards `cuda-python`. While not a direct breaking change for the core API, it indicates a shift in recommended practices for low-level CUDA interaction.
- breaking Several standard plugins (e.g., `cropAndResizeDynamic`, `DecodeBbox3DPlugin`) have been migrated from `IPluginV2` to `IPluginV3`, with `IPluginV2` versions being deprecated and scheduled for removal. Custom plugins implementing `IPluginV2` might need updates.
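Following the sample code's move from `pycuda` to `cuda-python` noted above, a raw device allocation now looks roughly like this. This is a sketch assuming the cuda-python 12.x module layout (`from cuda import cudart`); newer cuda-python releases reorganize the bindings under `cuda.bindings`, so adjust the import if needed.

```python
try:
    from cuda import cudart  # module layout for cuda-python 12.x
except ImportError:  # cuda-python not installed
    cudart = None

def alloc_device_buffer(nbytes):
    """Allocate a raw device buffer and return its pointer."""
    if cudart is None:
        raise ImportError("cuda-python is required: pip install cuda-python")
    # cuda-python returns a (status, result) tuple instead of raising
    err, ptr = cudart.cudaMalloc(nbytes)
    if err != cudart.cudaError_t.cudaSuccess:
        raise RuntimeError(f"cudaMalloc failed: {err}")
    return ptr

def free_device_buffer(ptr):
    """Release a buffer allocated by alloc_device_buffer."""
    (err,) = cudart.cudaFree(ptr)
    if err != cudart.cudaError_t.cudaSuccess:
        raise RuntimeError(f"cudaFree failed: {err}")
```

The tuple-return convention (status first, result second) is the main difference from `pycuda`, which raised exceptions instead.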
Install
-
pip install tensorrt-cu12
Imports
- tensorrt
import tensorrt as trt
- Logger
from tensorrt import Logger
Quickstart
import tensorrt as trt

# A basic example: creating a TensorRT builder and network.
# Note: a real application would parse an ONNX model and then build an engine.

# Create a logger to track verbose output and errors
TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

try:
    # Create a builder
    builder = trt.Builder(TRT_LOGGER)
    # builder.max_batch_size was removed in TensorRT 10 along with
    # implicit-batch mode, so query a capability flag instead.
    print(f"TensorRT Builder created. Fast FP16 support: {builder.platform_has_fast_fp16}")

    # Create an empty network definition. Explicit batch is the default (and
    # only) mode in TensorRT 10; the flag is kept here for older releases.
    network = builder.create_network(1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
    print("Network created with EXPLICIT_BATCH flag.")

    # Example: add an input layer (simplified; a real model has its own shapes)
    input_tensor = network.add_input(name="input_tensor", dtype=trt.float32, shape=(1, 3, 224, 224))
    print(f"Added input tensor with shape {input_tensor.shape}")

    # In a real scenario you would parse a model, e.g. with
    # trt.OnnxParser(network, TRT_LOGGER), then configure the builder
    # for engine creation and serialization.
except Exception as e:
    print(f"An error occurred: {e}")
    print("Ensure you have a compatible NVIDIA GPU and correct CUDA/cuDNN installations.")

# Clean up resources. In a short script this is optional (objects are freed
# when they go out of scope), but explicit deletion helps in long-running
# processes; only delete objects that were successfully created.
if 'network' in locals(): del network
if 'builder' in locals(): del builder
if 'TRT_LOGGER' in locals(): del TRT_LOGGER
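To go from the skeleton above to a deployable engine, the usual flow parses an ONNX file and serializes the built engine. A hedged sketch of that flow: the path passed in is a placeholder, the 1 GiB workspace limit is an arbitrary example value, and `build_serialized_network` is the TensorRT 10 API (`build_engine` was removed).

```python
try:
    import tensorrt as trt
except ImportError:  # tensorrt-cu12 not installed
    trt = None

def build_serialized_engine(onnx_path):
    """Parse an ONNX model and return the serialized engine as bytes."""
    if trt is None:
        raise ImportError("tensorrt is required: pip install tensorrt-cu12")
    logger = trt.Logger(trt.Logger.WARNING)
    builder = trt.Builder(logger)
    network = builder.create_network(0)  # explicit batch is the default in TRT 10
    parser = trt.OnnxParser(network, logger)

    with open(onnx_path, "rb") as f:
        if not parser.parse(f.read()):
            errors = "; ".join(str(parser.get_error(i))
                               for i in range(parser.num_errors))
            raise RuntimeError(f"ONNX parse failed: {errors}")

    config = builder.create_builder_config()
    # Cap the builder's scratch memory at 1 GiB; tune this for your GPU.
    config.set_memory_pool_limit(trt.MemoryPoolType.WORKSPACE, 1 << 30)

    serialized = builder.build_serialized_network(network, config)
    if serialized is None:
        raise RuntimeError("Engine build failed; check the logger output")
    return bytes(serialized)
```

The returned bytes can be written to disk and later deserialized with `trt.Runtime(logger).deserialize_cuda_engine(data)` for inference.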