NVIDIA TensorRT for CUDA 12
JSON →TensorRT is a high-performance deep learning inference optimizer and runtime from NVIDIA. The `tensorrt-cu12` package provides the Python bindings specifically compiled for CUDA Toolkit 12.x. As of its latest version `10.16.1.11`, it supports optimizing and deploying trained deep learning models for faster inference on NVIDIA GPUs. Releases are frequent, typically aligning with major TensorRT core library and CUDA toolkit updates.
Traffic · last 30 days ↑0% vs prev 7d
total hits 18
actors 6 distinct systems
last hit 22h ago ByteDance
top countries 🇺🇸 United States · 🇩🇪 Germany · 🇸🇬 Singapore · 🇨🇦 Canada · 🇫🇷 France
Resources
homepagedeveloper.nvidia.com/tensorrt ↗
API endpoints
full doc /v1/registry/tensorrt-cu12
compatibility /v1/registry/tensorrt-cu12/compatibility