NVIDIA TensorRT Model Optimizer Core

library 0.33.1 · python

✓ verified Jul 3, 2026

NVIDIA TensorRT Model Optimizer (ModelOpt) is a unified toolkit for optimizing and deploying models on NVIDIA GPUs. It supports quantization, both post-training (PTQ) and quantization-aware training (QAT), as well as pruning, distillation, and TensorRT export. As of v0.33.1, the library is actively maintained and targets Python 3.10–3.12. Releases ship approximately monthly.
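To make the PTQ idea above concrete, here is a minimal, library-free sketch of symmetric per-tensor int8 quantization with max calibration. It does not use ModelOpt and is not a description of how ModelOpt implements quantization. The function names (`calibrate_scale`, `quantize`, `dequantize`) and the sample weights are hypothetical, chosen only for illustration.

```python
def calibrate_scale(samples, num_bits=8):
    """Max calibration: map the largest observed magnitude to the top integer level."""
    qmax = 2 ** (num_bits - 1) - 1  # 127 for int8
    amax = max(abs(x) for x in samples)
    return amax / qmax if amax > 0 else 1.0


def quantize(values, scale, num_bits=8):
    """Round each value to the nearest integer level and clamp to the symmetric range."""
    qmax = 2 ** (num_bits - 1) - 1
    return [max(-qmax, min(qmax, round(v / scale))) for v in values]


def dequantize(levels, scale):
    """Map integer levels back to approximate real values."""
    return [q * scale for q in levels]


# Hypothetical weights standing in for one tensor of a trained model.
weights = [0.5, -1.27, 0.003, 1.0]
scale = calibrate_scale(weights)
levels = quantize(weights, scale)
restored = dequantize(levels, scale)
```

Each restored value differs from the original by at most half a quantization step (`scale / 2`). That bounded rounding error is what PTQ accepts without retraining, and what QAT instead exposes to the model during training.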

indexed Sun Jun 07 · updated Sat Jul 11

Resources

github github.com/NVIDIA/TensorRT-Model-Optimizer

homepage github.com/NVIDIA/TensorRT-Model-Optimizer

API endpoints

full doc /v1/registry/nvidia-modelopt-core

install /v1/registry/nvidia-modelopt-core/install
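The endpoint paths above are relative, so a client has to resolve them against the registry's host. A minimal sketch of that step follows. The page does not state the host, so `BASE_URL` is a hypothetical placeholder, and the dictionary keys are illustrative names rather than part of any documented API.

```python
from urllib.parse import urljoin

# Hypothetical host: the listing gives only the paths, not the registry's domain.
BASE_URL = "https://registry.example/"

# Paths exactly as listed under "API endpoints".
ENDPOINTS = {
    "full_doc": "/v1/registry/nvidia-modelopt-core",
    "install": "/v1/registry/nvidia-modelopt-core/install",
}

# Resolve each relative path against the base URL.
urls = {name: urljoin(BASE_URL, path) for name, path in ENDPOINTS.items()}
```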