NVIDIA Model Optimizer

JSON →
library 0.42.0 ·python
verified May 24, 2026

NVIDIA Model Optimizer (nvidia-modelopt) is an open toolkit designed to accelerate AI inference by applying state-of-the-art model optimization techniques such as quantization, pruning, and distillation. It primarily targets PyTorch and ONNX models, integrating directly into the training loop and enabling seamless deployment to NVIDIA's inference frameworks like TensorRT-LLM and TensorRT. The library is actively developed, with its current stable version being 0.42.0, and frequent pre-release candidates (e.g., 0.43.0rcX) indicating a rapid release cadence.

total hits 23
actors 10 distinct systems
last hit 9h ago ChatGPT-User
ChatGPT-User
4
MetaBot
4
ByteDance
3
Script
2
OAI-SearchBot
2
ClaudeBot
1
PerplexityBot
1
Search engines
1

top countries 🇺🇸 United States · 🇸🇬 Singapore · 🇫🇷 France · 🇩🇪 Germany · 🇨🇦 Canada