Triton Inference Server Python Client
The `tritonclient` library provides Python APIs for interacting with NVIDIA Triton Inference Server. It supports both HTTP/REST and gRPC protocols, allowing applications to send inference requests, retrieve server and model status, manage models, and perform other tasks. Currently at version 2.67.0 (released March 27, 2026), it is actively maintained with a release cadence that generally aligns with the broader Triton Inference Server project.
Warnings
- gotcha When using `InferInput` or `InferRequestedOutput`, ensure you import them from the correct protocol submodule (`tritonclient.http` or `tritonclient.grpc`) corresponding to the client you are using. Mixing them will lead to errors.
- gotcha For BYTES tensors (variable-length binary data/strings), it is recommended to use `numpy.object_` for the dtype of the NumPy array. While `numpy.bytes_` is supported for backward compatibility, `numpy.object_` is the preferred and more robust type.
- gotcha The gRPC client (`tritonclient.grpc.InferenceServerClient`) has known limitations where it does not support timeouts for model configuration and model metadata requests. The HTTP client may also not correctly respect timeouts under 1 second.
- gotcha Avoid using `tritonclient.utils.cuda_shared_memory` APIs in multithreaded environments: known issues in the underlying CuPy library can cause instability until they are fixed upstream.
- gotcha When communicating with decoupled models (models that can return multiple responses over time), the order of responses received by the streaming gRPC client may not always match the order in which they were sent by the backend for different requests.
- gotcha Triton Client PIP wheels for ARM SBSA are not available on PyPI. Installing `tritonclient` via `pip` on ARM SBSA systems may result in an incorrect Jetson version of the library being installed, leading to compatibility issues.
Install
- Base package
pip install tritonclient
- With all optional protocol dependencies
pip install tritonclient[all]
Imports
- InferenceServerClient (HTTP)
from tritonclient.http import InferenceServerClient
- InferenceServerClient (gRPC)
from tritonclient.grpc import InferenceServerClient
- InferInput (HTTP; also available from tritonclient.grpc)
from tritonclient.http import InferInput
- InferRequestedOutput (gRPC; also available from tritonclient.http)
from tritonclient.grpc import InferRequestedOutput
- InferenceServerException
from tritonclient.utils import InferenceServerException
- np_to_triton_dtype
from tritonclient.utils import np_to_triton_dtype
Quickstart
import os

import numpy as np
import tritonclient.http as tritonhttp

TRITON_SERVER_URL = os.environ.get('TRITON_SERVER_URL', 'localhost:8000')
MODEL_NAME = 'simple_model'
MODEL_VERSION = '1'
INPUT_NAME = 'input_0'
OUTPUT_NAME = 'output_0'

def main():
    try:
        # Create a Triton HTTP client
        client = tritonhttp.InferenceServerClient(url=TRITON_SERVER_URL)

        # Check server readiness
        if not client.is_server_ready():
            print(f"Triton server at {TRITON_SERVER_URL} is not ready.")
            return
        print(f"Triton server at {TRITON_SERVER_URL} is ready.")

        # Prepare input data (e.g., a simple numpy array)
        input_data = np.random.rand(1, 16).astype(np.float32)

        # Create InferInput object
        infer_input = tritonhttp.InferInput(INPUT_NAME, input_data.shape, 'FP32')
        infer_input.set_data_from_numpy(input_data, binary_data=True)

        # Create InferRequestedOutput object
        infer_output = tritonhttp.InferRequestedOutput(OUTPUT_NAME, binary_data=True)

        # Send inference request
        response = client.infer(
            model_name=MODEL_NAME,
            inputs=[infer_input],
            outputs=[infer_output],
            model_version=MODEL_VERSION,
        )

        # Get output as numpy array
        output_data = response.as_numpy(OUTPUT_NAME)
        print(f"Inference successful! Output shape: {output_data.shape}")
        print(f"First 5 output values: {output_data.flatten()[:5]}")
    except Exception as e:
        print(f"An error occurred: {e}")

if __name__ == '__main__':
    main()