ONNX Runtime (GPU)
ONNX Runtime is a high-performance inference engine for ONNX models. The `onnxruntime-gpu` package adds GPU acceleration (e.g., via CUDA or ROCm) on top of the core ONNX Runtime. It is actively developed by Microsoft, with frequent releases that typically track new ONNX operator sets and performance improvements; the current version is 1.24.4.
Warnings
- gotcha The `onnxruntime-gpu` package requires a specific CUDA Toolkit and cuDNN version to be installed on your system. Mismatched versions are a very common cause of `InferenceSession` initialization failures or runtime errors.
- gotcha When using `onnxruntime-gpu`, you must explicitly specify execution providers like `['CUDAExecutionProvider', 'CPUExecutionProvider']` during `InferenceSession` creation to ensure GPU acceleration is attempted. If not specified, ONNX Runtime might default to CPU execution even with the GPU package installed.
- gotcha There are two main PyPI packages: `onnxruntime` (CPU-only) and `onnxruntime-gpu` (GPU-enabled). Installing `onnxruntime-gpu` does *not* automatically remove `onnxruntime`. If both are installed, `onnxruntime` might be used by default or cause conflicts, leading to unexpected CPU-only execution.
- breaking Starting with ONNX Runtime version 1.17, official support for Python 3.8 and 3.9 was dropped. Version 1.24.0 and later also dropped support for Python 3.10. The current version (1.24.4) explicitly requires Python >= 3.11.
Install
- pip install onnxruntime-gpu
Imports
- InferenceSession
import onnxruntime as ort
session = ort.InferenceSession(...)
Quickstart
import onnxruntime as ort
import numpy as np
import onnx
from onnx import helper, TensorProto
import os
# 1. Create a dummy ONNX model for demonstration
# Define the graph (input, output, and node)
X = helper.make_tensor_value_info('X', TensorProto.FLOAT, [None, 3])
Y = helper.make_tensor_value_info('Y', TensorProto.FLOAT, [None, 3])
node = helper.make_node('Relu', ['X'], ['Y'])
graph = helper.make_graph([node], 'simple_relu', [X], [Y])
model = helper.make_model(graph, producer_name='onnx-example')
# Save it to a temporary file
model_path = "simple_relu.onnx"
onnx.save(model, model_path)
# 2. Load the model with GPU provider
try:
    # Prioritize CUDAExecutionProvider for NVIDIA GPUs;
    # fall back to CPUExecutionProvider if CUDA is not available or fails
    session = ort.InferenceSession(
        model_path,
        providers=["CUDAExecutionProvider", "CPUExecutionProvider"]
    )
    print("ONNX Runtime session created with providers:", session.get_providers())

    # Prepare dummy input data
    input_data = np.random.rand(1, 3).astype(np.float32)

    # Run inference
    output = session.run(None, {'X': input_data})
    print("Inference successful. Output shape:", output[0].shape)
except Exception as e:
    print(f"\nError creating ONNX Runtime session or running inference: {e}")
    print("Make sure you have a compatible CUDA environment (or other GPU runtime)")
    print("and the correct onnxruntime-gpu package installed.\n")
    print("If CUDA is not available, try removing 'CUDAExecutionProvider' from the providers list.")
finally:
    # Clean up the dummy model file
    if os.path.exists(model_path):
        os.remove(model_path)