ONNX Runtime (GPU)

library 1.24.4 · python
verified May 20, 2026

ONNX Runtime is a high-performance inference engine for ONNX models. The `onnxruntime-gpu` package builds on the core ONNX Runtime and adds GPU acceleration through execution providers such as CUDA and TensorRT. It is actively developed by Microsoft, and its frequent releases often track new ONNX operator sets and bring performance improvements. The current version is 1.24.4.
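As a minimal sketch of how a GPU-backed session is typically requested, the example below asks ONNX Runtime which execution providers are available and passes them in preference order, so inference falls back to the CPU when no GPU provider is present. The helper names `pick_providers` and `make_session`, the preference order, and the model path are assumptions for illustration. `get_available_providers()` and `InferenceSession(..., providers=...)` are the library calls used.

```python
from typing import List, Sequence

# Preference order is an assumption for illustration: GPU first, CPU as fallback.
PREFERRED = ("CUDAExecutionProvider", "CPUExecutionProvider")


def pick_providers(available: Sequence[str],
                   preferred: Sequence[str] = PREFERRED) -> List[str]:
    """Return the preferred providers that are actually available, in order."""
    chosen = [p for p in preferred if p in available]
    # Fall back to the CPU provider if nothing from the preference list is present.
    return chosen or ["CPUExecutionProvider"]


def make_session(model_path: str):
    """Create an InferenceSession, requesting the GPU provider when present.

    `model_path` is a hypothetical path to an .onnx file supplied by the caller.
    """
    import onnxruntime as ort  # provided by `pip install onnxruntime-gpu`

    providers = pick_providers(ort.get_available_providers())
    return ort.InferenceSession(model_path, providers=providers)


if __name__ == "__main__":
    try:
        import onnxruntime as ort
    except ImportError:
        print("onnxruntime is not installed")
    else:
        # Report the installed version and the providers that would be requested.
        print(ort.__version__, pick_providers(ort.get_available_providers()))
```

After creating a session, calling `get_providers()` on it shows which providers were actually enabled, which is a quick way to confirm that the GPU path is in use rather than a silent CPU fallback.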
