ONNX Runtime (GPU)
ONNX Runtime is a high-performance inference engine for ONNX models. The `onnxruntime-gpu` package builds on the core ONNX Runtime and adds GPU acceleration (e.g., via CUDA or ROCm). Microsoft actively develops it, with frequent releases that often align with new ONNX operator sets and performance improvements. The current version is 1.24.4.
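As a rough illustration of how GPU acceleration is typically requested, the sketch below picks execution providers in preference order and falls back to the CPU when no GPU provider is available. The helper names `pick_providers` and `create_session`, and the model path passed to it, are hypothetical and not taken from this page; `get_available_providers` and `InferenceSession(..., providers=...)` are part of the ONNX Runtime Python API.

```python
def pick_providers(available,
                   preferred=("CUDAExecutionProvider", "CPUExecutionProvider")):
    """Return the preferred providers that are actually available, in order."""
    return [name for name in preferred if name in available]


def create_session(model_path):
    """Open an inference session, preferring the GPU when it is usable.

    `model_path` is a caller-supplied path to an ONNX file (hypothetical here).
    """
    # Imported lazily so pick_providers can be used without the package installed.
    import onnxruntime as ort

    providers = pick_providers(ort.get_available_providers())
    session = ort.InferenceSession(model_path, providers=providers)
    # get_providers() reports what the session really ended up using.
    return session, session.get_providers()
```

Listing `CPUExecutionProvider` last keeps the same code working on machines without a usable GPU, at the cost of silently running slower there; checking the returned provider list makes that fallback visible.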
Resources
homepage: onnxruntime.ai
API endpoints
full doc: /v1/registry/onnxruntime-gpu
compatibility: /v1/registry/onnxruntime-gpu/compatibility
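The endpoint paths above are relative, so a client has to combine them with the registry's host. A minimal sketch of that step follows; the function name `registry_urls` is hypothetical, and the base URL is deliberately left to the caller because this page does not state the host.

```python
from urllib.parse import urljoin


def registry_urls(base_url, package="onnxruntime-gpu"):
    """Build absolute URLs for the two endpoints listed for a package.

    `base_url` is an assumption supplied by the caller, e.g. the scheme and
    host of whichever registry serves these paths.
    """
    doc_path = f"/v1/registry/{package}"
    return {
        "full_doc": urljoin(base_url, doc_path),
        "compatibility": urljoin(base_url, doc_path + "/compatibility"),
    }
```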