llama-cpp-python: Python Bindings for llama.cpp

JSON →
library 0.3.20 ·python
verified May 23, 2026

Python bindings for the `llama.cpp` library, enabling efficient local inference of large language models (LLMs) on various hardware, including CPUs and GPUs (NVIDIA, Apple Metal, AMD ROCm). It provides both a high-level API for easy model interaction and a low-level API for direct C API access. The library is actively maintained with frequent updates, often mirroring upstream `llama.cpp` changes, and currently stands at version 0.3.20.

total hits 16
actors 5 distinct systems
last hit 1d ago SERankingBot
Script
3
GPTBot
2

top countries 🇩🇪 Germany · 🇫🇷 France · 🇺🇸 United States · 🇮🇳 India · 🇨🇦 Canada