llama-cpp-python: Python Bindings for llama.cpp
JSON →Python bindings for the `llama.cpp` library, enabling efficient local inference of large language models (LLMs) on various hardware, including CPUs and GPUs (NVIDIA, Apple Metal, AMD ROCm). It provides both a high-level API for easy model interaction and a low-level API for direct C API access. The library is actively maintained with frequent updates, often mirroring upstream `llama.cpp` changes, and currently stands at version 0.3.20.
Traffic · last 30 days ↑33% vs prev 7d
total hits 16
actors 5 distinct systems
last hit 1d ago SERankingBot
top countries 🇩🇪 Germany · 🇫🇷 France · 🇺🇸 United States · 🇮🇳 India · 🇨🇦 Canada
Resources
API endpoints
full doc /v1/registry/llama-cpp-python
compatibility /v1/registry/llama-cpp-python/compatibility