node-llama-cpp

library 3.18.1 · javascript

✓ verified Jun 7, 2026

Run large language models (LLMs) locally from Node.js through llama.cpp bindings. Version 3.18.1 ships pre-built binaries for macOS, Linux, and Windows with Metal, CUDA, and Vulkan support, and falls back automatically to building from source with cmake (no node-gyp or Python required). It supports JSON schema enforcement on model output, function calling, embeddings, reranking, and chat sessions, and includes full TypeScript types. The project is actively developed, with frequent releases that track upstream llama.cpp. Key differentiators: zero-config GPU acceleration and protection against special-token injection.
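To make the chat-session and JSON-schema features concrete, here is a minimal sketch. It assumes the package is installed (`npm install node-llama-cpp`) and that a GGUF model file exists at whatever path you pass in. `ANSWER_SCHEMA` and `askForJson` are illustrative names, not part of the library. The library calls follow its documented v3 API, but check them against the version you install.

```javascript
// Illustrative schema (an assumption for this sketch): the shape the model's reply must follow.
const ANSWER_SCHEMA = {
    type: "object",
    properties: {
        answer: {type: "string"},
        confidence: {type: "number"}
    }
};

// Hypothetical helper: loads a local model, asks one question, and returns
// the reply parsed as an object that conforms to ANSWER_SCHEMA.
async function askForJson(modelPath, question) {
    // Deferred import: the native bindings load only when this function is called.
    const {getLlama, LlamaChatSession} = await import("node-llama-cpp");

    // getLlama() picks a compute backend without extra configuration.
    const llama = await getLlama();
    const model = await llama.loadModel({modelPath});
    const context = await model.createContext();
    const session = new LlamaChatSession({
        contextSequence: context.getSequence()
    });

    // The grammar constrains generation so the output matches the schema.
    const grammar = await llama.createGrammarForJsonSchema(ANSWER_SCHEMA);
    const raw = await session.prompt(question, {grammar});

    // parse() turns the constrained text into a plain object.
    return grammar.parse(raw);
}
```

A call might look like `await askForJson("./models/my-model.gguf", "Is the sky blue?")`, where the model path is a placeholder. Loading the model on every call keeps the sketch short. Real code would load it once and reuse the session.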

indexed Sun Jun 07 · updated Sat Jul 11

Resources

github github.com/withcatai/node-llama-cpp

package node-llama-cpp

homepage node-llama-cpp.withcat.ai

API endpoints

full doc /v1/registry/node-llama-cpp

install /v1/registry/node-llama-cpp/install
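The two paths above share a prefix, which can be sketched as a small helper. `registryPaths` is a hypothetical name. The page shows the `/v1/registry/<package>` pattern only for this package, so applying it to other packages is an assumption. The listing does not state the host or the response format, so both are left out.

```javascript
// Hypothetical helper: rebuilds the two registry paths listed above
// from a package name. Only the path pattern shown on this page is covered.
function registryPaths(packageName) {
    const base = `/v1/registry/${packageName}`;
    return {
        fullDoc: base,              // full documentation entry
        install: `${base}/install`  // install instructions
    };
}

const paths = registryPaths("node-llama-cpp");
console.log(paths.fullDoc);
console.log(paths.install);
```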