GPTCache

0.1.44 · active · verified Tue Apr 14

GPTCache is a caching library designed to speed up and lower the cost of chat applications that rely on Large Language Model (LLM) services. It functions as a semantic cache, storing and retrieving responses for semantically similar (not just identical) queries using embedding models and vector stores. The library is actively maintained with frequent minor releases.
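To make the idea concrete, here is a minimal, library-free sketch of how a semantic-cache lookup works: each query is embedded as a vector, and a cached response is returned when cosine similarity to a stored query clears a threshold. The `ToySemanticCache` class and the letter-frequency embedding are illustrative stand-ins, not GPTCache's actual implementation, which uses real embedding models and vector stores.

```python
import math

def cosine_similarity(a, b):
    # Cosine similarity between two equal-length vectors
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def embed(text):
    # Toy embedding: letter-frequency vector (real systems use neural embeddings)
    text = text.lower()
    return [text.count(chr(c)) for c in range(ord("a"), ord("z") + 1)]

class ToySemanticCache:
    """Serve a cached response when a new query's embedding is close
    enough to a previously seen query's embedding."""

    def __init__(self, embed_func, threshold=0.9):
        self.embed_func = embed_func
        self.threshold = threshold
        self.entries = []  # list of (embedding, response) pairs

    def put(self, query, response):
        self.entries.append((self.embed_func(query), response))

    def get(self, query):
        q = self.embed_func(query)
        best_resp, best_sim = None, self.threshold
        for emb, resp in self.entries:
            sim = cosine_similarity(q, emb)
            if sim >= best_sim:
                best_resp, best_sim = resp, sim
        return best_resp  # None on a cache miss

sc = ToySemanticCache(embed)
sc.put("What is the capital of France?", "Paris")
print(sc.get("what is the capital of France"))  # cache hit: Paris
print(sc.get("zzz qqq jjj"))                    # cache miss: None
```

GPTCache follows the same shape but swaps in production components: neural embedding functions, a vector index for nearest-neighbor search instead of a linear scan, and a pluggable similarity evaluator.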

Warnings

The `gptcache.adapter.openai` module used in the quickstart below mirrors the legacy (pre-1.0) `openai` Python SDK interface (`openai.ChatCompletion.create`). The `openai` package changed this interface in v1.0, so you may need to pin a compatible `openai` release, or verify that your GPTCache version supports the newer client.

Install
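GPTCache is published on PyPI; a typical installation is:

```shell
pip install gptcache
```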

Imports

The quickstart below uses two imports: `from gptcache import cache` for cache initialization, and `from gptcache.adapter import openai` for the drop-in OpenAI wrapper.

Quickstart

This quickstart demonstrates how to integrate GPTCache with the OpenAI API. Once the cache is initialized, OpenAI calls made through the `gptcache.adapter.openai` module are cached automatically: the first query goes to the LLM, while identical or semantically similar follow-up queries are served from the cache.

import os
from gptcache import cache
from gptcache.adapter import openai

# Provide your OpenAI API key via the OPENAI_API_KEY environment variable
# (the placeholder is only used if no key is already set)
os.environ.setdefault("OPENAI_API_KEY", "sk-...")

# Initialize GPTCache with its default configuration
cache.init()
cache.set_openai_key()  # passes OPENAI_API_KEY through to the openai client

# The gptcache.adapter.openai module automatically wraps the openai library
# Subsequent OpenAI API calls will use the cache
response1 = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[
        {"role": "user", "content": "Hello, what is the capital of France?"}
    ]
)
print(f"First response (likely from LLM): {response1.choices[0].message.content}")

# A second identical request will hit the cache for faster response and cost savings
response2 = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[
        {"role": "user", "content": "Hello, what is the capital of France?"}
    ]
)
print(f"Second response (likely from cache): {response2.choices[0].message.content}")
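Beyond the defaults, `cache.init` accepts a custom embedding function, data manager, and similarity evaluator. The sketch below, adapted from common GPTCache usage patterns, wires an ONNX embedding model to a FAISS vector store with SQLite scalar storage. The helper name `build_semantic_cache` is ours, and the exact module paths and parameters should be checked against your installed GPTCache version.

```python
def build_semantic_cache():
    """Initialize GPTCache with an ONNX embedding model and a FAISS
    vector store backed by SQLite (illustrative configuration)."""
    # Imports are local so this module loads even without GPTCache installed
    from gptcache import cache
    from gptcache.embedding import Onnx
    from gptcache.manager import CacheBase, VectorBase, get_data_manager
    from gptcache.similarity_evaluation.distance import SearchDistanceEvaluation

    onnx = Onnx()  # downloads a small ONNX embedding model on first use
    data_manager = get_data_manager(
        CacheBase("sqlite"),
        VectorBase("faiss", dimension=onnx.dimension),
    )
    cache.init(
        embedding_func=onnx.to_embeddings,
        data_manager=data_manager,
        similarity_evaluation=SearchDistanceEvaluation(),
    )
    cache.set_openai_key()
```

Call `build_semantic_cache()` in place of the plain `cache.init()` above to enable similarity-based matching across paraphrased queries rather than only near-exact ones.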
