quelllm-mcp
JSON →Query a catalog of 250+ open-weights LLMs â list, compare, estimate VRAM and API-vs-self-hosted cost â directly from Claude Code, Cursor or any MCP client.
Install
uvx --from Tools · 6
- list_models List models with filters (origin code, family, max params in B)
- get_model Full record for one model (params, vram per quant, context window, family, tags, license, URLs)
- compare Side-by-side comparison with verdict
- estimate_vram VRAM in GB at chosen quant + recommended GPU/Mac tiers
- estimate_cost Cost in EUR — full table API providers vs self-hosted hardware OR a specific id
- search_models Fuzzy search by name, family, tag, author