DeepEval
JSON →DeepEval is an LLM evaluation framework that helps developers evaluate any LLM workflow, from simple prompt chains to complex multi-step agents. It provides a suite of metrics for various evaluation aspects like relevancy, faithfulness, hallucination, and agentic task completion. Currently at version 3.9.6, the library maintains a frequent release cadence, often introducing new metrics, test case types, and developer experience improvements.
Traffic · last 30 days ↑175% vs prev 7d
total hits 23
actors 5 distinct systems
last hit 2d ago human
top countries 🇸🇬 Singapore · 🇺🇸 United States · 🇸🇪 Sweden · 🇨🇳 China · 🇩🇪 Germany
API endpoints
full doc /v1/registry/deepeval
install /v1/registry/deepeval/install
compatibility /v1/registry/deepeval/compatibility