DeepEval
DeepEval is an LLM evaluation framework that helps developers evaluate any LLM workflow, from simple prompt chains to complex multi-step agents. It provides a suite of metrics for various evaluation aspects like relevancy, faithfulness, hallucination, and agentic task completion. Currently at version 3.9.6, the library maintains a frequent release cadence, often introducing new metrics, test case types, and developer experience improvements.
Warnings
- breaking Breaking change in v3.0.8: Conversational test cases must now use a `list[Turn]` instead of `list[LLMTestCase]`.
- breaking Major API overhaul in v3.0: Significant changes for defining complex LLM workflows and agents.
- gotcha DeepEval provides multiple `TestCase` types (`LLMTestCase`, `MLLMTestCase`, `ArenaTestCase`, `Turn`). Using the incorrect `TestCase` type for a specific evaluation scenario (e.g., `LLMTestCase` for multi-turn conversations after v3.0.8) is a common error.
- gotcha Most DeepEval metrics rely on an underlying Large Language Model (LLM) for their evaluation logic, requiring an API key (e.g., `OPENAI_API_KEY`, `COHERE_API_KEY`) to be set.
Install
-
pip install deepeval
Imports
- evaluate
from deepeval import evaluate
- LLMTestCase
from deepeval.test_case import LLMTestCase
- Turn
from deepeval.test_case import Turn
- AnswerRelevancyMetric
from deepeval.metrics import AnswerRelevancyMetric
Quickstart
import os
from deepeval import evaluate
from deepeval.test_case import LLMTestCase
from deepeval.metrics import AnswerRelevancyMetric

# Configure your LLM API key (e.g., OpenAI, Cohere, etc.)
# Most metrics require an LLM to run. Set this as an environment variable:
# os.environ["OPENAI_API_KEY"] = "your_openai_api_key_here"

# Define a simple LLM test case
test_case = LLMTestCase(
    input="What is the capital of France?",
    actual_output="Paris is the capital of France.",
    expected_output="Paris",
    context=["France is a country in Western Europe. Its capital is Paris."],
    retrieval_context=["Paris is known for the Eiffel Tower."],
)

# Initialize a metric, e.g., AnswerRelevancyMetric.
# Some metrics accept additional parameters or a specific LLM model.
answer_relevancy_metric = AnswerRelevancyMetric(threshold=0.7)

# Run the evaluation. evaluate() is synchronous (it handles async
# execution internally) and returns an EvaluationResult.
results = evaluate(test_cases=[test_case], metrics=[answer_relevancy_metric])

print("Evaluation Results:")
for test_result in results.test_results:
    print(f"  Input: {test_result.input}")
    print(f"  Actual Output: {test_result.actual_output}")
    # Attribute names below follow recent v3.x releases; they may
    # differ slightly in older versions.
    for metric_data in test_result.metrics_data:
        print(f"    Metric: {metric_data.name}, Score: {metric_data.score}, Pass: {metric_data.success}")