DeepEval

JSON →
library 3.9.6 ·python
verified May 20, 2026

DeepEval is an LLM evaluation framework that helps developers evaluate any LLM workflow, from simple prompt chains to complex multi-step agents. It provides a suite of metrics for various evaluation aspects like relevancy, faithfulness, hallucination, and agentic task completion. Currently at version 3.9.6, the library maintains a frequent release cadence, often introducing new metrics, test case types, and developer experience improvements.

total hits 23
actors 5 distinct systems
last hit 2d ago human
ByteDance
9
Script
3
GPTBot
2
Humans
5

top countries 🇸🇬 Singapore · 🇺🇸 United States · 🇸🇪 Sweden · 🇨🇳 China · 🇩🇪 Germany