Berkeley Function Calling Leaderboard Evaluation
JSON →bfcl-eval is the Python library for the Berkeley Function Calling Leaderboard (BFCL), a benchmark to evaluate Large Language Models (LLMs) on their ability to perform function calling. It provides the evaluation pipeline and datasets, including support for multi-step and multi-turn function calls as of its V3 release. The library is actively maintained with frequent updates, with its current PyPI version being 2026.3.23.
Traffic · last 30 days ↑100% vs prev 7d
total hits 33
actors 11 distinct systems
last hit 1d ago Amazonbot
top countries 🇺🇸 United States · 🇩🇪 Germany · 🇸🇬 Singapore · 🇨🇦 Canada · 🇫🇷 France
API endpoints
full doc /v1/registry/bfcl-eval
install /v1/registry/bfcl-eval/install
compatibility /v1/registry/bfcl-eval/compatibility