SWE-bench
The official SWE-bench package (current version 4.1.0) provides a benchmark for evaluating large language models (LLMs) on software engineering tasks: it automatically tests model-generated code fixes against real-world software bugs. The package is actively developed, with frequent updates and significant changes between major versions.
Warnings
- breaking SWE-bench v4.0.0 introduced significant breaking changes related to how Docker environments are specified and managed. If upgrading from earlier versions (e.g., v3.x), review the new Docker integration patterns.
- breaking SWE-bench v3.0.0 included a major refactor with breaking changes to how environments are specified and built for task evaluation. Code relying on older environment configuration schemas will likely fail.
- gotcha While `pip install swebench` installs the core library, running actual task evaluations (which build and test code environments) requires `conda` and `docker` to be pre-installed and properly configured on your system.
- gotcha The SWE-bench benchmark dataset itself is not included with the `pip` package. It must be separately downloaded using the `swebench download` CLI command before you can programmatically access tasks using `get_tasks`.
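Both gotchas above can be caught before an evaluation run with a quick preflight check. The sketch below uses only the standard library; the tool names come from the warning above, and the default `data/default_swebench_tasks.json` path mirrors the Quickstart below — adjust both to your setup.

```python
import os
import shutil


def missing_prereqs(tools=("conda", "docker")):
    """Return the subset of `tools` not found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


def data_ready(data_path=os.path.join("data", "default_swebench_tasks.json")):
    """True if a downloaded SWE-bench task file exists at `data_path`."""
    return os.path.isfile(data_path)


if __name__ == "__main__":
    gaps = missing_prereqs()
    if gaps:
        print(f"Missing prerequisites: {', '.join(gaps)}")
    if not data_ready():
        print("Task data not found; run `swebench download` first.")
```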
Install
-
pip install swebench
Imports
- get_tasks
from swebench import get_tasks
- SWEBenchRunner
from swebench.harness.runner import SWEBenchRunner
- ModelEngine
from swebench.harness.engine_wrappers import ModelEngine
Quickstart
import os

from swebench import get_tasks

# --- Quickstart: Accessing SWE-bench data ---
# Note: SWE-bench data must be downloaded separately using the CLI:
#     swebench download
# This command typically creates a 'data' directory in your current working directory.
# Adjust data_path if your data is located elsewhere (e.g., a specific split like lite).
data_path = os.path.join(os.getcwd(), 'data', 'default_swebench_tasks.json')
# Use 'lite_swebench_tasks.json' instead for the smaller lite split.

tasks = []
try:
    # Attempt to load tasks from the specified path
    tasks = get_tasks(data_path=data_path)
    print(f"Successfully loaded {len(tasks)} tasks from {data_path}")
    if tasks:
        print("\nExample task structure (first task):")
        # Print a subset of a task's fields for brevity
        first_task = tasks[0]
        for key in ['repo', 'pull_request', 'instance_id', 'problem_statement', 'base_commit']:
            if key in first_task:
                # Coerce to str before slicing: not every field is guaranteed to be a string
                value = str(first_task[key])
                print(f"  {key}: {value[:100]}{'...' if len(value) > 100 else ''}")
except FileNotFoundError:
    print(f"Error: Data file not found at {data_path}.")
    print("Please ensure you have run `swebench download` in your terminal,")
    print("or specify the correct path to your downloaded SWE-bench JSON data.")
except Exception as e:
    print(f"An unexpected error occurred: {e}")
# --- Further steps (beyond this quickstart): ---
# For running a full evaluation, you would typically initialize a `SWEBenchRunner`
# and integrate a `ModelEngine` to test your LLM's code generation.
# This process heavily relies on pre-installed 'conda' and 'docker' for
# environment creation and isolated task execution.
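The evaluation wiring described above can be sketched as follows. This is only an outline under stated assumptions: `SWEBenchRunner` and `ModelEngine` are the names from the Imports section, but the constructor parameters (`model_name`, `tasks`, `engine`) and the `run()` method are assumptions about their interfaces, not documented signatures — check the package's own harness documentation before relying on them.

```python
def run_full_evaluation(data_path, model_name="my-model"):
    """Outline of a full SWE-bench evaluation run.

    NOTE: the constructor arguments and the `run()` call below are
    assumptions about the harness interface, not documented signatures.
    Requires `conda` and `docker` to be installed and configured.
    """
    # Imports are deferred so this sketch can be defined without the
    # full harness (and its conda/docker prerequisites) present.
    from swebench import get_tasks
    from swebench.harness.runner import SWEBenchRunner
    from swebench.harness.engine_wrappers import ModelEngine

    tasks = get_tasks(data_path=data_path)           # previously downloaded tasks
    engine = ModelEngine(model_name=model_name)      # assumed parameter name
    runner = SWEBenchRunner(tasks=tasks, engine=engine)  # assumed parameters
    return runner.run()                              # assumed entry point
```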