SWE-smith

JSON →
library 0.0.9 ·python
verified May 21, 2026

SWE-smith is an open-source Python toolkit designed for generating large-scale software engineering training data. It enables users to turn any GitHub repository into a 'SWE-gym' to create unlimited task instances (e.g., file localization, program repair, SWE-bench) for training Software Engineering (SWE) agents. The current version is 0.0.9, and it appears to be actively developed, with frequent updates and an upcoming NeurIPS 2025 Datasets & Benchmarks Track spotlight. [2, 4, 6]

total hits 11
actors 7 distinct systems
last hit 1d ago ByteDance
Script
2
ChatGPT-User
2
OAI-SearchBot
2
ByteDance
1
Search engines
2

top countries 🇺🇸 United States · 🇩🇪 Germany · 🇸🇬 Singapore · 🇨🇦 Canada · 🇫🇷 France