Massive Text Embedding Benchmark (MTEB)
JSON →MTEB (Massive Text Embedding Benchmark) is a Python framework for evaluating embeddings and retrieval systems across diverse NLP tasks, including classification, clustering, retrieval, reranking, and semantic textual similarity. It supports over 1000 languages and various modalities like text and image, with continuous expansion. As of version 2.12.16, it aims to provide a standardized, comprehensive, and reproducible way to compare embedding models. The library maintains a frequent release cadence with minor updates often occurring weekly.
Traffic · last 30 days ↑13% vs prev 7d
total hits 17
actors 6 distinct systems
last hit 18h ago ByteDance
top countries 🇸🇬 Singapore · 🇩🇪 Germany · 🇺🇸 United States · 🇨🇦 Canada · 🇮🇳 India
Resources
packagepypi.org/project/mteb/ ↗
API endpoints
full doc /v1/registry/mteb
install /v1/registry/mteb/install
compatibility /v1/registry/mteb/compatibility