Massive Text Embedding Benchmark (MTEB)
MTEB (Massive Text Embedding Benchmark) is a Python framework for evaluating embedding models and retrieval systems across diverse NLP tasks, including classification, clustering, retrieval, reranking, and semantic textual similarity. It supports over 1000 languages and multiple modalities, such as text and image, and is continuously expanding. As of version 2.12.16, it aims to provide a standardized, comprehensive, and reproducible way to compare embedding models. The library maintains a frequent release cadence, with minor updates often landing weekly.
Warnings
- breaking MTEB v2 introduced a large-scale refactor with breaking changes, particularly affecting direct use of the `mteb.MTEB` class and the `mteb.load_results` function. Past minor/patch releases have also occasionally introduced breaking changes.
- gotcha Evaluating high-performing or large multilingual models on MTEB can be computationally very expensive, requiring significant GPU resources and time, especially for tasks with large document collections like retrieval.
- gotcha Models excelling on the general MTEB leaderboard might underperform on domain-specific data. The benchmark datasets may not reflect a given domain's documents, user behavior, or query patterns.
- deprecated Directly submitting model results to the MTEB leaderboard by adding metadata to Hugging Face model cards is no longer supported.
- gotcha When evaluating existing models, it is recommended to use `mteb.get_model("{model_name}")` instead of directly using `SentenceTransformer("{model_name}")`. This ensures consistent and reproducible results as it loads the model as MTEB implemented it, accounting for specific normalizations, quantizations, or prompts.
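The last gotcha is easy to underestimate: whether embeddings are L2-normalized before scoring changes similarity values, which is one of the wrapper details `mteb.get_model` handles for you. A minimal, library-free sketch with made-up vectors:

```python
import math

def l2_normalize(vec):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]

def dot(a, b):
    """Plain dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

a, b = [3.0, 4.0], [1.0, 0.0]

raw_score = dot(a, b)                                 # dot product on raw vectors -> 3.0
cosine_score = dot(l2_normalize(a), l2_normalize(b))  # cosine similarity -> 0.6
print(raw_score, cosine_score)
```

If a model was benchmarked with normalization baked in and you load it without, your scores will not be comparable to the leaderboard's.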
Install
- `pip install mteb`
- `uv add mteb`
Imports
- MTEB
from mteb import MTEB
- evaluate
import mteb
results = mteb.evaluate(model, tasks=tasks)
- get_model
import mteb
model = mteb.get_model('sentence-transformers/all-MiniLM-L6-v2')
- get_tasks
import mteb
tasks = mteb.get_tasks(tasks=['Banking77Classification.v2'])
- SentenceTransformer
from sentence_transformers import SentenceTransformer
Quickstart
import mteb
from sentence_transformers import SentenceTransformer
# Select a model to evaluate
model_name = "sentence-transformers/all-MiniLM-L6-v2"
# It's recommended to use mteb.get_model for reproducibility if the model is in MTEB's registry
# Otherwise, SentenceTransformer can be used directly
model = mteb.get_model(model_name) # Will fall back to SentenceTransformer if not registered in MTEB
# Select tasks to run (e.g., a specific classification task)
tasks = mteb.get_tasks(tasks=["Banking77Classification.v2"], languages=["eng"])
# Evaluate the model on the selected tasks
print(f"Running evaluation for {model_name} on {len(tasks)} tasks...")
results = mteb.evaluate(model, tasks=tasks)
print("Evaluation complete. Results:")
# results is a sequence of TaskResult objects, each holding per-split score dicts
for task_result in results:
    print(f"Task: {task_result.task_name}")
    for split, split_scores in task_result.scores.items():
        for scores in split_scores:
            print(f"  {split} main score: {scores['main_score']:.4f}")
            # Task-specific metrics (e.g. accuracy for classification) sit in the same dict
            if "accuracy" in scores:
                print(f"  {split} accuracy: {scores['accuracy']:.4f}")
# To save results to a specific folder
# output_folder = f"./results/{model_name.replace('/', '_')}"
# results = mteb.evaluate(model, tasks=tasks, output_folder=output_folder)
# print(f"Results saved to: {output_folder}")
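Saved results land on disk as one JSON file per task. A library-free sketch of pulling main scores out of such a file once loaded; the sample dict and its field names are illustrative assumptions about the per-split score layout, not a guaranteed schema:

```python
# Made-up dict mimicking the general shape of a saved MTEB task result:
# a task name plus per-split lists of score dicts (exact fields vary by task type).
sample_result = {
    "task_name": "Banking77Classification.v2",
    "scores": {
        "test": [
            {"hf_subset": "default", "main_score": 0.7123, "accuracy": 0.7123},
        ]
    },
}

def main_scores(result):
    """Collect (split, subset, main_score) triples from a result dict."""
    out = []
    for split, entries in result["scores"].items():
        for entry in entries:
            out.append((split, entry.get("hf_subset", "default"), entry["main_score"]))
    return out

for split, subset, score in main_scores(sample_result):
    print(f"{split}/{subset}: {score:.4f}")
```

Flattening results like this makes it easy to drop scores from many models into a table for side-by-side comparison.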