Document Text Recognition (docTR)
docTR (Document Text Recognition) is an open-source Python library leveraging deep learning for high-performance Optical Character Recognition (OCR) on documents. It provides state-of-the-art text detection and recognition for scanned documents, images, and PDFs. Actively maintained by Mindee, it supports multi-language recognition, handwriting, and GPU acceleration, currently at version 1.0.1.
Common errors
- OSError: cannot load library 'gobject-2.0-0'
  - cause: Missing system-level dependencies for `weasyprint`, which is used by docTR's `html` and `viz` extras for PDF/HTML processing.
  - fix: Install the required system packages. On Debian/Ubuntu: `sudo apt-get install -y libglib2.0-0 libpango-1.0-0 libpangoft2-1.0-0`. Other Linux distributions, macOS, and Windows have different prerequisites for `weasyprint`.
- ModuleNotFoundError: No module named 'doctr.io'
  - cause: The `python-doctr` package is not installed, or the Python interpreter in use cannot see it (e.g., the wrong virtual environment is active).
  - fix: Install it into the active environment: `pip install python-doctr`. In an IDE such as PyCharm, verify that the correct Python interpreter is selected for the project.
- `git clone ...` followed by `pip install -e doctr/` fails with SSL certificate verification errors.
  - cause: Corporate proxies or misconfigured Git installations can block secure (SSL/TLS) connections when cloning repositories or fetching packages.
  - fix: Temporarily disable SSL verification for Git *before* cloning: `git config --global http.sslVerify false`. Re-enable it afterwards with `git config --global http.sslVerify true`, since leaving verification off is a security risk.
Warnings
- breaking: docTR v1.0.0 removed TensorFlow as a supported backend; the library now uses PyTorch exclusively. The old `python-doctr[tf]` install option is no longer valid, and the training scripts have been updated accordingly.
- gotcha: Processing PDFs or HTML documents with `DocumentFile.from_pdf` or `DocumentFile.from_url` (via the `html` extra) relies on `weasyprint`, which itself has system-level dependencies (e.g., `libglib2.0-0`, `libpango-1.0-0` on Linux) that `pip` does not install automatically.
- gotcha: GPU acceleration requires manually installing `torch` and `torchvision` with CUDA support; `pip install python-doctr` does not handle this automatically, to keep the base package lightweight.
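Following the GPU gotcha above, it can be useful to probe for a CUDA-enabled PyTorch build before constructing a predictor. A minimal sketch (the helper name `cuda_available` is our own; it degrades gracefully when `torch` is not installed at all):

```python
import importlib.util

def cuda_available() -> bool:
    """Return True only if torch is installed AND its CUDA runtime is usable."""
    if importlib.util.find_spec("torch") is None:
        return False  # torch not installed at all
    import torch
    return torch.cuda.is_available()

print("CUDA available:", cuda_available())
```

If this prints `False`, reinstall PyTorch from a CUDA index URL (see the Install section) before expecting GPU inference.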
Install
- `pip install python-doctr`
- `pip install "python-doctr[viz,html,contrib]"`
- With GPU support (install a CUDA-enabled PyTorch first):
  - `pip install torch torchvision --index-url https://download.pytorch.org/whl/cu118`
  - `pip install python-doctr`
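After installing, a quick check confirms the package is importable and reports its version. This sketch uses only the standard library and returns `None` instead of raising when `python-doctr` is absent (the helper name `doctr_version` is our own):

```python
import importlib.metadata
import importlib.util

def doctr_version():
    """Return the installed python-doctr version string, or None if not installed."""
    if importlib.util.find_spec("doctr") is None:
        return None
    return importlib.metadata.version("python-doctr")

print("python-doctr version:", doctr_version())
```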
Imports
- DocumentFile
from doctr.io import DocumentFile
- ocr_predictor
from doctr.models import ocr_predictor
- from_hub
from doctr.models import from_hub
Quickstart
```python
import os

from doctr.io import DocumentFile
from doctr.models import ocr_predictor

# For demonstration, create a dummy image file if one doesn't exist.
# In a real scenario, you'd point at an actual image or PDF path.
dummy_image_path = "sample.png"
if not os.path.exists(dummy_image_path):
    try:
        from PIL import Image, ImageDraw, ImageFont

        # Create a simple white image with some text
        img = Image.new("RGB", (200, 100), color=(255, 255, 255))
        d = ImageDraw.Draw(img)
        try:
            # Try a common font, fall back to the default bitmap font
            font = ImageFont.truetype("arial.ttf", 20)
        except IOError:
            font = ImageFont.load_default()
        d.text((10, 10), "Hello docTR!", fill=(0, 0, 0), font=font)
        img.save(dummy_image_path)
        print(f"Created dummy image: {dummy_image_path}")
    except ImportError:
        print("Pillow not installed, cannot create dummy image. Please provide a real image file.")
        dummy_image_path = None

if dummy_image_path and os.path.exists(dummy_image_path):
    # Load your document (image or PDF)
    # For a PDF: doc = DocumentFile.from_pdf("path/to/your/document.pdf")
    # For multiple images: doc = DocumentFile.from_images(["path/to/img1.jpg", "path/to/img2.png"])
    doc = DocumentFile.from_images(dummy_image_path)

    # Load a pre-trained OCR model
    # Since v1.0.0, PyTorch is the default and only backend.
    model = ocr_predictor(pretrained=True)

    # Analyze the document
    result = model(doc)

    # The result object contains detailed information about words, lines, blocks, and pages.
    print("\n--- OCR Result ---")
    for page in result.pages:
        for block in page.blocks:
            for line in block.lines:
                print(" ".join(word.value for word in line.words))

    # You can also export the full structured output as a dict:
    # print(result.export())
else:
    print("Quickstart skipped due to missing image.")
```
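The `result.export()` call mentioned above returns a plain nested dict. As a sketch of post-processing that structure, the sample dict below is hand-written to mirror the pages/blocks/lines/words nesting (field names like `value` match the word-level output; the `extract_text` helper is our own):

```python
# Hand-crafted sample mimicking the shape of result.export() — not real docTR output
sample_export = {
    "pages": [
        {
            "blocks": [
                {
                    "lines": [
                        {
                            "words": [
                                {"value": "Hello", "confidence": 0.99},
                                {"value": "docTR!", "confidence": 0.98},
                            ]
                        }
                    ]
                }
            ]
        }
    ]
}

def extract_text(export: dict) -> str:
    """Flatten an export-style dict into plain text, one line per recognized line."""
    lines = []
    for page in export["pages"]:
        for block in page["blocks"]:
            for line in block["lines"]:
                lines.append(" ".join(w["value"] for w in line["words"]))
    return "\n".join(lines)

print(extract_text(sample_export))  # Hello docTR!
```

Working on the exported dict rather than the result object makes it easy to serialize OCR output to JSON and process it in a separate pipeline stage.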