Surya OCR: Document Layout and Text Recognition

library 0.17.1 ·python

✓ verified Jul 3, 2026

Surya OCR is a Python library offering state-of-the-art optical character recognition (OCR), document layout analysis, reading order detection, and table recognition for over 90 languages. It's built on deep learning models, providing high accuracy for complex document structures. The current version is 0.17.1, and it undergoes active development with frequent releases.

Traffic · last 30 days ↑80% vs prev 7d · indexed Tue Apr 14 · updated Sat Jul 11

total hits 31

actors 9 distinct systems

last hit 5d ago human

GPTBot

OAI-SearchBot

Perplexity-User

Script

ByteDance

Search engines

Humans

top countries 🇺🇸 United States · 🇸🇬 Singapore · 🇨🇦 Canada · VN · 🇩🇪 Germany

Resources

githubgithub.com/VikParuchuri/surya ↗

packagepypi.org/project/surya-ocr/ ↗

API endpoints

full doc /v1/registry/surya-ocr

install /v1/registry/surya-ocr/install

compatibility /v1/registry/surya-ocr/compatibility