PaddleOCR

JSON →
library 3.4.0 ·python
verified May 21, 2026

PaddleOCR is an awesome multilingual OCR and document parsing toolkit built upon the PaddlePaddle deep learning framework. It provides robust capabilities for text detection, text recognition, and structured document understanding, capable of transforming images and PDFs into structured data (JSON/Markdown) for AI and LLM-based applications. The library is currently at version 3.4.0 and maintains a frequent release cadence, with minor versions released every few months, incorporating new models and features.

total hits 18
actors 7 distinct systems
last hit 5d ago Script
Script
3
ChatGPT-User
3
GPTBot
2
ClaudeBot
1
Search engines
1

top countries 🇺🇸 United States · 🇫🇮 Finland · 🇩🇪 Germany · 🇫🇷 France · 🇮🇳 India