OpenDataLoader PDF

library · 2.4.3 · python

✓ verified May 9, 2026

data serialization

A Python wrapper for the opendataloader-pdf Java CLI that extracts structured content and metadata from PDFs, including accessibility tags, tables, headings, and strikethrough text. The current version is 2.4.3, and it requires Python >=3.10. New releases appear every few months.
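Because the package wraps a Java CLI, it can help to check the environment before installing it (the PyPI name is opendataloader-pdf). The sketch below uses only the standard library. It assumes a Java runtime must be discoverable on PATH, which this listing does not state, and the helper name check_prerequisites is hypothetical rather than part of the library.

```python
import shutil
import sys

# From the listing: "requires Python >=3.10".
MIN_PYTHON = (3, 10)


def check_prerequisites(java_binary: str = "java") -> dict:
    """Report whether this environment looks able to run a Java-backed wrapper.

    Assumption: the wrapper needs a Java runtime on PATH. This helper is
    illustrative only and is not part of opendataloader-pdf.
    """
    return {
        # True when the running interpreter meets the stated minimum version.
        "python_ok": sys.version_info[:2] >= MIN_PYTHON,
        # Absolute path to the Java launcher, or None if it is not on PATH.
        "java_path": shutil.which(java_binary),
    }


if __name__ == "__main__":
    print(check_prerequisites())
```

A None value for java_path suggests installing a Java runtime first. The exact Java version the CLI needs is not given here, so check the project's GitHub page.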

indexed Sat May 09 · updated Tue Jun 23

Resources

github github.com/opendataloader-project/opendataloader-pdf

package pypi.org/project/opendataloader-pdf/

API endpoints

full doc /v1/registry/opendataloader-pdf

install /v1/registry/opendataloader-pdf/install
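The two endpoint paths above share one pattern, so they can be derived from the package slug. The sketch below is a minimal illustration. The function name registry_paths is hypothetical, and the registry's host is not given on this page, so only relative paths are built.

```python
from urllib.parse import quote


def registry_paths(slug: str) -> dict:
    """Build the relative registry paths for a package slug.

    Mirrors the two paths listed on this page. The host is unknown here,
    so callers must supply their own base URL.
    """
    # Percent-encode the slug so it stays a single path segment.
    base = f"/v1/registry/{quote(slug, safe='')}"
    return {"full_doc": base, "install": f"{base}/install"}


if __name__ == "__main__":
    for name, path in registry_paths("opendataloader-pdf").items():
        print(name, path)
```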