pdftotext

library · 3.0.0 · python

✓ verified May 26, 2026

pdftotext is a Python binding to the C++ interface of Poppler (poppler-cpp), the PDF rendering library that also ships the `pdftotext` command-line utility. It provides a simple, efficient way to extract text from PDF documents, one string per page. The current version is 3.0.0. Releases follow a moderate cadence, with bug-fix releases appearing more often than major versions.
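As a rough illustration of the extraction workflow described above, the sketch below opens a PDF, reads each page as a string, and joins the pages into one document. It assumes the package is installed (`pip install pdftotext`, which in turn needs Poppler's C++ development files on the system). The helper names `extract_pages` and `join_pages` are hypothetical and exist only for this example; `pdftotext.PDF` is the library's own class.

```python
from typing import Iterable, List


def extract_pages(path: str, password: str = "") -> List[str]:
    """Return the text of each page of the PDF at `path` (hypothetical helper)."""
    # Imported lazily so the pure helper below works without the package installed.
    import pdftotext  # third-party; builds against Poppler's C++ library

    with open(path, "rb") as f:
        # PDF loads the document from a binary file object; the password is optional.
        pdf = pdftotext.PDF(f, password)
    # A PDF object behaves like a sequence of page strings.
    return list(pdf)


def join_pages(pages: Iterable[str], separator: str = "\n\n") -> str:
    """Join page texts into one string, trimming whitespace around each page."""
    return separator.join(page.strip() for page in pages)


# Example usage (assumes a file named "report.pdf" exists; the name is illustrative):
#   text = join_pages(extract_pages("report.pdf"))
#   print(text)
```

Keeping the joining step separate from the file access makes it easy to post-process page text without touching the PDF again.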

indexed Fri Apr 17 · updated Mon Jun 01

Resources

github github.com/jalan/pdftotext

package pypi.org/project/pdftotext/

API endpoints

full doc /v1/registry/pdftotext

install /v1/registry/pdftotext/install

compatibility /v1/registry/pdftotext/compatibility