pdf-oxide
The fastest Python PDF library, with a mean extraction time of 0.8 ms, 5× faster than PyMuPDF. Supports text extraction, Markdown conversion, and PDF creation. Achieves a 100% pass rate on 3,830 PDFs. Current version: 0.3.60; actively maintained.