readability-lxml

library 0.8.4.1 ·python

✓ verified May 21, 2026

readability-lxml is a Python library that provides a fast HTML to text parser, designed to extract and clean up the main body text and title from an HTML document. It is a Python port of a Ruby port of arc90's Readability project. The library is actively maintained, with the latest version being 0.8.4.1 as of May 2025 (last PyPI upload date), and new releases typically occur to add Python version support, fix bugs, or add minor features.

Traffic · last 30 days ↓25% vs prev 7d · indexed Sun Apr 12 · updated Wed May 27

total hits 12

actors 7 distinct systems

last hit 3d ago AhrefsBot

Script

ChatGPT-User

OAI-SearchBot

ClaudeBot

Search engines

top countries 🇺🇸 United States · 🇩🇪 Germany · 🇨🇦 Canada · 🇫🇷 France · 🇧🇷 Brazil

Resources

githubgithub.com/buriy/python-readability ↗

packagepypi.org/project/readability-lxml/ ↗

API endpoints

full doc /v1/registry/readability-lxml

install /v1/registry/readability-lxml/install

compatibility /v1/registry/readability-lxml/compatibility