readability-lxml
readability-lxml is a Python library that provides a fast HTML to text parser. It extracts and cleans up the main body text and title of an HTML document. It is a Python port of a Ruby port of arc90's Readability project. The library is actively maintained: the latest version is 0.8.4.1, last uploaded to PyPI in May 2025. New releases typically add Python version support, fix bugs, or add minor features.
API endpoints
full doc /v1/registry/readability-lxml
compatibility /v1/registry/readability-lxml/compatibility