readability-lxml

library 0.8.4.1 ·python
verified May 21, 2026

readability-lxml is a fast HTML-to-text parser for Python, designed to extract and clean up the main body text and title of an HTML document. It is a Python port of a Ruby port of arc90's Readability project. The library is actively maintained: the latest version is 0.8.4.1 (last PyPI upload in May 2025), and new releases typically add support for newer Python versions, fix bugs, or add minor features.
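To illustrate the title and body extraction described above, here is a minimal sketch using the library's `Document` class with its `title()` and `summary()` methods. The sample HTML and the `extract_article` helper are hypothetical and exist only for this example; the sketch assumes the package has been installed with `pip install readability-lxml`.

```python
# Hypothetical sample page, used only for illustration.
SAMPLE_HTML = """
<html>
  <head><title>Example article</title></head>
  <body>
    <div id="nav"><a href="/">Home</a> <a href="/about">About</a></div>
    <div id="content">
      <p>This is the first paragraph of the main body, long enough to be
      scored as real content, with commas, clauses, and plain prose.</p>
      <p>This is the second paragraph, which continues the article and gives
      the extractor more text to work with, again with a few commas.</p>
    </div>
    <div id="footer">Copyright notice</div>
  </body>
</html>
"""


def extract_article(html):
    """Return (title, cleaned_body_html) for an HTML string.

    Hypothetical helper: it only wraps the library's Document class.
    """
    # Imported inside the function so this module still loads when the
    # package is not installed.
    from readability import Document

    doc = Document(html)
    # title() returns the document title; summary() returns the cleaned
    # main content as an HTML string.
    return doc.title(), doc.summary()


if __name__ == "__main__":
    title, body = extract_article(SAMPLE_HTML)
    print(title)
    print(body)
```

Note that `summary()` returns cleaned HTML rather than plain text, so a further step (for example, parsing the result with lxml and reading its text content) is needed if only the text is wanted.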
