HTML5 Parser for Python

JSON →
library 1.1 ·python
verified Jun 9, 2026 install

html5lib is a pure-Python library for parsing HTML documents, designed to conform to the WHATWG HTML specification, as implemented by major web browsers. Its current stable version is 1.1, released in June 2020, with development on version 1.2 ongoing but unreleased. The library's release cadence is irregular, with significant time between major stable releases.

total hits 22
actors 6 distinct systems
last hit 1d ago GPTBot
GPTBot
4
Amazonbot
4
MetaBot
4
Script
2
Search engines
2
Humans
2

top countries 🇺🇸 United States · 🇨🇦 Canada · 🇩🇪 Germany