HTML5 RDF Parser

library 1.2.1 ·python

✓ verified May 24, 2026

html5rdf is a pure-python library for parsing HTML to DOMFragment objects, primarily intended for use within RDFLib. It is a fork of `html5lib-python` and `html5lib-modern`, designed to conform to the WHATWG HTML specification. Maintained by the RDFLib team, it serves as a drop-in replacement for `html5lib` without Python 2 support or legacy dependencies like `six` and `webencodings`. The current version is 1.2.1, with releases occurring as needed for bug fixes and RDFLib integration.

Traffic · last 30 days ↑0% vs prev 7d · indexed Thu Apr 16 · updated Mon Jun 01

total hits 14

actors 6 distinct systems

last hit 3d ago MetaBot

MetaBot

GPTBot

Script

Search engines

top countries 🇺🇸 United States · 🇨🇦 Canada · 🇩🇪 Germany · 🇫🇮 Finland

Resources

githubgithub.com/RDFLib/html5rdf ↗

packagepypi.org/project/html5rdf/ ↗

API endpoints

full doc /v1/registry/html5rdf

install /v1/registry/html5rdf/install

compatibility /v1/registry/html5rdf/compatibility