lxml-html-clean

library · 0.4.4 · python

✓ verified Jun 30, 2026 △ install stale

auth-security serialization

lxml-html-clean is a Python library that provides the HTML cleaning utility that was originally part of the `lxml` project (as `lxml.html.clean`). It removes unwanted tags, attributes, and scripts from HTML content, which helps reduce exposure to XSS and related injection attacks. The current version is 0.4.4. Releases are infrequent and typically contain bug fixes or minor improvements.

indexed Thu Apr 09 · updated Wed Jul 08

Resources

docs: lxml-html-clean.readthedocs.io/

github: github.com/fedora-python/lxml_html_clean

package: pypi.org/project/lxml-html-clean/

API endpoints

full doc /v1/registry/lxml-html-clean

install /v1/registry/lxml-html-clean/install

compatibility /v1/registry/lxml-html-clean/compatibility