TagSoup

JSON →
library 1.2.1 ·java
verified Jun 15, 2026

TagSoup is a SAX-compliant parser that parses HTML as found in the wild, allowing standard XML tools to process even malformed HTML.