BoilerPy3

library 1.0.7 · python
verified May 25, 2026

BoilerPy3 is an active Python port of Christian Kohlschütter's Boilerpipe library for removing HTML boilerplate and extracting the main text of web pages. The current version is 1.0.7, which is based on the functionality of Boilerpipe 1.2. The port aims to provide a more Pythonic interface, with type hints and snake_case naming.
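
As a rough illustration of main text extraction, the sketch below runs the library's `ArticleExtractor` over a small HTML string. The sample markup and the `extract_main_text` helper are assumptions made up for this example, not part of BoilerPy3, and the exact text returned depends on the extractor's heuristics.

```python
# Requires the third-party package: pip install boilerpy3
try:
    from boilerpy3 import extractors
except ImportError:
    # Library not installed; the helper below reports this explicitly.
    extractors = None

# Hypothetical sample page: navigation and footer surround one article.
SAMPLE_HTML = """
<html>
  <head><title>Example page</title></head>
  <body>
    <div id="nav"><a href="/">Home</a> | <a href="/about">About</a></div>
    <div id="article">
      <h1>Boilerplate removal in practice</h1>
      <p>Boilerplate removal separates the main article text of a web page
      from navigation menus, advertisements, and footers. Extractors score
      blocks of text using features such as length and link density, and
      keep the blocks that look like running prose.</p>
      <p>This second paragraph gives the extractor more article-like text
      to work with, which tends to make short samples behave more like
      real pages.</p>
    </div>
    <div id="footer">Copyright notice | <a href="/terms">Terms</a></div>
  </body>
</html>
"""


def extract_main_text(html: str) -> str:
    """Return the main text of an HTML document (illustrative helper)."""
    if extractors is None:
        raise RuntimeError("boilerpy3 is not installed")
    # ArticleExtractor is tuned for news-style article pages.
    extractor = extractors.ArticleExtractor()
    return extractor.get_content(html)


if __name__ == "__main__":
    print(extract_main_text(SAMPLE_HTML))
```

The same extractor object also offers `get_content_from_url` and `get_content_from_file` for fetching or reading the document itself; other extractors in the `extractors` module, such as `DefaultExtractor`, follow the same calling pattern.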
