BoilerPy3

library 1.0.7 ·python

✓ verified May 25, 2026

BoilerPy3 is an active Python port of Christian Kohlschütter's Boilerpipe library, designed for robust HTML boilerplate removal and main text extraction from web pages. It is currently at version 1.0.7 and is based on Boilerpipe 1.2 functionality. The library focuses on providing a more Pythonic interface, including type-hinting and snake_case conventions.

Traffic · last 30 days ↓61% vs prev 7d · indexed Thu Apr 16 · updated Mon Jun 01

total hits 33

actors 7 distinct systems

last hit 2d ago AhrefsBot

ByteDance

MetaBot

GPTBot

Script

top countries 🇸🇬 Singapore · 🇩🇪 Germany · 🇺🇸 United States · 🇨🇦 Canada · 🇫🇷 France

Resources

githubgithub.com/jmriebold/BoilerPy3 ↗

packagepypi.org/project/boilerpy3/ ↗

API endpoints

full doc /v1/registry/boilerpy3

install /v1/registry/boilerpy3/install

compatibility /v1/registry/boilerpy3/compatibility