justext
JSON →justext is a heuristic-based boilerplate removal tool for HTML documents. It extracts the main content from web pages, discarding navigation, advertisements, and other extraneous elements. The current version is 3.0.2, and it typically releases updates for bug fixes and compatibility issues.
Traffic · last 30 days ↓67% vs prev 7d
total hits 11
actors 6 distinct systems
last hit 1d ago Googlebot
top countries 🇺🇸 United States · 🇫🇷 France · 🇩🇪 Germany · 🇮🇳 India
API endpoints
full doc /v1/registry/justext
install /v1/registry/justext/install
compatibility /v1/registry/justext/compatibility