Newspaper3k
Newspaper3k is a Python 3 library for article discovery, extraction, and natural language processing (NLP) on news websites. It extracts an article's main content and metadata (title, authors, publish date, images, and videos) and can generate keywords and summaries. Its last PyPI release was in 2018; it still works for many use cases, but a community fork (`newspaper4k`) offers more active development and modern features.
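The extraction flow described above can be sketched roughly as follows. This is a minimal illustration, not the library's canonical usage: the helper names `extract_fields` and `fetch_article` are hypothetical, while `Article`, `download()`, `parse()`, `nlp()` and the attributes read below come from Newspaper3k itself. It assumes `newspaper3k` is installed and that the NLTK `punkt` tokenizer data is available for the NLP step.

```python
from typing import Any, Dict


def extract_fields(article: Any) -> Dict[str, Any]:
    """Run download -> parse -> nlp on an Article-like object and collect results.

    `extract_fields` is a hypothetical helper for this sketch; it only relies
    on the methods and attributes that newspaper.Article exposes.
    """
    article.download()  # fetch the raw HTML for the article URL
    article.parse()     # extract title, authors, publish date, text, media
    article.nlp()       # populate keywords and summary (needs NLTK 'punkt' data)
    return {
        "title": article.title,
        "authors": article.authors,
        "publish_date": article.publish_date,
        "top_image": article.top_image,
        "videos": article.movies,
        "keywords": article.keywords,
        "summary": article.summary,
    }


def fetch_article(url: str) -> Dict[str, Any]:
    """Hypothetical convenience wrapper: build an Article for `url` and extract it."""
    # Imported lazily so this module still loads where newspaper3k is absent.
    from newspaper import Article

    return extract_fields(Article(url))
```

Calling `fetch_article` with the URL of a news article performs a network request and returns a plain dictionary; fields the page does not expose may come back empty or as `None`.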
Resources
API endpoints
full doc /v1/registry/newspaper3k
install /v1/registry/newspaper3k/install
compatibility /v1/registry/newspaper3k/compatibility