Newspaper3k

JSON →
library 0.2.8 ·python maintenance
verified May 22, 2026

Newspaper3k is a Python 3 library designed for simplified article discovery, extraction, and natural language processing (NLP) from news websites. It excels at extracting main content, metadata like title, author, publish date, images, and videos, as well as generating keywords and summaries. Although its last PyPI release was in 2018, it remains functional for many use cases, though a community fork (`newspaper4k`) provides more active development and modern features.

total hits 14
actors 8 distinct systems
last hit 5d ago Script
ChatGPT-User
3
GPTBot
2
Script
2
ClaudeBot
1
Search engines
2

top countries 🇺🇸 United States · 🇩🇪 Germany · 🇨🇦 Canada · 🇫🇮 Finland · 🇺🇦 Ukraine