Goose3
JSON →Goose3 is an HTML content/article extractor and web scraper for Python 3 (requires Python >=3.9). It extracts the main content, title, authors, metadata (OpenGraph, schema.org), and images from news articles and web pages. The current version is 3.1.21, with irregular releases as fixes accumulate.
Traffic · last 30 days stale · no recent hits
total hits 5
actors 2 distinct systems
last hit 17d ago MetaBot
top countries 🇺🇸 United States