wikipedia-to-mongodb

library 2.4.0 · javascript maintenance
verified Jun 5, 2026

Imports a full Wikipedia XML dump into MongoDB, parsing wikitext (wiki markup) into queryable JSON. Version 2.4.0; release cadence is sporadic, with the last update in 2017. Runs as a one-liner from the CLI or from a Node.js script, and uses wtf_wikipedia for parsing. Works with a Wikipedia dump in any language. Supports plaintext extraction and optional Redis-backed workers for faster loading. Requires Node.js ≥0.10.33 and MongoDB.
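
To make "parsing wiki markup into queryable JSON" and "plaintext extraction" concrete, here is a minimal sketch of the idea. It is a toy stand-in, not wtf_wikipedia's parser and not this library's API: the function name `toyParse` and the `title`/`text`/`links` field names are hypothetical, and it handles only internal links and bold/italic quote marks.

```javascript
// Toy illustration only: NOT wtf_wikipedia and NOT the schema this library writes.
// It shows the general shape of "markup in, JSON document out".
function toyParse(title, markup) {
  const links = [];

  // [[Target]] or [[Target|label]]: keep the visible label, record the target.
  let text = markup.replace(
    /\[\[([^\]|]+)(?:\|([^\]]+))?\]\]/g,
    (match, target, label) => {
      links.push(target);
      return label || target;
    }
  );

  // Remove bold/italic markers, which are runs of two or more apostrophes.
  text = text.replace(/'{2,}/g, '');

  // A plain object like this could be stored as one MongoDB document per page.
  return { title, text, links };
}

const doc = toyParse(
  'Toronto',
  "'''Toronto''' is the capital of [[Ontario]], a province of [[Canada|the country Canada]]."
);

console.log(JSON.stringify(doc, null, 2));
```

Once pages are stored in this kind of shape, fields such as the title or the list of link targets can be matched with ordinary MongoDB queries; the real field names depend on the parser's output and should be checked against the stored documents.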