Mammoth

JSON →
library 1.12.0 ·python
verified May 20, 2026

Mammoth is an open-source Python library designed to convert Microsoft Word `.docx` documents into clean and semantic HTML or Markdown. It focuses on preserving the semantic structure of the document (e.g., headings, lists, tables) rather than attempting to replicate exact visual formatting. The current version is 1.12.0. The library has a steady release cadence, with updates addressing features and maintenance.

total hits 15
actors 8 distinct systems
last hit 22h ago AhrefsBot
GPTBot
2
Script
2
ClaudeBot
1
ChatGPT-User
1
Search engines
2
Humans
1

top countries 🇺🇸 United States · 🇨🇦 Canada · 🇫🇮 Finland · 🇩🇪 Germany