Robots.txt Parser

library · 2.0.3 · javascript
verified May 27, 2026

The `robots-txt-parser` library is a lightweight, promise-based parser for `robots.txt` files in Node.js environments. The current version is 2.0.3. It supports wildcards in rules, configurable caching of `robots.txt` content, and asynchronous operation through either promises or traditional callbacks. The package is aimed at developers building web crawlers, scrapers, and other automated bots that must adhere to website crawling policies. Its main differentiators are a focus on Node.js and a clear API for three tasks: determining whether a URL may be crawled, retrieving sitemaps, and reading crawl delays. The release cadence is stable, and the 2.x major version has been actively supported since late 2018. Users can configure the default user agent string and the result the parser returns when the matching allow and disallow rules for a URL are balanced.
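
To make the wildcard and "balanced rules" behaviour concrete, here is a minimal, self-contained sketch of how such an evaluation could work. It is not the library's implementation or API. The helper names (`patternToRegExp`, `bestMatchLength`, `canCrawl`), the shape of the `rules` object, and the `allowOnNeutral` parameter name are assumptions made for illustration. The longest-match precedence is borrowed from the common robots.txt convention and is not confirmed here as the library's behaviour.

```javascript
// Illustrative sketch only; NOT the robots-txt-parser implementation.

// Convert a robots.txt path pattern into a RegExp.
// '*' matches any run of characters; a trailing '$' anchors the end of the path.
function patternToRegExp(pattern) {
  const anchored = pattern.endsWith('$');
  const body = anchored ? pattern.slice(0, -1) : pattern;
  const escaped = body
    .split('*')
    .map((part) => part.replace(/[.*+?^${}()|[\]\\]/g, '\\$&')) // escape regex metacharacters
    .join('.*');
  return new RegExp('^' + escaped + (anchored ? '$' : ''));
}

// Length of the longest pattern that matches the path, or -1 if none match.
function bestMatchLength(patterns, path) {
  let best = -1;
  for (const pattern of patterns) {
    if (pattern.length > best && patternToRegExp(pattern).test(path)) {
      best = pattern.length;
    }
  }
  return best;
}

// Decide crawlability (assumed policy: the longer, more specific rule wins).
// allowOnNeutral is a hypothetical parameter that settles an exact tie.
function canCrawl(path, rules, allowOnNeutral = true) {
  const allow = bestMatchLength(rules.allow, path);
  const disallow = bestMatchLength(rules.disallow, path);
  if (allow === -1 && disallow === -1) return true; // no rule applies
  if (allow === disallow) return allowOnNeutral;    // balanced rules
  return allow > disallow;
}

// Usage with made-up rules
const rules = { allow: ['/public/*.html'], disallow: ['/public/', '/*.pdf$'] };
const tie = { allow: ['/page'], disallow: ['/page'] };

console.log(canCrawl('/public/index.html', rules)); // true
console.log(canCrawl('/public/report.pdf', rules)); // false
console.log(canCrawl('/page', tie, false));         // false
```

In this sketch a path that no rule matches is always crawlable, and the tie-breaking flag is consulted only when an allow rule and a disallow rule of equal length both match. The library's own option for this case may differ in name and semantics, so check its documentation before relying on it.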
