{ robots.txt Analyzer }

// parse and summarize robots.txt directives instantly

Parse and analyze robots.txt files instantly. Inspect disallow rules, allow directives, sitemap references, crawl-delay, and user-agent groups in one place.


HOW TO USE

  1. Enter a URL or paste content

    Type any website URL and click Fetch, or paste robots.txt text directly into the input area.

  2. Click Analyze

    The tool parses all directives, groups them by user-agent, and extracts sitemaps and crawl-delay settings.

  3. Review the results

    Inspect disallow/allow patterns per bot, sitemap URLs, and any special crawl instructions; copy the raw output if needed.

FEATURES

Live Fetch · Disallow Rules · Allow Rules · Sitemaps · Crawl-Delay · Multi User-Agent · Wildcard Detection · Browser-Based

USE CASES

  • πŸ” Audit which pages are blocked from crawlers
  • πŸ—ΊοΈ Find all declared sitemap URLs
  • πŸ€– Check rules for Googlebot vs other bots
  • ⏱️ Verify crawl-delay settings per agent
  • πŸ”§ Debug indexing issues before a site launch

WHAT IS THIS?

The robots.txt Analyzer parses the standard web crawler exclusion protocol. It identifies every user-agent block, extracts disallow and allow directives, finds sitemap references, and surfaces crawl-delay settings, all in your browser without sending data to a server.

FREQUENTLY ASKED QUESTIONS

What is a robots.txt file?

A robots.txt file is a plain text file placed at the root of a website (e.g. https://example.com/robots.txt). It tells web crawlers which pages or sections they are allowed or not allowed to access. It follows the Robots Exclusion Protocol (REP) and is respected by major crawlers like Googlebot, Bingbot, and others.
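
A complete robots.txt can be as short as a few lines. A minimal, illustrative example (the domain is a placeholder):

  User-agent: *
  Disallow: /admin/
  Allow: /

  Sitemap: https://example.com/sitemap.xml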

What does Disallow mean in robots.txt?

A Disallow directive instructs a crawler not to access the specified path. For example, Disallow: /admin/ prevents crawlers from visiting any URL under /admin/. A blank Disallow: means all pages are allowed. Note: Disallow: / blocks everything.

What does Allow mean in robots.txt?

An Allow directive explicitly permits access to a path, even if a broader Disallow rule would otherwise block it. For example, you might disallow /private/ but allow /private/public-page/. Allow rules take precedence when they are more specific than a Disallow rule.
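
Written out as robots.txt directives, that example looks like this:

  User-agent: *
  Disallow: /private/
  Allow: /private/public-page/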

What is a User-agent in robots.txt?

The User-agent field identifies which crawler the following rules apply to. User-agent: * applies to all bots. You can have separate blocks for specific bots like User-agent: Googlebot or User-agent: Bingbot, each with its own rules.
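
For example, this hypothetical file gives Googlebot different rules from every other bot:

  User-agent: Googlebot
  Disallow: /no-google/

  User-agent: *
  Disallow: /private/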

What is Crawl-delay?

The Crawl-delay directive tells a crawler how many seconds to wait between requests. For example, Crawl-delay: 10 means the bot should wait 10 seconds between page fetches. Note: Googlebot ignores this directive; use Google Search Console to control Googlebot's crawl rate instead.

What is the Sitemap directive?

The Sitemap directive in robots.txt points crawlers to your XML sitemap URL(s). This helps search engines discover all your pages faster. You can list multiple sitemap URLs, one per line. This is separate from any user-agent blocks and applies globally.

Does robots.txt prevent pages from being indexed?

No. robots.txt only prevents crawlers from accessing those pages. A blocked page can still appear in search results if other sites link to it. To prevent indexing entirely, use a noindex meta tag or X-Robots-Tag HTTP header on the page itself.
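
For example, either of the following keeps a page out of the index (the page must stay crawlable so the directive can actually be seen):

  <!-- in the page's <head> -->
  <meta name="robots" content="noindex">

  # or sent as an HTTP response header
  X-Robots-Tag: noindex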

Is this tool safe to use with live sites?

Yes. The fetch feature only retrieves the public /robots.txt file from the domain you provide, the same file any crawler would read. No login credentials, cookies, or private data are sent. All parsing happens in your browser; nothing is stored.

What is a robots.txt Analyzer?

A robots.txt analyzer is a tool that reads and parses the robots.txt exclusion file from a website, then presents its directives in a structured, human-readable format. Instead of manually reading raw text, you get a clear breakdown of every user-agent group, every disallow and allow rule, all declared sitemaps, and any crawl-delay instructions, instantly.

The Robots Exclusion Protocol has been a cornerstone of web crawling since 1994. Despite its simplicity, robots.txt files can become complex on large sites, with dozens of user-agent blocks, overlapping rules, wildcards, and multiple sitemap references. This analyzer removes the guesswork by surfacing exactly what each bot is permitted or forbidden to access.

💡 Looking for SEO-optimized themes and templates? MonsterONE offers unlimited downloads of website templates, landing pages, and UI kits, a solid resource for developers building SEO-ready sites.

How robots.txt Parsing Works

The parsing process follows these steps. First, the file is split into lines and each line is classified as a directive type: User-agent, Disallow, Allow, Sitemap, Crawl-delay, or a comment (lines beginning with #). Empty lines act as group separators: each time an empty line appears after a user-agent/rule block, it ends that group.

Directives within a group are collected and associated with the user-agents named in that block. A single block can apply to multiple user-agents by listing them consecutively before the first rule directive. Once a non-user-agent directive appears, the group's agent list is fixed and subsequent rules are added to it until an empty line closes the group.

Sitemap and global directives that appear outside of any user-agent block are collected separately and shown in their own section. This mirrors how major crawlers like Googlebot actually interpret the file.
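
The sketch below illustrates this approach in plain JavaScript. It is a simplified illustration of the steps above, not the tool's actual source: it classifies each line, groups consecutive User-agent lines, attaches rules until a blank line closes the group, and collects Sitemap directives globally.

  function parseRobotsTxt(text) {
    const groups = [];    // each: { userAgents: [], rules: [], crawlDelay: null }
    const sitemaps = [];
    let current = null;   // group currently receiving directives
    let collectingAgents = false;

    for (const rawLine of text.split(/\r?\n/)) {
      const line = rawLine.replace(/#.*$/, "").trim();   // drop comments
      if (line === "") {                                  // blank line closes the group
        current = null;
        collectingAgents = false;
        continue;
      }
      const colon = line.indexOf(":");
      if (colon === -1) continue;                         // not a directive
      const field = line.slice(0, colon).trim().toLowerCase();
      const value = line.slice(colon + 1).trim();

      if (field === "sitemap") {
        sitemaps.push(value);                             // global, kept outside groups
      } else if (field === "user-agent") {
        if (!collectingAgents) {                          // start a new group
          current = { userAgents: [], rules: [], crawlDelay: null };
          groups.push(current);
          collectingAgents = true;
        }
        current.userAgents.push(value);
      } else if (current) {
        collectingAgents = false;                         // the agent list is now fixed
        if (field === "disallow" || field === "allow") {
          current.rules.push({ type: field, path: value });
        } else if (field === "crawl-delay") {
          current.crawlDelay = Number(value);
        }
      }
    }
    return { groups, sitemaps };
  }

Running it over a two-line file such as "User-agent: *" followed by "Disallow: /admin/" yields a single group with one disallow rule and an empty sitemap list.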

Understanding Disallow vs Allow Priority

When both Disallow and Allow rules could match a URL, the longest matching rule wins. This means Allow: /page takes precedence over Disallow: / if the URL is /page, because the Allow path is more specific. In a tie (equal length), Allow wins. This logic is defined in Google's interpretation of the protocol and is what the analyzer uses to flag potential conflicts.
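
A simplified sketch of that decision in JavaScript, using plain prefix matching and ignoring wildcards for brevity:

  function isAllowed(urlPath, rules) {
    // rules: [{ type: "allow" | "disallow", path: "/..." }]
    let best = null;
    for (const rule of rules) {
      if (rule.path === "") continue;                // a blank rule imposes no restriction
      if (!urlPath.startsWith(rule.path)) continue;  // simple prefix match
      if (
        best === null ||
        rule.path.length > best.path.length ||
        (rule.path.length === best.path.length && rule.type === "allow")
      ) {
        best = rule;                                 // longest match wins; Allow wins ties
      }
    }
    return best === null || best.type === "allow";
  }

  // Allow: /page beats Disallow: / for the URL /page
  isAllowed("/page", [
    { type: "disallow", path: "/" },
    { type: "allow", path: "/page" },
  ]);  // -> true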

Wildcard Patterns in robots.txt

Google and some other crawlers support two wildcard characters in robots.txt paths: * matches any sequence of characters, and $ anchors a pattern to the end of the URL. For example, Disallow: /*.pdf$ blocks every URL ending in .pdf.

The * in User-agent: * is different: it means "all bots," not a wildcard path. Be careful not to confuse the two.
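
One way to evaluate these patterns, sketched here as an illustration rather than taken from the tool's source, is to translate each path into a regular expression in which * becomes .* and a trailing $ becomes an end anchor:

  function patternToRegExp(pattern) {
    const anchored = pattern.endsWith("$");
    const body = (anchored ? pattern.slice(0, -1) : pattern)
      .split("*")
      .map((part) => part.replace(/[.+?^${}()|[\]\\]/g, "\\$&"))  // escape regex metacharacters
      .join(".*");
    return new RegExp("^" + body + (anchored ? "$" : ""));
  }

  patternToRegExp("/*.pdf$").test("/files/report.pdf");  // -> true
  patternToRegExp("/private*").test("/private/notes");   // -> true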

Common robots.txt Mistakes to Avoid

Several common errors can cause unintended crawling behavior:

  • Leaving a blanket Disallow: / in place after a staging launch, which blocks the entire site
  • Relying on robots.txt to de-index pages; blocked pages can still be indexed if other sites link to them (use noindex instead)
  • Forgetting that paths are case-sensitive, so Disallow: /Admin/ does not block /admin/
  • Hosting the file anywhere other than the site root, where crawlers will not find it
  • Blocking CSS and JavaScript assets that search engines need to render pages correctly

robots.txt and SEO Best Practices

From an SEO perspective, robots.txt is primarily a crawl budget management tool. For large sites with thousands of pages, strategically disallowing low-value URLs (filtered search results, duplicate paginated pages, internal search pages) helps concentrate crawl budget on content you actually want indexed.

Always pair your robots.txt with an accurate XML sitemap declared via the Sitemap: directive. This gives crawlers a positive signal of what to index, rather than just negative signals about what to avoid. Include all canonical versions of your important pages in the sitemap, and make sure none of them are accidentally blocked in robots.txt.
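
For instance, a hypothetical storefront might keep internal search and filter URLs out of the crawl while still pointing bots at its sitemap (domain and paths are placeholders):

  User-agent: *
  Disallow: /search
  Disallow: /*?sort=
  Disallow: /*?filter=

  Sitemap: https://example.com/sitemap.xml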

Test your robots.txt changes with Google Search Console's URL Inspection tool after deployment. Changes to robots.txt can take anywhere from hours to days to be re-crawled, depending on the site's crawl rate.

Who Uses a robots.txt Analyzer?

SEO professionals use robots.txt analyzers during technical audits to verify that important pages are accessible to crawlers. Developers use them before and after site migrations to catch accidental blocks. DevOps engineers check them as part of pre-deployment checklists. Site owners use them to understand what a crawler like Googlebot actually sees when it visits their domain.

This tool is entirely browser-based. When you use the URL fetch feature, it retrieves the publicly available /robots.txt from the domain you specify, exactly as any crawler would. All parsing logic runs client-side in JavaScript; your robots.txt content is never sent to our servers or stored.
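
To reproduce the same flow yourself, a few lines of JavaScript are enough. This sketch assumes a runtime with a global fetch (for example Node 18+, or a browser context where the target site's CORS policy allows the request) and reuses the hypothetical parseRobotsTxt helper sketched earlier:

  const response = await fetch("https://example.com/robots.txt");
  const { groups, sitemaps } = parseRobotsTxt(await response.text());

  console.log(sitemaps);                        // declared sitemap URLs
  for (const group of groups) {
    console.log(group.userAgents, group.rules); // rules per user-agent block
  }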
