Robots.txt Generator

Generate robots.txt rules for search engine crawlers.

100% Browser-Based

Files Never Leave Your Device

No Server Uploads

Zero Data Collection

Use Robots.txt Generator to draft crawler directives for public sites, staging folders, API routes, and sitemap discovery. A robots.txt file is a crawl instruction, not an access-control system: compliant crawlers may follow it, but private content still needs authentication or server restrictions. Review User-agent, Allow, Disallow, and Sitemap lines carefully before deploying.

When to use this tool

Create a starting robots.txt file for a new site or static export.
Block crawl waste on API routes, admin paths, search result pages, or temporary folders.
Add a Sitemap directive so crawlers can discover XML sitemap locations.
Document crawler rules for developers before changing production robots.txt.

Common issues and fixes

Important pages are accidentally blocked

Test the exact path patterns and remember that broad Disallow rules can hide entire sections from crawlers.
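Disallow values are matched as path prefixes, so a short pattern can catch far more than intended. The following is a minimal sketch using Python's standard urllib.robotparser module; the rules, the example.com host, the ExampleBot name, and the blocked_paths helper are all hypothetical and only illustrate the effect.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: the author meant to block only /private/.
ROBOTS_TXT = """\
User-agent: *
Disallow: /p
"""

def blocked_paths(robots_txt, paths, agent="ExampleBot"):
    """Return the subset of paths that the given agent may not crawl."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    origin = "https://example.com"  # placeholder host
    return [p for p in paths if not parser.can_fetch(agent, origin + p)]

sample = ["/private/report", "/pricing", "/products/widget", "/about"]
blocked = blocked_paths(ROBOTS_TXT, sample)
print(blocked)  # ['/private/report', '/pricing', '/products/widget']
```

Changing the rule to Disallow: /private/ would leave /pricing and /products/widget crawlable.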

Robots.txt is treated as security

Never rely on robots.txt to protect private data; use authentication or server-level access rules. A noindex header keeps a page out of search results but does not restrict access to it.

Crawler rules conflict

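Within a group, crawlers that follow RFC 9309 apply the most specific (longest) matching rule, and each crawler obeys only the User-agent group that best matches its name. Some older parsers apply rules in file order instead, so review User-agent groups and Allow exceptions for the crawlers you care about.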
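RFC 9309 resolves conflicts inside a group by choosing the longest matching path, with Allow winning a tie. The sketch below illustrates that precedence under simplifying assumptions: rules are plain prefixes, wildcards (* and $) are not handled, and the is_allowed function and sample paths are hypothetical.

```python
def is_allowed(rules, path):
    """Apply longest-match precedence to (directive, prefix) pairs.

    Wildcards (* and $) are deliberately not handled in this sketch.
    """
    best_len = -1
    allowed = True  # no matching rule means the path may be crawled
    for directive, prefix in rules:
        if not prefix or not path.startswith(prefix):
            continue
        is_allow = directive.lower() == "allow"
        # A longer prefix wins; on equal length, Allow wins.
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len = len(prefix)
            allowed = is_allow
    return allowed

rules = [("Disallow", "/api/"), ("Allow", "/api/public/")]
print(is_allowed(rules, "/api/public/status"))  # True
print(is_allowed(rules, "/api/internal/keys"))  # False
print(is_allowed(rules, "/tools/"))             # True
```

Because the decision depends only on prefix length, reversing the rule list gives the same answers. Parsers that stop at the first matching line can disagree, which is one reason to list narrow Allow exceptions before the broader Disallow.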

The sitemap URL is missing or wrong

Use the absolute production sitemap URL and verify it returns valid XML.
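Both checks can be sketched offline with Python's standard library. The helper names below are hypothetical, the sample document is invented, and the code does not fetch anything: it assumes you already have the sitemap body as text. It also checks only the root element, not the full sitemap schema.

```python
from urllib.parse import urlsplit
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"

def is_absolute_url(url):
    """True for http(s) URLs that include a host name."""
    parts = urlsplit(url)
    return parts.scheme in ("http", "https") and bool(parts.netloc)

def looks_like_sitemap(xml_text):
    """True if the text parses as XML and has a sitemap root element."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return False
    return root.tag in (SITEMAP_NS + "urlset", SITEMAP_NS + "sitemapindex")

sample_xml = (
    '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
    "<url><loc>https://example.com/</loc></url>"
    "</urlset>"
)
print(is_absolute_url("https://example.com/sitemap.xml"))  # True
print(is_absolute_url("/sitemap.xml"))                     # False
print(looks_like_sitemap(sample_xml))                      # True
```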

Privacy note

Robots.txt drafting happens in the browser for normal use. Avoid pasting private admin URLs, tokenized paths, or sensitive staging locations unless those paths are already safe to discuss.

Related workflow

List crawl goals

Decide which public sections should be discoverable and which low-value paths should be avoided.

Generate directives

Draft User-agent, Allow, Disallow, and Sitemap lines with production path patterns.

Validate before deployment

Check representative URLs and confirm the sitemap URL works before replacing the live robots.txt file.
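The "generate directives" step can be sketched as a small rendering function. This is not the tool's actual implementation: the Group class, the render_robots function, and the example paths and sitemap URL are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass, field

@dataclass
class Group:
    """One User-agent group with its Allow and Disallow path prefixes."""
    user_agent: str
    allow: list = field(default_factory=list)
    disallow: list = field(default_factory=list)

def render_robots(groups, sitemaps=()):
    """Render groups and sitemap URLs as robots.txt text."""
    lines = []
    for group in groups:
        lines.append(f"User-agent: {group.user_agent}")
        # Allow exceptions go first so order-sensitive parsers see them early.
        lines += [f"Allow: {path}" for path in group.allow]
        lines += [f"Disallow: {path}" for path in group.disallow]
        lines.append("")  # a blank line separates groups
    lines += [f"Sitemap: {url}" for url in sitemaps]
    return "\n".join(lines).rstrip("\n") + "\n"

draft = render_robots(
    [Group("*", allow=["/api/public/"], disallow=["/api/", "/search"])],
    sitemaps=["https://example.com/sitemap.xml"],
)
print(draft)
```

Keeping the rules as data before rendering makes it easier to review crawl goals with developers and to run checks against representative URLs before the file goes live.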

How It Works

Choose Preset or Custom

Pick from Allow All, Block All, Standard, or Block AI Bots.

Add Rules

Configure user-agent, allow, and disallow paths.

Copy Output

Copy the generated robots.txt and upload it to your site root.

Features

Presets

Allow All, Block All, Standard, and Block AI Bots.

Multi-Rule

Add rules for different user agents.

Sitemap & Crawl Delay

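Include a sitemap URL and an optional Crawl-delay value. Crawl-delay is not part of the core robots.txt standard (RFC 9309), and some crawlers, including Googlebot, ignore it.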

Valid Syntax

The output always uses valid robots.txt syntax.

Examples

Static utility site

Allow public tool pages while disallowing API routes and adding the production sitemap URL.

Staging cleanup

Draft rules for temporary paths, then confirm private staging content is protected by authentication instead of robots alone.

Frequently Asked Questions

Where do I put robots.txt?

Upload it to your website root so that it is served at https://yourdomain.com/robots.txt. Each host, including each subdomain, needs its own file.
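Crawlers look for the file at the root of the same scheme and host as the page they want to fetch. A small sketch of that lookup, assuming a hypothetical robots_url helper and placeholder URLs:

```python
from urllib.parse import urlsplit, urlunsplit

def robots_url(page_url):
    """Return the robots.txt location that governs the given page URL."""
    parts = urlsplit(page_url)
    # Keep scheme and host (with any port); drop path, query, and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))

print(robots_url("https://yourdomain.com/blog/post?id=1"))
# https://yourdomain.com/robots.txt
print(robots_url("https://shop.yourdomain.com/cart"))
# https://shop.yourdomain.com/robots.txt
```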

Can robots.txt hide private content?

No. Robots.txt is only a crawl instruction for compliant bots. Private content still needs authentication, authorization, or server-level blocking.

What does Disallow mean in robots.txt?

Disallow tells matching crawlers not to crawl a path pattern. It does not remove already indexed URLs by itself.

Should robots.txt include a sitemap line?

Yes, when you have a public XML sitemap. Use the absolute sitemap URL so crawlers can discover canonical pages more easily.

How do I test a robots.txt rule?

Check representative URLs against the exact User-agent group and verify that important public pages are not accidentally blocked.
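One way to run such a check is a small table of expectations evaluated with Python's standard urllib.robotparser module. Everything in this sketch is hypothetical: the rules, the bot names, the example.com origin, and the failed_expectations helper. The standard-library parser also does not understand * and $ wildcards in paths, so results for a specific search engine should still be confirmed with that engine's own tools.

```python
from urllib.robotparser import RobotFileParser

ROBOTS_TXT = """\
User-agent: *
Disallow: /search

User-agent: ExampleBot
Disallow: /drafts/
"""

# (user agent, path, expected to be crawlable?)
EXPECTATIONS = [
    ("OtherBot", "/search", False),
    ("OtherBot", "/tools/", True),
    ("ExampleBot", "/drafts/post", False),
    # ExampleBot has its own group, so the * group no longer applies to it.
    ("ExampleBot", "/search", True),
]

def failed_expectations(robots_txt, expectations, origin="https://example.com"):
    """Return the (agent, path) pairs whose result differs from the expectation."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [
        (agent, path)
        for agent, path, expected in expectations
        if parser.can_fetch(agent, origin + path) != expected
    ]

failures = failed_expectations(ROBOTS_TXT, EXPECTATIONS)
print(failures)  # []
```

The last expectation is the one that most often surprises people: a crawler with a dedicated group ignores rules written for the * group unless they are repeated in its own group.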

What belongs outside robots.txt?

Secrets, admin screens, account data, and staging systems need authentication, authorization, noindex where appropriate, or network restrictions. Robots directives are public and should not reveal sensitive paths unnecessarily.

Can robots rules remove a page from Google?

A Disallow rule can prevent crawling, but it does not always remove a known URL from search results. For removal, use noindex on pages that remain crawlable, or use the removal tools in Google Search Console when appropriate.

Should I block CSS, JavaScript, or images?

Usually no. Search engines often need assets to render and understand pages correctly. Block only paths that truly should not be crawled, such as internal search results or duplicate utility endpoints.

Can I create separate rules for AI crawlers?

You can add User-agent groups for specific crawlers, including AI-related bots, but behavior depends on whether each crawler honors robots.txt. Review official bot names and keep important search crawlers separate.
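As an illustration, one group can list several user agents and share a single Disallow rule, while search crawlers fall through to the general group. The tokens in AI_AGENTS are examples only and should be confirmed against each operator's current documentation; the ai_block_group helper and the paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Example tokens only; verify current names with each crawler's operator.
AI_AGENTS = ["GPTBot", "CCBot", "Google-Extended"]

def ai_block_group(agents):
    """Build one group that disallows the whole site for the listed agents."""
    lines = [f"User-agent: {name}" for name in agents]
    lines.append("Disallow: /")
    return "\n".join(lines) + "\n"

# The general group stays separate so search crawlers keep their own rules.
robots_txt = ai_block_group(AI_AGENTS) + "\nUser-agent: *\nDisallow: /api/\n"

parser = RobotFileParser()
parser.parse(robots_txt.splitlines())
print(parser.can_fetch("GPTBot", "https://example.com/tools/"))     # False
print(parser.can_fetch("Googlebot", "https://example.com/tools/"))  # True
print(parser.can_fetch("Googlebot", "https://example.com/api/x"))   # False
```

These results describe what the file asks for, not what any crawler will actually do; a bot that ignores robots.txt is unaffected.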

What is the safest default robots.txt?

For a public utility site, a safe default usually allows important pages, disallows private or duplicate paths, and links the sitemap. Avoid broad sitewide blocks unless the whole site is intentionally unavailable to crawlers.

How often should robots.txt be reviewed?

Review it whenever routes, staging paths, API endpoints, or sitemap locations change. A stale rule can accidentally hide useful pages from compliant crawlers or expose crawl paths that should have stayed internal.
