Robots.txt Generator

Generate a properly configured robots.txt file directly in your AI coding assistant. Includes AI crawler directives for GPTBot (OpenAI), Google-Extended, CCBot, and anthropic-ai. Automatic framework detection for Next.js, Astro, React, and WordPress. Add sitemap references, block admin paths, and optimize crawl budget.

Install and use:

curl -fsSL https://suparank.io/install | bash
/suparank-robots

Features

Framework Detection

Auto-detects Next.js, Astro, WordPress, and other frameworks

AI Crawler Rules

Includes rules for GPTBot, ClaudeBot, PerplexityBot, and Google-Extended

Path Analysis

Identifies admin, API, and build paths to block automatically

Sitemap Reference

Adds a proper Sitemap directive for search engines

llms.txt Reference

Includes an llms.txt comment for AI content discovery

Best Practices

Follows Yoast, Google, and industry recommendations

Example robots.txt Output

# Suparank - https://suparank.io
# Generated by Suparank robots.txt Generator

User-agent: *
Allow: /

# Block non-content paths
Disallow: /api/
Disallow: /_astro/

# AI Crawlers - Allow for AI search visibility
User-agent: GPTBot
Allow: /

User-agent: ClaudeBot
Allow: /

User-agent: Google-Extended
Allow: /

User-agent: PerplexityBot
Allow: /

# Sitemap
Sitemap: https://suparank.io/sitemap.xml

# LLMs.txt for AI content discovery
# https://suparank.io/llms.txt
# Full version: https://suparank.io/llms-full.txt

Frequently Asked Questions

What is a robots.txt file?

A robots.txt file is a text file placed in your website's root directory that tells search engine crawlers which pages or sections of your site they may and may not crawl. It's part of the Robots Exclusion Protocol (REP) and is the first file crawlers check when visiting your site. Modern robots.txt files also include sitemap references and AI crawler directives.
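
For example, a minimal robots.txt that allows all crawlers and points to a sitemap looks like this (yourdomain.com is a placeholder):

User-agent: *
Allow: /

Sitemap: https://yourdomain.com/sitemap.xml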

How do I block AI crawlers like GPTBot?

To block GPTBot (OpenAI's crawler), add "User-agent: GPTBot" followed by "Disallow: /" to your robots.txt. You can also block Google-Extended (Google's AI training crawler), CCBot (Common Crawl), and ClaudeBot or anthropic-ai (Anthropic's Claude). Each AI company uses its own user agent, so you need a separate rule for each crawler you want to block.
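
As a sketch, blocking the AI crawlers named above (adjust the list to match your own policy) looks like this:

# Example only - block AI crawlers, adjust to your policy
User-agent: GPTBot
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: ClaudeBot
Disallow: /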

Should I add sitemap to robots.txt?

Yes, adding your sitemap URL to robots.txt helps search engines discover all your pages faster. Use the format "Sitemap: https://yourdomain.com/sitemap.xml". You can list multiple sitemaps if needed. This is especially important for large sites or when you have dynamic content that changes frequently.
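
For example, a site with separate sitemaps for posts and pages (hypothetical file names) would list both:

Sitemap: https://yourdomain.com/post-sitemap.xml
Sitemap: https://yourdomain.com/page-sitemap.xml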

What paths should I block in robots.txt?

Commonly blocked paths include admin areas (/admin/, /wp-admin/), API endpoints (/api/), build directories (/_next/, /_astro/), temporary files, search result URLs (?s=), and user-specific pages. Blocking these paths optimizes crawl budget by focusing crawlers on your important content pages. Never block CSS, JavaScript, or image files, since search engines need them to render your pages correctly and blocking them can hurt SEO.
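
A typical block section combining these rules (the paths shown are common examples, not requirements for every site) could look like:

# Example only - block non-content paths, keep anything crawlers need to render pages
User-agent: *
Allow: /
Disallow: /wp-admin/
Disallow: /api/
Disallow: /_next/
Disallow: /*?s=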

Generate Your robots.txt

Install and create an optimized robots.txt in seconds.

curl -fsSL https://suparank.io/install | bash