技術インフラ

Robots.txt

robots.txt is a text file placed in the root directory of a website that instructs search engine crawlers which pages or files they are allowed or disallowed to request. It's the first line of defense in controlling how bots interact with your site infrastructure and helps optimize crawl budget.

技術インフラ

SEO対策

Crawl Management

Directing Bots to Your Best Content

Google allocates a limited "crawl budget" to your site—the number of pages its bots will crawl per day. If bots waste time crawling admin panels, duplicate printer-friendly pages, or cart/checkout URLs, they might miss your valuable translated product pages. robots.txt tells bots "Don't waste time on /admin/, focus on /en/, /fr/, /de/ instead." For international sites, you should disallow crawling of language auto-detection redirect pages, API endpoints, and any technical URLs that don't need to be indexed. However, NEVER accidentally block your language directories—that's a catastrophic mistake that kills all international SEO.

Allowing vs. Disallowing Crawl Access

アスペクト

なし

With Robots.txt

Allow (Default)

Bots crawl everything: content + technical pages

Wastes crawl budget on unimportant pages

Strategic Disallow

Disallow: /admin/, /cart/, /api/

Focuses bots on indexable content

International Example

Allow: /en/, /fr/, /de/ (language directories)

Disallow: /lang-detect/ (technical redirect)

Critical Mistake

Disallow: /fr/ (blocks French site)

French content never indexed - DISASTER

現実世界の影響

以前

現在のアプローチ

📋 シナリオ

Site has no robots.txt, bots crawl 10,000 cart URLs

⚙️ 起こること

Crawl budget wasted, product pages crawled slowly

📉

事業への影響

New products take weeks to appear in search

その後

最適化された解

📋 シナリオ

Add robots.txt: Disallow /cart/, /checkout/, /api/

⚙️ 起こること

Bots focus 100% on product and language pages

📈

事業への影響

New products indexed within 24 hours

Robots.txt

Directing Bots to Your Best Content

Allowing vs. Disallowing Crawl Access

現実世界の影響

さらに多くの用語集を読み続ける

API(アプリケーションプログラミングインターフェース)

クライアントサイドレンダリング(CSR)

コアウェブのバイタル

エッジデリバリーネットワーク

マスターの準備が整いました Robots.txt?