# AI-ML Companion - Content Protection
# Block AI training crawlers and content scrapers

# AI Training Bots
User-agent: GPTBot
Disallow: /

User-agent: ChatGPT-User
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: anthropic-ai
Disallow: /

User-agent: Claude-Web
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: cohere-ai
Disallow: /

User-agent: PerplexityBot
Disallow: /

User-agent: Bytespider
Disallow: /

User-agent: FacebookBot
Disallow: /

# Common Scrapers
User-agent: MJ12bot
Disallow: /

# AhrefsBot and SemrushBot allowed - they help track YOUR backlinks and SEO visibility
# Blocking them only hides your site from your own SEO tools

User-agent: DotBot
Disallow: /

User-agent: BLEXBot
Disallow: /

# Allow legitimate search engines (for SEO/discoverability)
User-agent: Googlebot
Allow: /
Disallow: /assets/

User-agent: Bingbot
Allow: /
Disallow: /assets/

User-agent: *
Allow: /
Disallow: /assets/
Disallow: /api/

# Sitemap
Sitemap: https://aimlcompanion.ai/sitemap.xml