• Cynicus Rex (OP)
    1 month ago

    #TL;DR:

    User-agent: GPTBot
    Disallow: /
    User-agent: ChatGPT-User
    Disallow: /
    User-agent: Google-Extended
    Disallow: /
    User-agent: PerplexityBot
    Disallow: /
    User-agent: Amazonbot
    Disallow: /
    User-agent: ClaudeBot
    Disallow: /
    User-agent: Omgilibot
    Disallow: /
    User-agent: FacebookBot
    Disallow: /
    User-agent: Applebot
    Disallow: /
    User-agent: anthropic-ai
    Disallow: /
    User-agent: Bytespider
    Disallow: /
    User-agent: Claude-Web
    Disallow: /
    User-agent: Diffbot
    Disallow: /
    User-agent: ImagesiftBot
    Disallow: /
    User-agent: Omgili
    Disallow: /
    User-agent: YouBot
    Disallow: /
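
For anyone who wants to sanity-check rules like these before deploying them, here is a minimal sketch using Python's standard `urllib.robotparser`. The helper names `build_rules` and `is_allowed`, the URL `https://example.com/post/1`, and the sample browser string are made up for illustration. It only shows how a compliant parser would read the file; it says nothing about what a given crawler actually does.

```python
from urllib.robotparser import RobotFileParser

# Agent tokens copied from the robots.txt above (duplicates dropped).
BLOCKED_AGENTS = [
    "GPTBot", "ChatGPT-User", "Google-Extended", "PerplexityBot",
    "Amazonbot", "ClaudeBot", "Omgilibot", "FacebookBot", "Applebot",
    "anthropic-ai", "Bytespider", "Claude-Web", "Diffbot",
    "ImagesiftBot", "Omgili", "YouBot",
]


def build_rules(agents):
    """Return robots.txt text with one 'Disallow: /' group per agent."""
    lines = []
    for agent in agents:
        lines.append(f"User-agent: {agent}")
        lines.append("Disallow: /")
        lines.append("")  # blank line between groups
    return "\n".join(lines)


def is_allowed(robots_txt, user_agent, url="https://example.com/post/1"):
    """Ask the standard-library parser whether user_agent may fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


ROBOTS_TXT = build_rules(BLOCKED_AGENTS)

# A listed agent is refused by the parser.
gptbot_allowed = is_allowed(ROBOTS_TXT, "GPTBot")

# An agent that is not listed is allowed, because there is no
# 'User-agent: *' group in the file. The string below is a hypothetical
# browser-style user agent.
browser_allowed = is_allowed(ROBOTS_TXT, "Mozilla/5.0 (X11; Linux x86_64)")
```

Since the file has no `User-agent: *` group, any agent that is not on the list is allowed by default, which is worth keeping in mind when reading the replies.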
    
    • mox
      1 month ago

      Of course, nothing stops a bot from sending a User-Agent string that exactly matches a web browser's.

      • JackbyDev
        1 month ago

        Nothing stops a bot from choosing not to read robots.txt at all.

        • mox
          1 month ago

          Indeed, as has already been said repeatedly in other comments.