• Cynicus Rex@lemmy.mlOP
    link
    fedilink
    arrow-up
    13
    arrow-down
    3
    ·
    3 months ago

    #TL;DR:

    User-agent: GPTBot
    Disallow: /
    User-agent: ChatGPT-User
    Disallow: /
    User-agent: Google-Extended
    Disallow: /
    User-agent: PerplexityBot
    Disallow: /
    User-agent: Amazonbot
    Disallow: /
    User-agent: ClaudeBot
    Disallow: /
    User-agent: Omgilibot
    Disallow: /
    User-Agent: FacebookBot
    Disallow: /
    User-Agent: Applebot
    Disallow: /
    User-agent: anthropic-ai
    Disallow: /
    User-agent: Bytespider
    Disallow: /
    User-agent: Claude-Web
    Disallow: /
    User-agent: Diffbot
    Disallow: /
    User-agent: ImagesiftBot
    Disallow: /
    User-agent: Omgilibot
    Disallow: /
    User-agent: Omgili
    Disallow: /
    User-agent: YouBot
    Disallow: /
    
    • mox@lemmy.sdf.org
      link
      fedilink
      arrow-up
      7
      ·
      3 months ago

      Of course, nothing stops a bot from picking a user agent field that exactly matches a web browser.

      • JackbyDev
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        1
        ·
        3 months ago

        Nothing stops a bot from choosing to not read robots.txt