Blocking AI crawlers on the fediverse

cecep@fedia.io · 2 years ago

CameronDev · 2 years ago

But robots.txt is not a legal document — and 30 years after its creation, it still relies on the good will of all parties involved

You can ask nicely, they can (and will) ignore it.

lad · 2 years ago

Also, I’ve already seen complaints about AI companies scraping everything ignoring robots.txt

And we would block the obedient and useful crawlers while doing no harm to malicious