Luu Tuyen@lemmy.world to Technology@lemmy.worldEnglish · 22 hours agoTikTok’s parent launched a web scraper that’s gobbling up the world’s online data 25-times faster than OpenAIfortune.comexternal-linkmessage-square72fedilinkarrow-up1393arrow-down15cross-posted to: [email protected]
arrow-up1388arrow-down1external-linkTikTok’s parent launched a web scraper that’s gobbling up the world’s online data 25-times faster than OpenAIfortune.comLuu Tuyen@lemmy.world to Technology@lemmy.worldEnglish · 22 hours agomessage-square72fedilinkcross-posted to: [email protected]
minus-squarepurrtastic@lemmy.nzlinkfedilinkEnglisharrow-up34·14 hours agoIt’s not fine. They are not archiving the internet. I had to ban their user agent after very aggressive scraping that would have taken down our servers. Fuck this shitty behaviour.
minus-squareMelvin_Ferd@lemmy.worldlinkfedilinkEnglisharrow-up4·14 hours agoIsn’t there a way to limit requests so that traffic isn’t bringing down your servers
minus-squareMojave@lemmy.worldlinkfedilinkEnglisharrow-up8·12 hours agoThey obfuscate their traffic by randomizing user agents, so it’s either add a global rate limit, or let them ass fuck you
It’s not fine. They are not archiving the internet.
I had to ban their user agent after very aggressive scraping that would have taken down our servers. Fuck this shitty behaviour.
Isn’t there a way to limit requests so that traffic isn’t bringing down your servers
They obfuscate their traffic by randomizing user agents, so it’s either add a global rate limit, or let them ass fuck you