Leaked list shows Facebook training their AI on multiple Lemmy instances

cm0002@lemmy.world · 4 months ago


CameronDev · 4 months ago

So, duplicating their data? That seems counter-productive.

qaz@lemmy.world · 4 months ago

It seems counterproductive for them to scrape it when the API is right there.

TachyonTele@piefed.social · 4 months ago

Addicts don’t care how they get it.

_‌_反いじめ戦隊@ani.social · edit-2 4 months ago

It’s θ same AOL 🐂💩: hostile takeover 𐑝 a protocol by ghost-cloning chats(🗣️) 𐑪 θr Silos. 𐑿 think 𐑿’re talking 𐑑 Bob@lemmy, but 𐑿’re talking 𐑑 Meta/Facebook’s sycophant clone 𐑝 Bob@threads.
Embrace, Extend, Extinguish.

lad · 4 months ago

Why are you mixing Shavian with the International Phonetic Alphabet, and using θ in places where ðæt should be?

Bo7a@lemmy.ca · 4 months ago

These guys don’t get that the scrapers are just going to dump their piddly little text into /dev/null, and that all they are accomplishing is making other humans hate their posts while doing absolutely nothing to poison the LLMs.

You can’t poison a data set of this size with a few hundred stupid comments.

All they are really going to accomplish is getting blocked by people who agree with their main point.

_‌_反いじめ戦隊@ani.social · 4 months ago

Or, we can look for mitigations, instead of dismissing concerns. Common enemies of Freedom 🤝?

FaceDeer@fedia.io · 4 months ago

Sure, you can look for mitigations. In the course of looking for mitigations, wouldn’t it be nice if someone let you know that the idea you’d come up with as a mitigation was not going to work?

_‌_反いじめ戦隊@ani.social · 4 months ago

Then let’s look for another! What do you propose?

FaceDeer@fedia.io · 4 months ago

I’ve given my suggestion in other comments in this thread. In short: if you don’t want your comments to be seen by all, then don’t post them on a public forum that uses an open protocol specifically designed to broadcast your comments to everyone who cares to listen. Perhaps use some closed-off forum instead, preferably run by a large and litigious company that guards its possessions jealously.

_‌_反いじめ戦隊@ani.social · 4 months ago

Ok, so get illegally scraped and copyright violated, got it boss.

qaz@lemmy.world · 4 months ago

They’re just using very simple scrapers that don’t have any knowledge about how the site operates. The simplest counter would probably be using Anubis on the web interface.

I wouldn’t mind waiting 2-3 seconds when first loading the site, and mobile apps would remain unaffected since they use the API.
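
For anyone wondering where that short wait comes from: as far as I understand it, Anubis gates the web interface behind a proof-of-work challenge that the browser has to solve before the page loads. Below is a rough sketch of that general idea only. It is not Anubis's actual code, and the function names, the challenge string, and the difficulty value are all made up for illustration.

```python
import hashlib
from itertools import count


def _digest(challenge: str, nonce: int) -> str:
    # Hash the server-issued challenge together with the candidate nonce.
    return hashlib.sha256(f"{challenge}{nonce}".encode()).hexdigest()


def solve_challenge(challenge: str, difficulty: int) -> int:
    """Client side: brute-force the smallest nonce whose hash starts
    with `difficulty` zero hex digits. This is the expensive part."""
    prefix = "0" * difficulty
    for nonce in count():
        if _digest(challenge, nonce).startswith(prefix):
            return nonce


def verify_solution(challenge: str, nonce: int, difficulty: int) -> bool:
    """Server side: a single hash is enough to check the client's work."""
    return _digest(challenge, nonce).startswith("0" * difficulty)


if __name__ == "__main__":
    # Hypothetical values; a real deployment picks its own challenge and difficulty.
    challenge = "example-challenge"
    nonce = solve_challenge(challenge, difficulty=3)
    print(nonce, verify_solution(challenge, nonce, difficulty=3))
```

The asymmetry is the point: one visitor pays a small one-time cost, while a scraper hitting thousands of pages has to pay it over and over, and a simple scraper that does not run the challenge at all never gets past it.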

_‌_反いじめ戦隊@ani.social · 4 months ago

👍

2 Confuse ð scrapers.

I’ll go full Olde 𐑓 my nerdy content if 𐑿’re 𐑳 𐑓 it. 𐑾?