A new tool lets artists add invisible changes to the pixels in their art before they upload it online so that if it’s scraped into an AI training set, it can cause the resulting model to break in chaotic and unpredictable ways.

The tool, called Nightshade, is intended as a way to fight back against AI companies that use artists’ work to train their models without the creator’s permission.
[…]
Zhao’s team also developed Glaze, a tool that allows artists to “mask” their own personal style to prevent it from being scraped by AI companies. It works in a similar way to Nightshade: by changing the pixels of images in subtle ways that are invisible to the human eye but manipulate machine-learning models to interpret the image as something different from what it actually shows.

  • ayaya · 7 points · 11 months ago

    Obviously this is using some bug and/or weakness in the existing training process, so couldn’t they just patch the mechanism being exploited?

    Or at the very least you could take a bunch of images, purposely poison them, and now you have a set of poisoned images and their non-poisoned counterparts, allowing you to train another model to undo it (see the sketch after this comment).

    Sure, you’ve set up a speed bump, but this is hardly a solution.
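
    A minimal sketch of that counter-poisoning idea, assuming you already have a DataLoader of aligned (poisoned, clean) image tensors; pair_loader below is hypothetical, and the tiny residual conv net is purely illustrative:

    ```python
    import torch
    import torch.nn as nn

    # Illustrative sketch only: learn to undo a known poisoning transform
    # from (poisoned, clean) image pairs.
    class Unpoison(nn.Module):
        def __init__(self):
            super().__init__()
            self.net = nn.Sequential(
                nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(),
                nn.Conv2d(32, 32, 3, padding=1), nn.ReLU(),
                nn.Conv2d(32, 3, 3, padding=1),
            )

        def forward(self, x):
            return x + self.net(x)  # predict a residual correction to the pixels

    model = Unpoison()
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    loss_fn = nn.MSELoss()

    for poisoned, clean in pair_loader:  # pair_loader: hypothetical DataLoader of aligned pairs
        opt.zero_grad()
        loss = loss_fn(model(poisoned), clean)  # push the cleaned output toward the original
        loss.backward()
        opt.step()
    ```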

    • @nous · 1 point · 11 months ago

      Yes, it likely exploits some weakness in the current models, and new models would have to be trained not to have the same problems with these poisoned images.

      “so couldn’t they just patch the mechanism being exploited?”

      The reality is that it likely can’t be patched away as quickly and easily as you make it sound. Training new models is expensive and takes time, and you also have to find and figure out what exactly is causing the problem in the first place, which may or may not be a trivial task, especially when it is hard to understand exactly what the models are really doing.

      “Sure you’ve set up a speedbump but this is hardly a solution.”

      A speed bump is really all there is to a lot of problems like this. Like security in general, it is a giant game of cat and mouse, with each side constantly chasing the next big exploit or fix. This will likely be patched in models eventually, but then some new exploit will be found and the whole process starts over again. There are no final solutions to problems like this, just each side trying to one-up the other in an ever-evolving landscape; it is and will be a constant fight to keep up with the other side. This news gives artists a new tool to help them, at least in the short term, and one that can likely be adapted to keep it relevant for a while longer.

    • @[email protected] · 1 point · 11 months ago

      No! It’s not using an internal exploit. It’s rather about finding a way to visually represent almost the same image while its latent features point to a different artist (which would, e.g., confuse a DreamBooth + LoRA training run). However, the method they propose is flawed; I commented more at https://lemmy.world/comment/4770884
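
      For intuition, here is a hedged sketch of that general family of embedding-space attacks, not Nightshade’s published algorithm: nudge the pixels within a small budget so the image’s embedding moves toward a different target concept while it still looks the same to a human. The CLIP checkpoint, prompt, and file path are placeholders:

      ```python
      import torch
      from PIL import Image
      from transformers import CLIPModel, CLIPProcessor

      # Generic embedding-space attack sketch; NOT the published Nightshade method.
      model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
      processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

      image = Image.open("artwork.png").convert("RGB")  # placeholder path
      pixel_values = processor(images=image, return_tensors="pt")["pixel_values"]

      text = processor(text=["a cubist painting"], return_tensors="pt", padding=True)
      with torch.no_grad():
          target = model.get_text_features(**text)
          target = target / target.norm(dim=-1, keepdim=True)

      delta = torch.zeros_like(pixel_values, requires_grad=True)
      eps, step = 0.03, 0.005  # small budget keeps the change near-invisible
      for _ in range(50):
          emb = model.get_image_features(pixel_values=pixel_values + delta)
          emb = emb / emb.norm(dim=-1, keepdim=True)
          loss = -(emb * target).sum()  # maximize similarity to the target concept
          loss.backward()
          with torch.no_grad():
              delta -= step * delta.grad.sign()  # signed-gradient step (PGD-style)
              delta.clamp_(-eps, eps)            # stay within the perturbation budget
              delta.grad.zero_()
      ```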

    • AnonTwo · -1 points · 11 months ago

      “Obviously this is using some bug and/or weakness in the existing training process, so couldn’t they just patch the mechanism being exploited?”

      I’d assume the issue is that if someone tried to patch it out, it could legally be shown they were disregarding people’s copyright.

      • FaceDeer · 11 points · 11 months ago

        It isn’t against copyright to train models on published art.

        • AnonTwo · 2 points · 11 months ago

          The general legal argument is that the AI has no exact memory of the copyrighted material.

          But if that’s the case, then these pixels shouldn’t need to be patched, because the model wouldn’t remember the material that spawned them.

          That’s just the argument I assume would be used.

          • 📛Maven · 9 points · 11 months ago

            It’s like training an artist who’s never seen a banana or a fire hydrant, by passing them pictures of fire hydrants labelled “this is a banana”. When you ask for a banana, you’ll get a fire hydrant. Correcting that mistake doesn’t mean “undoing pixels”, it means teaching the AI what bananas and fire hydrants are.
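
            In data terms, the poisoning in that analogy is just mislabeled training pairs; a toy illustration (filenames and captions invented):

            ```python
            # Toy illustration of concept poisoning via mislabeled image/caption pairs.
            poisoned_pairs = [
                ("fire_hydrant_001.png", "a photo of a banana"),
                ("fire_hydrant_002.png", "a ripe banana on a table"),
                # ...mixed into an otherwise normal dataset, these pairs teach the
                # model to associate hydrant-shaped features with the word "banana".
            ]
            ```

            Once the model has internalized that association, the fix is retraining on correct labels, not reversing anything in the poisoned pixels.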

          • FaceDeer · 4 points · 11 months ago

            Well, I guess we’ll see how that argument plays in court. I don’t see how it follows, myself.

          • FaceDeer · 8 points · 11 months ago

            In order to violate copyright you need to copy the copyrighted material. Training an AI model doesn’t do that.

    • MxM111 · -4 points · 11 months ago

      Obviously, with so many different AIs, this cannot come down to a bug in any one of them.

      If you have no problem looking at the image, then the AI won’t either. After all, both you and the AI are neural networks.

      • skulblaka · 8 points · 11 months ago

        The neural network of a human and of an AI operate in fundamentally different ways. They also interact with an image in fundamentally different ways.

        • MxM111 · 2 points · 11 months ago

          I would not call it “fundamentally” different at all. Compared to, say, a regular computer running a non-neural-network program, they are quite similar and have similar properties. They can make mistakes, hallucinate, etc.

          • @[email protected] · 3 points · 11 months ago

            As a person who has done machine learning and some AI training, and who has a psychotic disorder, I hate that they call it hallucinations. It’s not hallucinations: human hallucinations and AI hallucinations are different things. One is based on limited data, bias, or a bad data set, which builds fundamentally bad connections in the neural network, and that can be repaired. The other is something that cannot be repaired: you are not working with bad data, your brain can’t filter data correctly, and you are building wrong connections. It’s like an overdrive of input and connections that are all wrong, so you’re seeing things, hearing things, or believing things that aren’t real; you make logical leaps that are irrational and not true, and reality splits for you.

            While similarities exist, one happens because people input data wrong, cleaned it wrong, or didn’t have enough of it, and the other happens because the human brain has a wiring problem caused by a variety of factors. It’s insulting, and it also humanizes computers too much while degrading people with this illness.

            • MxM111 · 1 point · 11 months ago

              As I understand it, healthy people hallucinate all the time, but in a different, non-psychiatric sense. It is just that a healthy brain has an extra filter that rejects all hallucinations that do not correspond to the signal coming from reality; our brain performs these extra checks constantly. But we often get fooled when the checks are not done correctly. For example, you can think that you saw some animal when it was just a shadow. There is even the claim that our perception of the world is a “controlled hallucination”, because we mostly imagine the world and then best-fit that model to minimize the error from external stimuli.

              Of course, current ANNs do not have such extensive error checking, and thus they are more prone to those “hallucinations”. But fundamentally those are very similar to the generative suggestions our brains produce.

              • @[email protected] · 1 point · 11 months ago

                Those aren’t quite the same as hallucinations, and we don’t actually call them hallucinations; hallucination is a medical term. Those are visual disturbances, not “controlled hallucinations”. Your brain filtering it out, and your ability to ignore it, makes it not a hallucination. It’s “hallucination” in a colloquial sense, not a medical one.

                Fundamentally, AI is not working the same way. You are experiencing a leftover process from a time when every shadow was a potential danger, so seeing a threat in the shadow first and triggering fight or flight was best for the species. AI has no fight or flight. AI has no motivation; AI just has limited, bad, or biased data that we put there, and it spits out garbage. It is a computer with no sentience. You are not really error-checking; you are processing more information, or reassessing once the fight-or-flight response goes down. AI doesn’t have more information to process.

                Many don’t see people with psychotic disorders as equal people. They see them as dangerous, as people to be locked away. They use their illnesses and problems as jokes and slurs. Using terms for their illness in things like this only adds to the stigma.

                • MxM111 · 1 point · edited · 11 months ago

                  You are arguing about terminology. Please google “controlled hallucinations” to see how people use the term in a non-psychiatric way.

                  • @[email protected] · 1 point · 11 months ago

                    I know how it is used in a non-psychiatric way; I brought up that it can be used colloquially. That doesn’t diminish the way it can be used to harm and stigmatize an already stigmatized group of people. There are other terms that could be used, but this one is used because people want to humanize AI and do not care about dehumanizing people who have psychotic disorders.

                    The fact remains that AI creators are not people who specialize in human brains, yet they act as if computers and human brains were one and the same. Similarity doesn’t mean the same processes. They could choose different language, but they do not. They could call it a processing error, a glitch, a distortion; all would be accurate. But no, they chose a term that is harmful to a minority group, because no one cares about stigmatizing them.

      • @[email protected] · 1 point · 11 months ago

        An AI doesn’t see images like we do. An AI sees a matrix of RGB values and the relationships they have with each other, and it builds a statistical model of the color value of each pixel for a given prompt.
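
        A quick illustration of what a model actually receives (the file path is a placeholder):

        ```python
        import numpy as np
        from PIL import Image

        # The model never sees a "picture", just an array of numbers.
        img = np.asarray(Image.open("artwork.png").convert("RGB"))  # placeholder path
        print(img.shape)  # e.g. (512, 512, 3): height x width x RGB channels
        print(img[0, 0])  # one pixel, e.g. [r g b] with values in 0..255
        ```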

        • @[email protected] · 1 point · 11 months ago

          That’s not quite how it works. The pixels are just the first layer. Those get broken down into edges. The edges get broken down into shapes. The shapes get broken down into features like eyes, noses, etc. Those get broken down into faces. And so on. It’s hierarchical feature detection, which also happens to be what the human brain does.

          The actual “drawing” the AI does is quite a bit different, however. Diffusion works by starting with random noise and then gradually denoising it until an image emerges. While humans can approach painting that way, it’s rarely done like that.
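
          The denoising loop looks roughly like this with Hugging Face diffusers; the pretrained checkpoint is just an example:

          ```python
          import torch
          from diffusers import DDPMScheduler, UNet2DModel

          # Sketch of reverse diffusion: start from noise, remove a little each step.
          model = UNet2DModel.from_pretrained("google/ddpm-cat-256")  # example checkpoint
          scheduler = DDPMScheduler.from_pretrained("google/ddpm-cat-256")

          sample = torch.randn(1, 3, 256, 256)  # pure random noise
          for t in scheduler.timesteps:  # from the noisiest step down to zero
              with torch.no_grad():
                  noise_pred = model(sample, t).sample  # predict the noise at step t
              sample = scheduler.step(noise_pred, t, sample).prev_sample  # peel a bit off
          ```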