Screenshot of a tumblr post by hbmmaster:
the framing of generative ai as "theft" in popular discourse has really set us back so far like not only should we not consider copyright infringement theft we shouldn't even consider generative ai copyright infringement
who do you think benefits from redefining "theft" to include "making something indirectly derivative of something created by someone else"? because I can assure you it's not artists
okay I'm going to mute this post, I'll just say,
if your gut reaction to this is that you think this is a pro-ai post, that you think "not theft" means "not bad", I want you to think very carefully about what exactly "theft" is to you and what it is about ai that you consider "stealing".
do you also consider other derivative works to be "stealing"? (fanfiction, youtube poops, gifsets) if not, why not? what's the difference? because if the difference is actually just "well it's fine when a person does it" then you really should try to find a better way to articulate the problems you have with ai than just saying it's "stealing from artists".
I dislike ai too, I'm probably on your side. I just want people to stop shooting themselves in the foot by making anti-ai arguments that have broader anti-art implications. I believe in you. you can come up with a better argument than just calling it "theft".
This is interesting. I agree that stealing isn't the right category. Copyright infringement may be, but there needs to be a more specific question we are exploring.
Is it acceptable to make programmatic transformations of copyrighted source material without the copyright holder's permission for your own work?
Is it acceptable to build a product which contains the copyrighted works of others without their permission? Is it different if the works contained in the product are programmatically transformed prior to distribution?
Should the copyright holders be compensated for this? Is their permission necessary?
The same questions apply to the use of someone's voice or likeness in products or works.
Somebody correct me if I'm wrong, but my understanding of how image generation models and their training work is that the end product, in fact, does not contain any copyrighted material or any transformation of that copyrighted material. The training process refines a set of numbers in the model, but those numbers can't really be considered a transformation of the input.
To preface what I'm about to say: LLMs and image models are absolutely not intelligent, and it's fucking stupid that they're called AI at all. However, if you look at somebody's art and learn from it, you don't contain a copyrighted piece of their work in your head, or a transformation of that copyrighted work. You've just refined your internal computer's knowledge and understanding of the work. I believe the way image models are trained could be compared to that.
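To make the "refining a set of numbers" idea concrete, here is a toy sketch, nothing like an actual diffusion model: a one-parameter model "trained" by gradient descent on made-up example pairs. The point it illustrates is that what persists after training is only the fitted number, not the examples themselves.

```python
# Toy illustration (NOT a real image model): training nudges a stored
# number toward matching a pattern in the examples, but the trained
# "model" keeps only that number, not the training data.

def train(examples, steps=1000, lr=0.01):
    """Fit y = w * x to (x, y) pairs by gradient descent on one weight."""
    w = 0.0  # the entire model state is this single number
    for _ in range(steps):
        for x, y in examples:
            pred = w * x
            grad = 2 * (pred - y) * x  # derivative of (w*x - y)^2 w.r.t. w
            w -= lr * grad
    return w

examples = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]  # hypothetical "training data"
w = train(examples)
# The trained model is a single float near 2.0; it contains no copy of
# the (x, y) pairs it was trained on.
print(round(w, 3))  # prints 2.0
```

Real models have billions of such numbers instead of one, but the same relationship between training data and stored weights is the claim being debated above.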
the generated product absolutely contains elements of the things it copied from. imagine the difference between someone making a piece of art that is heavily inspired by someone else's work VS directly tracing the original and passing it off as entirely yours
I understand that's how you think of it, but I'm talking about the technology itself. There is absolutely no copy of the original work, in the sense of ones and zeros.
The image generation model itself does not contain any data at all that is any of the work it was trained on, so the output of the model can't be considered copyrighted work.
Yes, you can train models to copy artists' styles or work, but it's not like tracing the image at all. Your comparison is completely wrong. It is a completely unique image, generated from the model itself, because the model itself does not contain any of the original work.
This is generally correct, though diffusion models and GPTs work in totally different ways. Assuming an entity had lawful access to the image in the first place, nothing that persists in a trained diffusion model can be realistically considered to be a copy of any particular training image by anyone who knows wtf they're talking about.
The magic word here is transformative. If your use of source material is minimal and distinct, that's fair use.
If a 4 GB model contains the billion works it was trained on - it contains four bytes of each.
What the model does can be wildly different from any particular input.
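The "four bytes of each" quip above is just division, and the back-of-the-envelope arithmetic checks out (the 4 GB and one-billion figures are the illustrative numbers from the post, not measurements of any specific model):

```python
# Back-of-the-envelope check: if a model's weights somehow encoded every
# training image, each image's "share" of the file would be a few bytes,
# far too little to store any image.
model_size_bytes = 4 * 1024**3   # hypothetical 4 GB model
training_images = 1_000_000_000  # roughly a billion training images
bytes_per_image = model_size_bytes / training_images
print(bytes_per_image)  # prints 4.294967296
```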
Using people's work and math to make predictions is not transformative. Human creations are transformative.
Any transformation is transformative.