• verassol@lemmy.ml
    arrow-up
    43
    ·
    8 months ago

    StackOverflow: *cashes in by monetizing massive amounts of user-contributed content without consulting or compensating the users in any way*

    Users: *try to delete it all to prevent it*

    StackOverflow: *your contributions belong to the community, you can’t do that*

    Pretty fucked-up laws. A lot of lawsuits are going on right now against AI companies over similar issues. In this case, StackOverflow is entitled to be compensated for its partnership, and because the answers are all licensed under CC BY-SA, no one can complain. Now, that SA? Whatever.

    • 9point6@lemmy.world
      arrow-up
      15
      ·
      8 months ago

      That SA part needs to be tested in court against the AI models themselves

      A lot of this shittiness would probably go away if there were a risk that ingesting certain content would mean you need to release the actual model to the public.

      • verassol@lemmy.ml
        arrow-up
        4
        ·
        edit-2
        8 months ago

        Yeah, but their assumption is that you don’t? Neither attribution nor sharealike is being respected, not even full-on all-rights-reserved copyright. Anything public goes, and if questions are asked it’s “fair use”. If the user retains CC BY-SA over their content, why does paying StackOverflow a bunch of money entitle OpenAI to use it all under whatever terms they settled on? Boggles me.

        Now, say, Reddit’s Terms of Service state clearly that by submitting content you grant them “a worldwide, royalty-free, perpetual, irrevocable, non-exclusive, transferable, and sublicensable license to use, copy, modify, adapt, prepare derivative works of, distribute, store, perform, and display Your Content and any name, username, voice, or likeness (…) in all media formats and channels now known or later developed anywhere in the world.” Speaks volumes about why alternatives to these platforms (like Lemmy) matter.

          • verassol@lemmy.ml
            arrow-up
            3
            ·
            8 months ago

            That’s interesting. I was looking up “Lemmy Terms of Service” for comparison after getting that quote from the Reddit ToS and could not find anything for Lemmy.ml. Now that you’ve mentioned it, I looked on my Mastodon instance and found nothing either, just a privacy policy. That is indeed kinda weird. Some instances do have their own ToS though. There should at least be something granting a sublicense for distribution, to protect people running instances in locations where it’s relevant.

      • verassol@lemmy.ml
        arrow-up
        5
        ·
        8 months ago

        the claimants were set back because they’ve been asked to prove the connection between AI output and their specific inputs

        I mean, how do you do that for a closed-source model with secret training data? As far as I know, OpenAI has admitted to using large amounts of copyrighted content, countless books and newspaper material, all on the basis of fair use claims. Guess it would take a government entity actively going after them at this point.

          • verassol@lemmy.ml
            arrow-up
            2
            ·
            8 months ago

            Thank you for sharing. Your perspective broadens mine, but I feel a lot more negative about the whole “must benefit business” side of things. It is fruitless to hold any entity whatsoever accountable when a whole worldwide economy is in a free-for-all nuke-waving doom-embracing realpolitik vibe.

            Frankly, not sure what would be worse: economic collapse and the consequences to the people, or economic prosperity and… the consequences to the people. Long term, and coming from a country that is not exactly thriving in the grand scheme of things, I guess I’d take the former.

      • bitfucker
        arrow-up
        2
        ·
        8 months ago

        Yep. Can’t wait to overfit an LLM on a lot of copyrighted work and release it into the public domain. Let’s see if OpenAI gets pushback from copyright owners down the road.