• @[email protected]
    137
    7 months ago

    Not sure if someone else has brought this up, but this happens because these AI models are massively biased towards generating white people, so as a lazy “fix” they randomly add race tags to your prompts to get more racially diverse results.

    • @[email protected]
      English
      26
      7 months ago

      Exactly. I wish people had a better understanding of what’s going on technically.

      It’s not that the model itself has these biases. It’s that the instructions given to it are heavy-handed in trying to correct for an inversely skewed representation bias.

      So the models are literally given instructions like “if generating a person, add a modifier to evenly represent various backgrounds like Black, South Asian…”

      Here you can see that modifier being reflected back when the prompt is shared before the image.

      It’s like an ethnicity Mad Libs that the model is being instructed to fill out whenever it generates people.
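
      To make the mechanism concrete, here is a rough sketch of what that kind of prompt rewriting could look like. It is not the actual system prompt or code of any real product: `MODIFIERS`, `PERSON_WORDS`, and `augment_prompt` are hypothetical names, and the modifier list only contains the two examples quoted above, since the rest of the real list isn't known.

```python
import random

# Only the two modifiers quoted in the comment; the real list is unknown.
MODIFIERS = ["Black", "South Asian"]

# Hypothetical trigger words used to guess that a prompt depicts a person.
PERSON_WORDS = {"person", "people", "man", "woman", "doctor"}


def augment_prompt(prompt, rng):
    """Append a randomly chosen modifier if the prompt seems to mention a person."""
    words = {w.strip(".,!?").lower() for w in prompt.split()}
    if words & PERSON_WORDS:
        return f"{prompt}, {rng.choice(MODIFIERS)}"
    # Prompts without a person are passed through unchanged.
    return prompt


rng = random.Random(0)
print(augment_prompt("a photo of a cat", rng))
print(augment_prompt("a portrait of a doctor", rng))
```

      Because the rewritten prompt is what actually reaches the image generator, showing the prompt back to the user exposes the appended modifier.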

    • @[email protected]
      7
      7 months ago

      I mean, I don’t think it’s an easy thing to fix. How do you eliminate bias in the training data without eliminating a substantial percentage of your training data, which would significantly hinder performance?

      • @[email protected]
        English
        10
        7 months ago

        Rather than eliminating some of the training data, you could add more training data to create an even balance.
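
        As a simplified illustration of balancing by adding rather than removing, the sketch below tops up under-represented groups until every group matches the largest one. It uses plain oversampling (duplicating existing samples) as a stand-in for collecting genuinely new data, and `oversample_to_balance` and the toy data are hypothetical.

```python
import random
from collections import Counter


def oversample_to_balance(samples, key, rng):
    """Return samples plus random duplicates so every group is equally large."""
    groups = {}
    for sample in samples:
        groups.setdefault(key(sample), []).append(sample)
    target = max(len(group) for group in groups.values())
    balanced = []
    for group in groups.values():
        balanced.extend(group)
        # Draw with replacement to fill the gap up to the largest group size.
        balanced.extend(rng.choices(group, k=target - len(group)))
    return balanced


# Toy dataset: (group label, item id), heavily skewed towards group "a".
data = [("a", i) for i in range(5)] + [("b", 0)]
balanced = oversample_to_balance(data, key=lambda s: s[0], rng=random.Random(0))
print(Counter(label for label, _ in balanced))
```

        Duplicates add no new information, so in practice the extra samples would need to be newly collected or generated data.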

        • @[email protected]
          English
          3
          7 months ago

          Indeed - there’s a very good argument for using synthetic data to introduce diversity as long as you can avoid model collapse.

    • KSP Atlas
      1
      7 months ago

      Didn’t someone manage to leak one of the tags into the image once?