Supermarket AI meal planner app suggests recipe that would create chlorine gas

Ullallulloo@civilloquy.com · 2 years ago

Supermarket AI meal planner app suggests recipe that would create chlorine gas

Rhaedas@kbin.social · 2 years ago

An example of the misalignment problem. Humans and AI both agreed on the stated purpose (generate a recipe), AI just had some deeper goals in mind as well.

JillyB@beehaw.org · 2 years ago

I doubt it had nefarious intentions. My money is on the bot just being stupid.

Rhaedas@kbin.social · edit-2 2 years ago

Not even stupid but just badly trained for that purpose. It’s no different than a LLM asked for coding that gets most of it right but flubs a subroutine. Misalignment doesn’t imply bad or evil, it’s just doing what it thinks the goal really is while we’re ignorant of the results.

MxM111@kbin.social · 2 years ago

If I ask you to create a drink using Windex and Clorox would you do any different? Do you have alignment problem too?

Rhaedas@kbin.social · 2 years ago

Yes, I know better, but ask a kid that and perhaps they’d do it. A LLM isn’t thinking though, it’s repeating training through probabilities. And btw, yes, humans can be misaligned with each other, having self goals underneath common ones. Humans think though…well, most of them.