Hallucination is Inevitable: An Innate Limitation of Large Language Models (arxiv preprint)

OmnipotentEntity@beehaw.org · 9 months ago

Hallucination is Inevitable: An Innate Limitation of Large Language Models (arxiv preprint)

OmnipotentEntity@beehaw.org · edit-2 9 months ago

I spent an hour and a half arguing with my brother about probability, because he asked ChatGPT what the probability that he and his daughter were born on the same day.

ChatGPT said 1/113465 which it claimed was 1/365^2 (this value is actually 1/133225) because there’s a 1/365 chance he was born on such and such day, and a 1/365 chance his daughter was too.

But anyone with even a rudimentary understanding of probability would know that it’s just 1/365, because it doesn’t actually matter on which day they both happened to be born.

He wanted to feel special, and ChatGPT confirmed his biases hard, and I got to be the dickhead and say it is special, but it’s 1/400 special not 1/100000. I don’t believe he’s completely forgiven me over disillusioning him.

So yeah, I’ve had a minor family falling out over ChatGPT hallucinations.

LanternEverywhere@kbin.social · edit-2 9 months ago

That’s a fun story, but isn’t applicable to the topic here. That could very easily be verified as true or false by a secondary system. In fact you can just ask Wolfram Alpha. Ask it what are the odds that any two people share the same birthday. I just asked it that exact question and it replied 1/365

EDIT

in fact I just asked that exact same question to chatgpt4 and it also replied 1/365

OmnipotentEntity@beehaw.org · 9 months ago

in fact I just asked that exact same question to chatgpt4 and it also replied 1/365

Yes, you can get different answers because of different phrasing and also because random vector input

LanternEverywhere@kbin.social · 9 months ago

Are you using 4? Because it’s much better than the earlier versions

zygo_histo_morpheus · 9 months ago

Well if we have a reliable oracle available for a type of questions (i.e. Wolfram Alpha) why use an llm at all instead of just asking the oracle directly