Announcing ARC-AGI-3 - A benchmark that tests if AI can explore, learn, and adapt in unfamiliar situations. Humans score 100%. Frontier AI scores 0.26%.

brianpeiris@lemmy.ca · edit-2 12 hours ago

tatterdemalion · edit-2 3 hours ago

LLMs might suck at this game but I’m pretty sure Deepmind’s deep reinforcement learning AI could solve these easily.

EDIT: I know you guys hate AI around here, but you need to at least be aware of what the technology is capable of.

From 11 years ago:

33550336@lemmy.world · 5 hours ago

if only it would exist

Iconoclast@feddit.uk · 2 hours ago

If only…

XLE@piefed.social · 2 hours ago

This is as concrete as Sam Altman saying “AI will actually discover new science”

Iconoclast@feddit.uk · 2 hours ago

They won the Nobel prize for it.

XLE@piefed.social · 2 hours ago

Then say they won the same prize that was awarded to the inventor of the lobotomy, don’t link to a puff piece with an indefensibly bad title

tatterdemalion · 5 hours ago

Wdym? It’s existed for at least a decade. Plenty of papers about it. It mastered Atari and Mario. It became the best Go player.