With the talk of agentic AI, which are generative AI platforms that can control computer software beyond giving text chats, being the future for the AI industry, one of the top agentic AI systems in the world, Anthropic’s Claude, still can’t beat Pokémon on Gameboy Colour.

Anthropic released a thread on X in February admitting that Claude 3.7, its latest model, was not able to play the original Pokémon RPGs on Gameboy to completion for a number of reasons, but also, despite its inability to finish the games made for children, the AI showed chilling human-like processes in attempting to do so.

Claude 3.7 is one of the most advanced agentic AI models out there, with companies like China’s Manus incorporating it into its systems.

  • Bogasse@lemmy.ml
    link
    fedilink
    English
    arrow-up
    4
    ·
    5 days ago

    Can we stop benchmarking text generation models on things they’re not designed to do and start educating people on what they actually can do?

    Oh no we can’t, there’s already hundreds of commercial services…