AI models feeding on AI data will lead to 'model collapse', researchers say

0x815@feddit.de · 2 years ago

artificial_unintelligence · 2 years ago

I don’t think it’s a hard stop, but it is an issue. It will probably force models to be trained in more novel ways rather than by just pumping in more data. Ideally we’d be able to reach GPT-level intelligence with a fraction of the data and compute. Those techniques have yet to be invented, but this will put pressure on their creation.

Pigeon@beehaw.org · 2 years ago

I think that’s a tremendously tall order. Current LLMs are, straightforwardly, large language models: they have zero ability to understand language and only sort it according to statistical patterns that can only be gleaned from a vast heap of data. Reducing the size of any dataset increases the likelihood of bias and blind spots no matter what you do.

At the least, an LLM cannot talk about anything (like news events, new inventions, new political ideas) until humans have talked about it first AND their talking about it has been put into the dataset. If something’s not in the dataset, an LLM simply can’t invent it. At absolute best, it’ll spit out plausible-sounding bullshit.

Inventing actual, truly intelligent AI is a project very far removed from what we have now. It’d take the invention of entirely different systems, not just an iterative improvement on an LLM.
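For what it’s worth, the small-dataset point and the "model collapse" idea from the headline can be illustrated with a toy. This is only a rough sketch under loud assumptions: the "model" here is a one-dimensional Gaussian fit, not an LLM, and `simulate_collapse`, its parameters, and the sample size of 10 are all made up for illustration. Each generation is fitted only to a small sample drawn from the previous generation, so estimation error compounds and the spread of the data tends to shrink away.

```python
import random
import statistics


def simulate_collapse(generations=200, sample_size=10, seed=0):
    """Toy stand-in for a model trained on its own output (not an LLM).

    Returns the fitted standard deviation at every generation.
    """
    rng = random.Random(seed)
    mu, sigma = 0.0, 1.0  # generation 0: the assumed "human" data distribution
    history = [sigma]
    for _ in range(generations):
        # "Generate" a small synthetic dataset from the current model...
        data = [rng.gauss(mu, sigma) for _ in range(sample_size)]
        # ...then "train" the next model on that synthetic data only.
        mu = statistics.fmean(data)
        sigma = statistics.pstdev(data)
        history.append(sigma)
    return history


if __name__ == "__main__":
    history = simulate_collapse()
    print(f"spread at generation 0: {history[0]:.3f}")
    print(f"spread at generation {len(history) - 1}: {history[-1]:.3g}")
```

In this toy, raising `sample_size` slows the shrinkage and lowering it speeds it up, which is at least consistent with the claim that smaller datasets mean more blind spots. It says nothing by itself about how real LLMs behave.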

Will GPT models choke on their own exhaust? | Light Blue Touchpaper