AI models trained on 'synthetic data' could break down and regurgitate unintelligible nonsense, scientists warn

If left unchecked, "model collapse" could make AI systems less useful, and fill the internet with incomprehensible babble.

Abstract spaghetti-like strands to represent a garbled brain in different colours
"Model collapse" could arise if AI models are trained using AI-generated data, scientists have warned, due to "self-damaging feedback loops."
(Image credit: Getty Images/Eugene Mymrin)

Artificial Intelligence (AI) systems could slowly trend toward filling the internet with incomprehensible nonsense, new research has warned. 

AI models such as GPT-4, which powers ChatGPT, or Claude 3 Opus rely on the many trillions of words shared online to get smarter, but as they gradually colonize the internet with their own output they may create self-damaging feedback loops.

Ben Turner
Acting Trending News Editor

Ben Turner is a U.K. based writer and editor at Live Science. He covers physics and astronomy, tech and climate change. He graduated from University College London with a degree in particle physics before training as a journalist. When he's not writing, Ben enjoys reading literature, playing the guitar and embarrassing himself with chess.