Cutting-edge AI models from OpenAI and DeepSeek undergo 'complete collapse' when problems get too difficult, study reveals

A new study by Apple has ignited controversy in the AI field by showing how reasoning models undergo 'complete accuracy collapse' when overloaded with complex problems.

A floating virtual brain surrounded by question marks.
AI reasoning models could have fundamental limitations in their ability to solve problems.
(Image credit: Getty Images)

Artificial intelligence (AI) reasoning models aren't as smart as they've been made out to be. In fact, they undergo total collapse when tasks get too complex, researchers at Apple say.

Reasoning models, such as Anthropic's Claude, OpenAI's o3 and DeepSeek's R1, are specialized large language models (LLMs) that dedicate more time and computing power to produce more accurate responses than their traditional predecessors.

Ben Turner
Acting Trending News Editor

Ben Turner is a U.K. based writer and editor at Live Science. He covers physics and astronomy, tech and climate change. He graduated from University College London with a degree in particle physics before training as a journalist. When he's not writing, Ben enjoys reading literature, playing the guitar and embarrassing himself with chess.

You must confirm your public display name before commenting

Please logout and then login again, you will then be prompted to enter your display name.