AI chatbots oversimplify scientific studies and gloss over critical details — the newest models are especially guilty

More advanced AI chatbots are more likely to oversimplify complex scientific findings based on the way they interpret the data they are trained on, a new study suggests.

Large language models (LLMs) are becoming less "intelligent" with each new version as they oversimplify and, in some cases, misrepresent important scientific and medical findings, a new study has found.

In an analysis of 4,900 summaries of research papers, scientists found that versions of ChatGPT, Llama and DeepSeek were five times more likely than human experts to oversimplify scientific findings.

Lisa D Sparks is a freelance journalist for Live Science and an experienced editor and marketing professional with a background in journalism, content marketing, strategic development, project management, and process automation. She specializes in artificial intelligence (AI), robotics, electric vehicles (EVs) and battery technology, and also has expertise in trends such as semiconductors and data centers.
