Why is DeepSeek such a game-changer? Scientists explain how the AI models work and why they were so cheap to build.

DeepSeek's V3 and R1 models took the world by storm this week. Here's why they're such a big deal.

The DeepSeek logo appears on a smartphone with the flag of China in the background.
DeepSeek is a new artificial intelligence (AI) model from China.
(Image credit: Thomas Fuller/SOPA Images/LightRocket via Getty Images)

Less than two weeks ago, a scarcely known Chinese company released its latest artificial intelligence (AI) model and sent shockwaves around the world.

DeepSeek claimed in a technical paper uploaded to GitHub that its open-weight R1 model achieved comparable or better results than AI models made by some of the leading Silicon Valley giants — namely OpenAI's ChatGPT, Meta’s Llama and Anthropic's Claude. And most staggeringly, the model achieved these results while being trained and run at a fraction of the cost.

Ben Turner
Acting Trending News Editor

Ben Turner is a U.K. based writer and editor at Live Science. He covers physics and astronomy, tech and climate change. He graduated from University College London with a degree in particle physics before training as a journalist. When he's not writing, Ben enjoys reading literature, playing the guitar and embarrassing himself with chess.

You must confirm your public display name before commenting

Please logout and then login again, you will then be prompted to enter your display name.