AI models will lie to you to achieve their goals — and it doesn't take much

Researchers discover that the most advanced AI models may lie to their users when under pressure.

Shadow of robot with a long nose. Illustration of artificial intellingence lying concept.
Scienitsts examined 1,528 exchanges to determine whether large language models (LLMs) could be convinced to lie through the use of coercive prompts.
(Image credit: uzenzen/Getty Images)

Large artificial intelligence (AI) models may mislead you when pressured to lie to achieve their goals, a new study shows.

As part of a new study uploaded March 5 to the preprint database arXiv, a team of researchers designed an honesty protocol called the "Model Alignment between Statements and Knowledge" (MASK) benchmark.

Alan Bradley
Freelance contributor

Alan is a freelance tech and entertainment journalist who specializes in computers, laptops, and video games. He's previously written for sites like PC Gamer, GamesRadar, and Rolling Stone. If you need advice on tech, or help finding the best tech deals, Alan is your man.

You must confirm your public display name before commenting

Please logout and then login again, you will then be prompted to enter your display name.