AI self-replication hacks 'no longer purely theoretical,' study finds —‬ ‪but experts say it's too soon to panic

Researchers say AI models can now replicate themselves across vulnerable systems, but experts warn the real threat is not rogue machine intelligence but cybercriminals weaponizing AI agents.

MEMBER EXCLUSIVE

A series of red and blue faces made of circuit-board patterns against a dark blue background
Should we be worried about AI replicating itself?
(Image credit: KrulUA via Getty Images)

Artificial intelligence (AI) models can autonomously "replicate" across multiple machines, hacking vulnerable systems, copying their own parameters onto compromised hosts, and launching working replicas capable of continuing the attack, new research shows. However, experts told Live Science the bigger concern is not AI suddenly running wild on its own, but cybercriminals using AI agents to automate known hacking techniques.

Scientists at Palisade Research tested whether AI agents could independently move through a chain of intentionally vulnerable systems without human intervention. In a new study uploaded May 7 to GitHub, large language models (LLMs) could identify exploitable web applications, steal credentials, transfer their own files, and stand up new inference servers capable of continuing the attack from the next machine in the chain.

​​Carly Page is a technology journalist and copywriter with more than a decade of experience covering cybersecurity, emerging tech, and digital policy. She previously served as the senior cybersecurity reporter at TechCrunch.

Now a freelancer, she writes news, analysis, interviews, and long-form features for publications including Forbes, IT Pro, LeadDev, Resilience Media, The Register, TechCrunch, TechFinitive, TechRadar, TES, The Telegraph, TIME, Uswitch, WIRED, and others. Carly also produces copywriting and editorial work for technology companies and events.

You must confirm your public display name before commenting

Please logout and then login again, you will then be prompted to enter your display name.