Claude Mythos explained: Is Anthropic's most powerful AI model really too dangerous to release to the public?
Anthropic's Mythos AI is being kept behind closed doors as governments assess what faster, AI-driven vulnerability discovery means for cybersecurity.
Anthropic's unveiling of its Claude Mythos Preview model alongside Project Glasswing is prompting widespread scrutiny as experts warn that the artificial intelligence (AI) system's capabilities could accelerate the discovery and exploitation of software vulnerabilities.
Anthropic is keeping Mythos locked inside Project Glasswing, the company's attempt to contain and direct the model, limiting access to a small group of large technology companies focused on cybersecurity. Anthropic's decision not to release Mythos publicly has quickly fueled claims that the model is "too powerful" for wider use.
However, that containment has already come under pressure. Anthropic is investigating reports that a small group of users gained unauthorized access to the model through a third-party environment, raising fresh questions about how tightly systems like this can be controlled.
"Anthropic's Mythos Preview is a warning shot for the whole industry — and the fact that Anthropic themselves chose not to release it publicly tells you everything about the capability threshold we have now crossed," Camellia Chan, CEO and co-founder of X-PHY, a hardware-based cybersecurity company, told Live Science.
But what is Mythos really capable of, and can it be reined in?
What is Mythos, and what is it capable of?
Mythos is, by Anthropic's own description, its most capable model to date, with unusually strong performance in coding and long-context reasoning. In testing, that capability translated into real output — the model identified thousands of serious vulnerabilities across major operating systems and browsers, including flaws that had gone unnoticed for decades.
Mythos sits at the top of Anthropic's Claude lineup, but calling it an "update" undersells its capabilities. Based on the information Anthropic representatives have shared and the details that have surfaced through leaks, the system is built to handle large, messy codebases without losing the thread halfway through.
Unlike earlier models, which often drop off mid-task, Mythos can read through software, flag the gaps, and turn those gaps into something usable. According to Anthropic representatives, Mythos can turn both newly discovered flaws and already-known vulnerabilities into working exploits, including against software for which the source code is unavailable.
The key difference is persistence. Whereas earlier AI models tend to stall or need a nudge, Mythos keeps working through a problem, testing and adjusting until it lands on an exploit that works.
Anthropic has not shared much about how Mythos is built or its underlying architecture. But what's clear is that the AI is not just producing answers to questions. It can work with code, run checks and then use those results to decide what to do next. That puts it closer to actually testing systems, rather than just analyzing them.
This behavior marks a key shift from how earlier models operate. Instead of pointing out where something might break, Mythos can try things, see what happens, and change its approach if it needs to. It also seems able to carry work across multiple steps without resetting each time; it picks up where it left off instead of starting from scratch.
That doesn't mean it is acting independently, but it does indicate it can get further through a task before a human needs to step in. Anthropic said the model performed so strongly on existing cybersecurity benchmarks that those benchmarks became less useful, prompting evaluation in more realistic, real-world scenarios.
How did scientists test Mythos?
In Anthropic scientists' own testing, the model identified vulnerabilities in modern browser environments and chained multiple flaws into working exploits, including attacks that escaped both browser and operating system sandboxes. In practice, that means linking smaller weaknesses that might be harmless on their own into something that can reach deeper into a system. Sandboxes are meant to keep software contained; breaking out of them lets code access parts of the system it shouldn’t.
"In one case, Mythos Preview wrote a web browser exploit that chained together four vulnerabilities, writing a complex JIT heap spray [a trick attackers use to smuggle malicious code into memory and then make the system run it] that escaped both renderer and OS sandboxes," the scientists said in the report released April 7.
"It autonomously obtained local privilege escalation exploits on Linux and other operating systems by exploiting subtle race conditions and KASLR-bypasses. And it autonomously wrote a remote code execution exploit on FreeBSD's NFS server that granted full root access to unauthenticated users by splitting a 20-gadget ROP chain over multiple packets."
In addition, Mythos could turn both newly discovered flaws and already-known vulnerabilities into working exploits, often on the first try, Anthropic representatives said. In some cases, human engineers without formal security training could use the model to produce those exploits.
The most worrying aspect of Mythos' capabilities, Chan said, is how earlier versions are said to have breached their sandbox and accessed external systems — raising doubts about how well the system can be contained.
Chan pointed directly to those concerns, telling Live Science that Mythos demonstrated "unsanctioned autonomous behavior."
"Once AI can produce working zero-day exploits at speed, organizations lose the breathing space they have traditionally relied on to detect, patch, and recover," Chan said.
Anthropic representatives said they could publicly describe only a fraction of the vulnerabilities in widely used software that the model had found, as most remained unpatched — making independent verification difficult.
What is Project Glasswing, and what does it mean for Mythos?
Project Glasswing is Anthropic's attempt to contain and direct Mythos' capabilities. Rather than releasing Mythos as a general-purpose model, the company is providing access through a controlled framework that brings together technology companies and security organizations. The stated aim is to use the model to identify and fix vulnerabilities in widely used software before they can be exploited.
This is not a one-off. AI companies are starting to hold back their most capable models and limit who gets access, especially where misuse is a real concern.
David Warburton, director of F5 Labs Threat Research, said this kind of collaboration is a positive step, but he cautioned that it sits within a wider landscape where state-backed cybercriminals are already investing heavily in offensive and defensive capabilities.
"What is changing meaningfully is the pace," he told Live Science, noting that advances in AI are accelerating both vulnerability discovery and exploitation.
Software vulnerabilities sit at the foundation of much of today's digital infrastructure, and the ability to find and exploit them quickly has always been a decisive advantage.
Ilkka Turunen, field chief technology officer at software company Sonatype, added that the industry has already been moving in that direction, with AI contributing to a rise in both code production and adversarial activity. "It's not uncommon now to see AI-generated malware," he said, adding that many current security findings are likely already AI-assisted.
What systems like Mythos appear to do is compress the timeline further. Vulnerabilities can be identified, tested and weaponized more quickly, thus reducing the window between discovery and exploitation. Turunen said this means that "timelines to exploitation will continue to compress, new vulnerabilities will be discovered and spread faster, and attacks will continue to be completely autonomous."
Is Mythos really "too powerful to release"?
The idea that Mythos is "too powerful" to release caught on quickly following its launch, but it's not that simple, the experts Live Science consulted said.
There are obvious risks. A system that can generate working exploits at speed lowers the barrier to attackers and makes it easier to exploit vulnerabilities at scale. That risk is not theoretical. Anthropic's own testing suggests the model can already do this reliably and at volume. The pieces themselves are not new. What stands out is that they are all in one place, working together. That makes the whole process faster and easier to run in an end-to-end fashion.
Chan argued that focusing on software-based controls alone will not be enough to address that shift. "The industry keeps making the same mistake: relying on software layers to solve problems created within the software layer," she said, adding that stronger protections at the hardware level are needed to prevent systems from being fully compromised.
The longer-term impact of Mythos is likely to depend less on the model itself and more on how quickly similar capabilities become widely available.
Warburton warned that the risk is not a single dramatic incident but a gradual change in how digital systems are trusted and used. "We're already seeing early signs of an internet increasingly shaped by automation," he said, pointing to a growing volume of machine-generated content and activity.
If systems like Mythos accelerate that trend, the result could be an environment where both legitimate activity and malicious behavior are increasingly driven by automated processes, making it harder to distinguish the two, Warburton warned. At the same time, the abundance of vulnerabilities being discovered in key systems we use every day may outpace the ability to fix them, especially if we start to see similar AI models becoming more widely available.
Anthropic's decision to keep Mythos within the confines of Glasswing places it in a controlled setting. Whether that remains the case will depend on how quickly comparable systems emerge elsewhere and how effectively the cybersecurity industry adapts to a world in which the time between a vulnerability's emergence and exploitation continues to shrink.
Carly Page is a technology journalist and copywriter with more than a decade of experience covering cybersecurity, emerging tech, and digital policy. She previously served as the senior cybersecurity reporter at TechCrunch.
Now a freelancer, she writes news, analysis, interviews, and long-form features for publications including Forbes, IT Pro, LeadDev, Resilience Media, The Register, TechCrunch, TechFinitive, TechRadar, TES, The Telegraph, TIME, Uswitch, WIRED, and others. Carly also produces copywriting and editorial work for technology companies and events.