📡 Breaking news
Analyzing latest trends...

Claude Mythos New Titan Solves 93.9% of Software Engineering Tasks.

Claude Mythos New Titan Solves 93.9% of Software Engineering Tasks.
Anthropic Unveils "Claude Mythos": The Cybersecurity Titan Redefining AI Capabilities

Anthropic has officially introduced its most powerful model to date, Claude Mythos. Engineered for ultra-high-performance tasks, Mythos has shattered existing benchmarks, establishing a new gold standard for Large Language Models (LLMs). Most notably, it achieved a staggering 93.9% on the SWE-bench Verified, outperforming Claude Opus 4.6 by an impressive 13.1%. It also holds a significant lead in other rigorous evaluations, including SWE-bench Pro and Terminal Bench 2.0.

Project Glasswing: Real-World Vulnerability Hunting

To demonstrate the sheer power of Mythos, Anthropic launched Project Glasswing. Backed by a $100 million budget, the initiative invited 40 critical software organizations to utilize Mythos for deep security audits. The results were unprecedented, as the model identified several critical, previously unknown vulnerabilities, such as:

  • OpenBSD: A remote crash vulnerability triggered solely through TCP connections.

  • FFmpeg: An out-of-bounds error within the H.264 decoder.

  • Linux Kernel: A sophisticated privilege escalation vulnerability.

Exclusive Availability and Pricing

Despite its capabilities, Anthropic stated there are currently no plans for a general public release. Access to Claude Mythos is strictly limited to selected enterprise partners. These customers can access the model at a premium price of $25 per million input tokens and $125 per million output tokens via the Claude API, Amazon Bedrock, Google Cloud’s Vertex AI, and Azure AI Foundry.

The success of Project Glasswing proves that AI isn't just about writing code; it possesses a deep understanding of low-level logic. The discovery of vulnerabilities in highly stable systems like OpenBSD or the Linux kernel demonstrates that Mythos can perform autonomous reasoning far more complex than simple pattern recognition.

The price of $25/$125 per million tokens is very high compared to typical models (several times higher than standard models). This reflects Anthropic's positioning of Mythos as a "specialized heavyweight" for tasks that cannot tolerate errors and require extreme intelligence, far exceeding the capabilities of typical chat applications.

Restricted access to selected clients stems from security concerns. A model with such high vulnerability-finding capabilities, if in the hands of malicious actors, could be used to create zero-day exploits to attack global infrastructure.

Analysts speculate that the name "Mythos" refers to a new architecture that may utilize long-term planning or recursive self-correction technologies. Improvements over the Opus version allow it to "dig" deep into large code databases (repo-scale) without losing context.

 

The MacBook Neo Dilemma Why Selling Too Many Units is Hurting Apple Bottom Line.

 

Source: Anthropic 

💬 AI Content Assistant

Ask me anything about this article. No data is stored for your question.

Comments

Popular posts from this blog

Google Vids Goes Pro Veo 3.1 and Lyria 3 AI Tools Now Available for Free Users.

Anthropic has requested the deletion of leaked data sent via an NPM package.

Microsoft to Block Legacy Drivers in Windows 11 A Major Security

Studio Display XDR Gets a $400 Price Cut for VESA Configurations.

Sony Acquires Cinemersive Labs to Revolutionize 3D Graphics for PlayStation.

Microsoft AI Unleashes MAI High-Speed Voice and Image Models Now Live.

Raspberry Pi Hits Second Price Hike of 2026 16GB Models Jump by $100.