📡 Breaking news
Analyzing latest trends...

Claude Mythos New Titan Solves 93.9% of Software Engineering Tasks.

Claude Mythos New Titan Solves 93.9% of Software Engineering Tasks.
Anthropic Unveils "Claude Mythos": The Cybersecurity Titan Redefining AI Capabilities

Anthropic has officially introduced its most powerful model to date, Claude Mythos. Engineered for ultra-high-performance tasks, Mythos has shattered existing benchmarks, establishing a new gold standard for Large Language Models (LLMs). Most notably, it achieved a staggering 93.9% on the SWE-bench Verified, outperforming Claude Opus 4.6 by an impressive 13.1%. It also holds a significant lead in other rigorous evaluations, including SWE-bench Pro and Terminal Bench 2.0.

Project Glasswing: Real-World Vulnerability Hunting

To demonstrate the sheer power of Mythos, Anthropic launched Project Glasswing. Backed by a $100 million budget, the initiative invited 40 critical software organizations to utilize Mythos for deep security audits. The results were unprecedented, as the model identified several critical, previously unknown vulnerabilities, such as:

  • OpenBSD: A remote crash vulnerability triggered solely through TCP connections.

  • FFmpeg: An out-of-bounds error within the H.264 decoder.

  • Linux Kernel: A sophisticated privilege escalation vulnerability.

Exclusive Availability and Pricing

Despite its capabilities, Anthropic stated there are currently no plans for a general public release. Access to Claude Mythos is strictly limited to selected enterprise partners. These customers can access the model at a premium price of $25 per million input tokens and $125 per million output tokens via the Claude API, Amazon Bedrock, Google Cloud’s Vertex AI, and Azure AI Foundry.

The success of Project Glasswing proves that AI isn't just about writing code; it possesses a deep understanding of low-level logic. The discovery of vulnerabilities in highly stable systems like OpenBSD or the Linux kernel demonstrates that Mythos can perform autonomous reasoning far more complex than simple pattern recognition.

The price of $25/$125 per million tokens is very high compared to typical models (several times higher than standard models). This reflects Anthropic's positioning of Mythos as a "specialized heavyweight" for tasks that cannot tolerate errors and require extreme intelligence, far exceeding the capabilities of typical chat applications.

Restricted access to selected clients stems from security concerns. A model with such high vulnerability-finding capabilities, if in the hands of malicious actors, could be used to create zero-day exploits to attack global infrastructure.

Analysts speculate that the name "Mythos" refers to a new architecture that may utilize long-term planning or recursive self-correction technologies. Improvements over the Opus version allow it to "dig" deep into large code databases (repo-scale) without losing context.

 

The MacBook Neo Dilemma Why Selling Too Many Units is Hurting Apple Bottom Line.

 

Source: Anthropic 

💬 AI Content Assistant

Ask me anything about this article. No data is stored for your question.

Comments

Popular posts from this blog

Anthropic Secures $65 Billion Series H to Lock Down Global Chip Supply.

YouTube Deploys Automated Scanners to Flag AI Video Uploads Hardcoding Labels Onto Titles and Shorts.

Elon Musk Amends $15 Billion Anthropic Data Center Deal Shaking Valuation Models.

Steam Deck OLED crosses $900

Autodesk Bought MaintainX to Unleash Industrial Predictive AI.

Asana Acquires StackAI for $75 Million to Bring No-Code AI Agents to Enterprise Workspaces.

Anthropic Project Glasswing Uncovers 23,000 Open-Source Flaws Exposing a Massive AI Patching Crisis.