Claude Mythos(Opus 5) Leaked: What happened and What to expect

As of March 29, 2026, the “Claude Mythos” story is less about a finished public launch and more about a leaked preview of what looks like Anthropic’s next big step. Thecompany accidentally exposed draft blog content in a publicly searchable data cache, revealing an unreleased model that Anthropic described as a “step change” and “the most capable we’ve built to date.” Anthropic confirmed it is developing and testing the model with a small group of early access customers.

That matters because Anthropic’s current public model lineup still centers on Claude Opus 4.6, Claude Sonnet 4.6, and Claude Haiku 4.5. In other words, the leak is not a confirmed public product launch; it is a leaked glimpse of the next tier Anthropic may be preparing.

Currently, CometAPI already provides APIs for cutting-edge Claude models, such as Claude Opus 4.6 and Claude Sonnet 4.6. Once Claude Mythos is available on CometAPI, you can perform comparative tests against top models from Gemini and OpenAI. CometAPI aggregates the best models.

What Is Claude Mythos?

Claude Mythos is Anthropic’s most advanced AI model to date, described in leaked internal documents as “by far the most powerful AI model we’ve ever developed.” It introduces a new performance tier—internally referred to as “Capybara”—that sits above the company’s existing Opus lineup, which until now represented the pinnacle of Claude’s capabilities.

Anthropic’s current model family follows a clear hierarchy:

Opus: Largest, most capable, and most expensive (e.g., Claude Opus 4.6 and the earlier Opus 4.5 released in November 2025).
Sonnet: Balanced speed and intelligence.
Haiku: Fastest and most cost-effective for lightweight tasks.

Mythos/Capybara breaks this mold as a significantly larger, more compute-intensive model. Draft blog posts explicitly state it is “larger and more intelligent than our Opus models—which were, until now, our most powerful.” The name “Mythos” was chosen to evoke “the deep connective tissues that link together knowledge and ideas,” signaling deeper, more integrated reasoning across domains.

This is not a minor incremental update. Anthropic’s spokesperson confirmed that the company is “developing a general purpose model with meaningful advances in reasoning, coding, and cybersecurity” and considers it “a step change and the most capable we’ve built to date.” Training is complete, and the model is already undergoing real-world testing with a small group of early-access customers.

For context, Claude’s evolution has been rapid. Claude 3 Opus (2024) set early benchmarks, followed by Claude 3.5 Sonnet, Claude 4 variants, and Opus 4.5/4.6 in 2025. Mythos appears to be the logical successor—potentially what the community has speculated as “Opus 5”—pushing frontier AI into new territory while raising serious safety questions.

How Was Claude Mythos Leaked?

The leak occurred on or around March 27, 2026, due to a straightforward but embarrassing human-error misconfiguration in Anthropic’s content management system (CMS). Nearly 3,000 unpublished assets—including draft blog posts, images, PDFs, audio files, and even internal documents—were left in a publicly searchable data store (sometimes called a “data lake”).

Assets were set to “public” by default, with guessable URLs. Security researchers Roy Paz (LayerX Security) and Alexandre Pauwels (University of Cambridge) discovered the cache and alerted media outlets.

Leaked materials included:

Two near-identical draft blog posts (one titled for “Claude Mythos,” the other “Claude Capybara”).
Structured web-page data with headings and a planned publication date.
Unused marketing assets from past launches.
An internal PDF about an invite-only CEO retreat hosted by Anthropic CEO Dario Amodei.

Anthropic quickly confirmed the incident as “human error” in CMS configuration and removed public access. No evidence suggests malicious intent or a breach of model weights—only marketing and planning documents were exposed.

This event highlights a growing vulnerability in the AI industry: rapid iteration and internal documentation often outpace secure publishing workflows. Similar leaks have occurred at other labs, but this one provided unusually detailed insight into an unreleased flagship model.

Leaked Benchmark Scores and Performance Claims

Exact numerical scores were not disclosed in the leaked drafts—Anthropic has not published official benchmarks yet. However, the language is unambiguous and consistent across both draft versions:

“Compared to our previous best model, Claude Opus 4.6, Capybara gets dramatically higher scores on tests of software coding, academic reasoning, and cybersecurity, among others.”

The model is further described as “currently far ahead of any other AI model in cyber capabilities” and one that “presages an upcoming wave of models that can exploit vulnerabilities in ways that far outpace the efforts of defenders.”

What do these benchmark categories actually measure?

Software Coding (e.g., SWE-Bench Verified, HumanEval, LiveCodeBench): Real-world software engineering tasks, including bug fixing, feature implementation, and repository-level understanding. Opus 4.6 already led in many coding leaderboards; a “dramatic” jump here would mean Mythos could autonomously handle complex, multi-file codebases that currently require senior engineers.
Academic Reasoning (e.g., GPQA, MMLU-Pro, MATH, FrontierMath): Graduate-level science, math, and multi-step logical problems. Improvements here signal stronger chain-of-thought reasoning and knowledge synthesis.
Cybersecurity: Vulnerability discovery, exploit generation, red-teaming simulations, and defensive hardening. This is the most emphasized area—and the most concerning.

While prior Claude models (Opus 4.5/4.6) achieved strong results—e.g., Opus 4.5 scored ~80.9% on SWE-Bench Verified—the leaked claims position Mythos in a qualitatively different league.

Model Characteristics and Technical Profile

Beyond benchmarks, the drafts reveal several defining traits:

Scale and Cost: “Very expensive for us to serve, and will be very expensive for our customers to use.” This implies a massive parameter count and high inference costs, limiting initial availability to enterprise and high-value use cases.
Reasoning Depth: Emphasis on “deep connective tissues” between knowledge domains suggests superior long-context understanding and cross-domain synthesis.
Agentic Capabilities: Early access appears targeted at organizations needing advanced coding agents and cybersecurity tools.
Safety-First Philosophy: Consistent with Anthropic’s constitutional AI approach, the company is prioritizing risk assessment—especially in cybersecurity—before broader release.

Cybersecurity Implications: The Biggest Red Flag

The most striking element of the leak is Anthropic’s own warning about the model’s dual-use potential. By being “far ahead” in cyber capabilities, Mythos could:

Autonomously discover zero-day vulnerabilities.
Generate sophisticated exploit code at scale.
Simulate advanced persistent threats (APTs) faster than human defenders can respond.

The draft explicitly states the company wants to “act with extra caution” and share findings with cyber defenders to prepare for “an impending wave of AI-driven exploits.”

Market reaction was immediate: cybersecurity stocks plunged on March 27-28, 2026, as investors priced in the risk that offensive AI capabilities could outpace defensive tools.

This aligns with broader industry trends. OpenAI has similarly flagged high cyber capabilities in models like GPT-5.3-Codex. Real-world incidents already show state actors (e.g., a Chinese group) using Claude variants for infiltration campaigns. Mythos would supercharge such threats.

Positive side: Early access to defensive organizations could accelerate secure coding practices, automated patching, and threat hunting—potentially making the internet safer in the long term.

Comparison Table: Claude Mythos vs. Previous Models

Aspect	Claude Opus 4.6 (Current Flagship)	Claude Mythos / Capybara (Leaked)	Key Takeaway
Tier	Opus	New “Capybara” tier (above Opus)	Major architecture leap
Coding Performance	Strong (e.g., ~80.9% SWE-Bench)	Dramatically higher	Potential to rival or exceed senior engineer productivity
Academic Reasoning	Excellent	Dramatically higher	Deeper multi-step logic and knowledge integration
Cybersecurity	Capable (vulnerability detection)	Far ahead of any current model	Qualitative leap; raises dual-use risks
Inference Cost	High (Opus pricing)	Very expensive (even higher)	Enterprise-only initially
Release Status	Generally available	Early-access testing only	Deliberate, safety-focused rollout
Overall Capability	State-of-the-art 2025	“Step change” / “Most powerful ever”	New frontier benchmark

Conclusion: A Leaked Glimpse into the Next AI Era

The Claude Mythos leak offers a rare, unfiltered look at Anthropic’s roadmap. It confirms the company has achieved a genuine “step change” in core capabilities while simultaneously acknowledging the profound risks—particularly in cybersecurity—that come with such power. Whether labeled Opus 5 or a new Capybara tier, Mythos signals that frontier AI is entering a phase where capabilities outpace safe deployment timelines.

Ready to experience CometAPI? You can become our user first and get a free $1 credit, and receive notifications when Claude Mythos goes live.