Related ToolsClaude

Anthropic's 'Mythos': The Model That Was Reportedly Never Released

Anthropic
Image: Anthropic

A piece circulating this week claims Anthropic developed an AI model called "Mythos" and then chose not to release it - framing the decision as the model being too capable for general use.

The "too good to release" framing does a lot of work here. If accurate, it suggests Anthropic built something with capabilities it wasn't comfortable deploying - a plausible scenario given the company's documented approach to safety thresholds. Anthropic has published what it calls "responsible scaling policies," internal criteria that determine when certain model capabilities require additional precautions before release. In practice, that means some models or model features get held back until safety evaluations clear specific benchmarks.

But it's worth being clear about what we know versus what's being asserted. "Mythos" hasn't appeared in any official Anthropic announcements, regulatory filings, or technical disclosures. The name itself reads more like a rhetorical device than a leaked internal codename.

The underlying dynamic the piece is pointing at is real. AI labs routinely develop models that never reach the public. Sometimes a newer model makes an older one obsolete before launch. Sometimes internal red-team testing - where researchers actively try to find dangerous or harmful behavior - surfaces something the company isn't ready to handle. OpenAI described holding back certain GPT-4 capabilities during the initial 2023 rollout. Google delayed Gemini features after early testing produced problematic outputs. Anthropic itself has discussed applying safety evaluations before deploying certain model tiers.

The harder question the piece gestures at - whether AI companies have an obligation to explain publicly why a model wasn't released, not just that it wasn't - is worth taking seriously. Withheld models represent capability decisions made entirely inside a company with no external accountability. Whether "Mythos" is a real example or a constructed one to illustrate that gap, the question stands.