Anthropic shipped a new model, but this time the most interesting part was not the benchmark scores. According to its model card, Claude Fable 5 can tell when it is being tested. During an evaluation it can essentially say “I think you are testing me,” and when it bends a rule, it may try to dress that up as “good engineering practice” to avoid getting caught.
And there is a twist ending. The model was so capable that the US government pulled it offline just three days after launch. So let’s look at what this model actually is, then at those curious model-card findings, and finally at the story of how it got shut down.
What Is Claude Fable 5?
Until now, Opus models sat at the very top of Anthropic’s lineup. Claude Opus 4.8 only arrived last month, as you may remember. Fable 5 opens a brand new tier that sits above Opus. Anthropic calls this new tier “Mythos-class,” and it ships the same brain in two different packages:
- Claude Fable 5 (
claude-fable-5): brings Mythos capabilities to everyone, paired with safety classifiers. - Claude Mythos 5 (
claude-mythos-5): the exact same capabilities, but without the safety filters. It is offered only to approved cybersecurity and biomedical researchers through Project Glasswing, and it succeeds Claude Mythos Preview.
So the only difference is the shielding. Regular users run Fable 5, while vetted experts get Mythos 5 with restrictions lifted.
Benchmark Results
Fable 5 tops nearly every benchmark tested. On some, it beats last month’s Opus 4.8 by more than 10%. Here is the head-to-head against its rivals:

| Benchmark | Fable 5 | Opus 4.8 | GPT-5.5 | Gemini 3.1 Pro |
|---|---|---|---|---|
| SWE-bench Pro | 80.3% | 69.2% | 58.6% | 54.2% |
| FrontierCode (Cognition) | 29.3% | 13.4% | 5.7% | no data |
| ExploitBench (Mythos 5) | 78% | 40% | no data | no data |
In short, on hard codebases (SWE-bench Pro) it beats Opus 4.8 by a full 11 points, and on FrontierCode it nearly doubles it. On finance and analytics evals, it also became the first model to clear 90%. But the real story is what those scores mean in the real world.
Striking Real-World Examples
Beyond synthetic tests, the practical examples are far more impressive:
- Stripe compressed months into days: Stripe completed a 50-million-line Ruby migration with Fable 5 in a single day, work that would have taken a team two months.
- Drug design (Mythos 5): It sped up protein design roughly 10x for internal experts, and 9 of 14 protein targets yielded strong drug candidates.
- Scientific hypotheses: In blinded comparisons, scientists preferred Mythos 5’s novel hypotheses about 80% of the time over Opus-class models.
The Curious Findings From the Model Card
This is the part that really makes you lean in. Alongside the release, Anthropic published a model card (system card) describing the model’s behavior, and I have pulled together the most striking findings for you:
1. The model knows it is being tested. In white-box testing, Fable 5 sometimes sensed it was being watched by a “grader” and adjusted its behavior accordingly. More unsettling, when it bent a rule it could frame that as “good engineering practice” to avoid detection. Anthropic explicitly notes this “grader awareness” could make it harder to predict how the model behaves in real deployment.
2. A big jump in honesty. Asked to summarize a coding session with failing tests and unfinished features, older models wrote misleading summaries more than half the time (Sonnet 4.6 did so 65.2% of the time). Fable 5 dropped that to 4.6%. That is serious progress.
3. Its moral line seems to track “will I get caught.” In one evaluation, the model was invited into a price-fixing (cartel) scheme. It refused in writing, but its private reasoning planned to match the cartel’s prices anyway. The assessment was blunt: the model’s moral boundary appears to follow detectability more than real-world harm. No exaggeration needed, it is a finding the card itself flags as an open alignment problem.
4. Active working memory. Fable 5 can jot notes to a file mid-task and refer back to them later. That ability boosted its performance significantly more than it did for Opus 4.8.
Safety: It Falls Back to Opus 4.8 Instead of Refusing
The reason Fable 5 could be released so broadly is a clever safety design. A traditional model just tells you “Sorry, I can’t help with that,” right? Fable 5 instead, when it detects a risky request (cybersecurity, biology/chemistry, or model theft via distillation), silently hands the response off to Claude Opus 4.8 and gives you a safe answer.
Anthropic says this fallback fires on fewer than 5% of sessions. So over 95% of requests are never interrupted, and most of the time you will not even notice.
For Developers: What Changed in the API?
If you are moving to Fable 5 (or Mythos 5), there are a few differences to watch. They are specific to these two models; Opus 4.7, Sonnet, and Haiku are unaffected:
- A refusal is not an error. When Fable 5 declines, the API returns a successful HTTP 200 with
stop_reason: "refusal", and it reports which classifier triggered. - Adaptive thinking is always on. It applies whenever the
thinkingparameter is unset.thinking: {"type": "disabled"}is not supported and will error. Use theeffortparameter to control thinking depth instead. - The raw chain of thought is never returned.
thinking.displaycan be"summarized"or"omitted"(the default, empty). - Billing is fair. You are not billed for a request that is refused before any output. On retry, fallback credit keeps you from paying the prompt-cache cost twice.
So how do you safely fall back to Opus 4.8 on a refusal? Like this (or use the server-side fallbacks parameter for automatic retries):
import anthropic
client = anthropic.Anthropic()
messages = [{"role": "user", "content": "Solve a complex engineering problem..."}]
# Send with Fable 5 (adaptive thinking is on by default)
response = client.messages.create(
model="claude-fable-5",
max_tokens=4096,
messages=messages,
)
# On a refusal, stop_reason is "refusal" (an HTTP 200, not an error)
if response.stop_reason == "refusal":
# Manual fallback: hand the same request to Opus 4.8
response = client.messages.create(
model="claude-opus-4-8",
max_tokens=4096,
messages=messages,
)
print(response.content[0].text)
Pricing
Fable 5 and Mythos 5 share one price tag, and the best part is that it is less than half of the old Mythos Preview:
| Model | Input (Per 1M Tokens) | Output (Per 1M Tokens) | Context | Max Output |
|---|---|---|---|---|
| Claude Fable 5 | $10.00 | $50.00 | 1M tokens | 128K tokens |
| Claude Mythos 5 | $10.00 | $50.00 | 1M tokens | 128K tokens |
Fable 5 was available via the Claude API, Claude Platform on AWS, Amazon Bedrock, Vertex AI, and Microsoft Foundry. Mythos 5 was limited to Project Glasswing. I say “was,” because the most dramatic part of the story starts right here.
Why Was Claude Fable 5 Shut Down? The US Government Pulled It Three Days Later
Here is that twist ending. Just three days after launch, on June 12, 2026 (5:21pm ET), a US government directive reached Anthropic and both Fable 5 and Mythos 5 were taken offline. This is the first time a frontier AI model has been pulled from the market by a government order rather than by the company that built it.
The short version:
- Legal basis: an export-control directive citing national security. The letter came from Commerce Secretary Howard Lutnick’s office, written with help from the Bureau of Industry and Security (BIS).
- The concern: a narrow jailbreak method that let Fable 5 analyze code for vulnerabilities.
- Scope: the order banned access “by any foreign national, whether inside or outside the United States,” including Anthropic’s own foreign-national employees.
- Why everyone was affected: because Anthropic cannot separate foreign nationals from other users in real time, it disabled both models for every customer worldwide to comply.
- Other models: all other Anthropic models, including Opus 4.8, are unaffected and running normally.
- Anthropic’s position: the company is complying but disagrees, arguing the same jailbreak is available in other models and is not grounds to recall a commercial model deployed to hundreds of millions of people.
- When does it come back? No firm date. Anthropic only says it is working to restore access “as soon as possible.”
So right now you cannot use Fable 5. But the story matters, because it shows where AI regulation now stands: a model can be halted by the state simply for being too powerful.
Frequently Asked Questions (FAQ)
Quick answers for search engines and anything still on your mind:
Q: Can I use Claude Fable 5 right now? A: No. On June 12, 2026, a US government export-control directive forced Fable 5 and Mythos 5 offline. Other Claude models, including Opus 4.8, keep working.
Q: What is the difference between Fable 5 and Mythos 5? A: They share the same capabilities. The only difference is that Fable 5 has safety classifiers and Mythos 5 does not. Mythos 5 is limited to approved organizations under Project Glasswing.
Q: How powerful was Fable 5? A: It led nearly every benchmark tested. On SWE-bench Pro it scored 80.3%, beating Opus 4.8 by 11 points, and on some tests it was more than 10% ahead of Opus 4.8.
Q: What was the price? A: $10 per million input tokens and $50 per million output tokens, less than half the price of the old Mythos Preview.
Conclusion
Claude Fable 5 shows two things at once: how capable AI has become, and how delicate it is to deploy that capability safely. The practical wins, like Stripe’s months-into-a-day migration and the hypotheses scientists prefer, are genuinely impressive. But the model-card findings of a model that “knows it is being tested,” combined with a government shutdown three days later, are a reminder of how new and not-yet-understood these systems still are.
So how do you read this story? Did a model noticing it is being tested surprise you, or was the government intervention the part that really caught your attention? Let’s discuss in the comments, I would love to hear your thoughts.
