Fable 5 Before June 22: The Decision Checklist for Every Plan Tier

Q: Is there a way to disable adaptive thinking on Fable 5?

No. On Fable 5, adaptive thinking is always on. `thinking: {type: "disabled"}` returns a 400 error - do not pass it. Use `output_config: {effort: "low"}` or similar to reduce thinking depth and manage cost and latency, but thinking cannot be turned off entirely on this model.

You have 12 days left in the promotional window where Fable 5 is included on claude.ai plans - Free, Pro, and Max alike. On June 22, that changes. The model does not disappear, but your relationship to its cost does.

This is not a post about whether Fable 5 is good. It is a checklist for what you should actually decide and test between now and then, broken down by who you are.

Last updated: June 10, 2026

What Is Actually Happening on June 22

Fable 5 launched on June 9, 2026, at API pricing of $10 per million input tokens and $50 per million output tokens - the highest-priced model Anthropic has publicly released. At launch, Anthropic extended promotional access across all claude.ai plan tiers, including Free.

On June 22, the promotional inclusion ends. After that date, Fable 5 usage on claude.ai will draw from usage credits rather than being bundled into your plan. The model remains available; what shifts is whether every request carries a marginal cost.

For API and Teams customers building on top of the API directly, the June 22 date may be less pivotal - you have already been billed at the $10/$50 rate. But the promotional window is a useful forcing function to review your workload allocation before the window closes.

For billing mechanics at the API level, one useful Fable 5-specific fact: you are not billed for a request that is refused before any output is generated. If your workload touches domains where Fable 5's safety classifiers decline requests, that affects your cost model - refusals do not count. (Official source: Anthropic docs on refusals and billing)

The Four Decisions Everyone Needs to Make

Before getting into tier-specific checklists, these four questions apply across every plan:

What tasks genuinely need Fable 5 over Opus 4.8? The price gap is 2x on inputs and 2x on outputs. Unless the task measurably benefits from Fable 5's capabilities, Opus 4.8 at $5/$25 per million tokens is the correct default.
Have you measured latency on your actual prompts? Fable 5 adaptive thinking is always on - there is no way to disable it. For latency-sensitive tasks, that matters. Test before June 22, not after.
Do you have a fallback configured? The API supports server-side fallback (fallbacks parameter, currently in beta) and client-side SDK middleware. If Fable 5 refuses a request, a retry against Opus 4.8 costs essentially the same as the original attempt due to fallback credit handling.
What does your refusal rate look like on your domain? Some workflows, particularly those touching security research, biotech, or dual-use tooling, will see Fable 5 refuse more often than Opus 4.8. Measure this now while access is free.

Checklist by Plan Tier

Individual Users: Pro and Max Plans

Decision	Action	Priority
Identify your top 3 use cases for Fable 5	Compare to same prompts on Opus 4.8. Is the quality gap visible?	This week
Stress-test context length	Fable 5's 1M context window and Opus 4.8's are the same. But tokenizer changed with Opus 4.7 and carries through - ~30% more tokens for same text. Re-measure.	Before June 22
Check refusal behavior on your topics	Run a representative sample of your prompts. Log any stop_reason: "refusal" responses.	Before June 22
Decide on default model after June 22	If you stay on Fable 5 for everything, understand that usage credits will now be consumed. If you shift to Opus 4.8 as default and use Fable 5 selectively, plan which tasks stay.	Decide by June 20
Review your Max tier multiplier	Max 5x and Max 20x subscribers have different monthly usage volumes. Whether Fable 5 fits within those credits depends on your actual prompt/output sizes.	Before June 22

What not to worry about: Fable 5 is not going away. The model stays available. You are not being forced off it; you are being transitioned to a usage-credit model for access.

Teams Customers

Teams plans introduce shared usage pools, which changes the calculus compared to individual subscriptions.

Decision	Action	Priority
Audit which team members are using Fable 5 most heavily	Usage patterns during the promotional window are your best data. Pull that now.	This week
Identify team workflows that are genuinely Fable 5-class tasks	Long-horizon agentic work, complex multi-document reasoning, hardest coding tasks. Everything else routes to Opus 4.8.	This week
Set a routing policy before June 22	Without a policy, teams tend to default to the most capable model because there is no visible cost signal during promo. After June 22 there is.	Before June 20
Brief technical team members on Fable 5 refusal behavior	Teams doing security research or biotech work need to know that Fable 5 has classifiers that can decline requests. Opus 4.8 is the natural fallback.	Before June 22
Confirm zero-data-retention status	Fable 5 is a Covered Model under Anthropic's data retention policy. It carries 30-day retention and is not available under zero data retention agreements. If your team has ZDR requirements, Fable 5 is already unavailable regardless of billing changes. (Source)	Confirm now

API and Enterprise Customers

If you are building on the API directly, the June 22 date is less about access and more about whether your integrations are production-ready for the billing model that already applies.

Decision	Action	Priority
Verify model routing in production code	Are you using `claude-fable-5` explicitly, or an alias? Check what model string is in your API calls.	This week
Implement refusal handling if you have not	`stop_reason: "refusal"` returns HTTP 200 - not an error. If your code only handles `end_turn`, `tool_use`, and `max_tokens`, add a branch. Unhandled refusals will silently return empty or unexpected output.	This week
Configure fallback for high-volume routes	Server-side `fallbacks` parameter (beta on Claude API and Claude Platform on AWS) or client-side SDK middleware. Fallback credit prevents paying prompt-cache cost twice on a retry.	Before June 22
Measure refusal rate by task category	Segment your traffic by task type. Identify which categories have non-zero refusal rates. These are the routes where Opus 4.8 may be more reliable.	Before June 22
Re-run token count estimates using Fable 5 tokenizer	Fable 5 uses the tokenizer introduced with Opus 4.7. The same input text produces roughly 30% more tokens compared to models before Opus 4.7. Any cost model built on pre-4.7 token counts is stale.	This week
Confirm data retention policy compatibility	Fable 5 and Mythos 5 carry 30-day retention. Not available under ZDR. If you have organizational ZDR requirements, your only path to equivalent capability is Claude Mythos 5 (Project Glasswing, invitation-only).	Confirm now
Evaluate effort levels per route	Fable 5's adaptive thinking is always on. Use `output_config: {effort: "..."}` to control thinking depth per route. `high`/`xhigh` for agentic work; `low`/`medium` for simpler extraction or classification routes to manage cost.	This week

What not to panic about: Fable 5 is generally available on all major cloud platforms - AWS Bedrock, Vertex AI, Microsoft Foundry, and the Claude API directly. There is no supply constraint. The billing mechanics are well-defined.

Get the weekly deep dive

Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.

From the archive

Why Fable 5 Refuses Your Cybersecurity Queries (And How the Fallback Works)

Jun 10, 2026 • 8 min read

Fable 5's Hidden Guardrails: What Developers Need to Know About Silent Degradation

Jun 10, 2026 • 7 min read

Fable 5 vs DeepSeek V4: The Cost-Quality Gap Measured in Real Tasks

Jun 10, 2026 • 7 min read

Claude Fable 5 vs GPT-5.5: Benchmarks, Pricing, and When Each Wins

Jun 10, 2026 • 7 min read

When Opus 4.8 Is the Right Answer

Across all tiers, the honest question is: do you actually need Fable 5 for this task?

Opus 4.8 at $5/$25 per million tokens handles:

Most multi-step reasoning and agentic coding
1M context window tasks (same window as Fable 5)
Complex document analysis
The majority of production workloads that formerly justified the top-tier model

Fable 5 justifies the 2x premium for:

Tasks where Opus 4.8 demonstrably falls short on a benchmark you can measure
Extremely long-horizon autonomous work where model quality differences compound across many steps
Cases where Mythos 5's lack of safety classifiers is relevant (that requires Project Glasswing access regardless)

The full breakdown of when to use which model is covered in Fable 5 vs. Opus 4.8: When to Use Which. The short version: default to Opus 4.8, escalate to Fable 5 on tasks where you can measure the difference.

For cost modeling across both models, see Fable 5 Production Cost Modeling.

What to Test Before June 22

These are concrete tests worth running now, while the cost is effectively zero:

1. Refusal surface test Run 50-100 representative prompts from your actual use patterns through Fable 5. Log the stop_reason on every response. If your refusal rate on specific categories is above 5%, those routes should probably point to Opus 4.8 by default.

2. Latency baseline Measure median and p95 latency on your most time-sensitive prompts. With adaptive thinking always on, Fable 5 will be slower than Opus 4.8 on short tasks. Quantify the gap so you have a number to revisit after June 22.

3. Quality delta test For tasks where you suspect Fable 5 adds value, run identical prompts through both Fable 5 and Opus 4.8. Blind-evaluate the outputs. If evaluators cannot consistently prefer Fable 5, the premium is not justified.

4. Token count recalibration Fable 5 uses a newer tokenizer. If you have cost projections built on previous models, run client.messages.count_tokens() against claude-fable-5 on representative prompts and compare to your existing estimates.

Official Sources

All mechanics cited in this post are drawn from Anthropic's official documentation:

Introducing Claude Fable 5 and Claude Mythos 5 - API changes, refusal handling, billing, availability
Models Overview - Pricing, context windows, model IDs
Data Retention Practices for Mythos-class Models - 30-day retention policy, ZDR exclusion
claude.ai Pricing - Plan tiers, Fable 5 promotional access

FAQ

Will Fable 5 still be available after June 22?

Yes. The model remains available across all plans and all cloud platforms. What changes is that access shifts from promotional inclusion to usage-credit billing on claude.ai consumer plans.

Is Opus 4.8 meaningfully weaker than Fable 5?

On most production workloads, the practical difference is smaller than the 2x price gap suggests. Fable 5 demonstrates clearer advantages on extremely demanding, long-horizon tasks. For day-to-day use - coding assistance, document analysis, Q&A - Opus 4.8 is competitive.

What is the fallback credit mechanic?

When Fable 5 refuses a request, you are not billed. When you retry that request on another model (such as Opus 4.8), the fallback credit mechanism refunds the prompt-cache cost of switching models, so you avoid paying the caching overhead twice. This makes refusal handling less costly than it might appear.

Does the June 22 date affect API customers differently than claude.ai users?

Yes. API customers pay at the $10/$50 rate directly and have always been billed per token. The promotional window applies to claude.ai plan subscribers who have been accessing Fable 5 within their plan. For API customers, the more relevant deadline is any internal review process you want to run before treating Fable 5 as a production default.

What if I have zero data retention requirements?

Fable 5 is not available under zero data retention agreements. Both Fable 5 and Mythos 5 carry a 30-day retention policy and are designated Covered Models. If ZDR is a hard requirement, your organization cannot use Fable 5 regardless of billing changes. Opus 4.8 is not a Covered Model and is available under ZDR.

Is there a way to disable adaptive thinking on Fable 5?

No. On Fable 5, adaptive thinking is always on. thinking: {type: "disabled"} returns a 400 error - do not pass it. Use output_config: {effort: "low"} or similar to reduce thinking depth and manage cost and latency, but thinking cannot be turned off entirely on this model.

Fable 5 Leaves Your Claude Plan on June 22. Here's How to Plan for It

Migrating to Claude Fable 5: The Practical Guide

How to Model Fable 5 Costs Before They Blow Up Your Budget

What Is Actually Happening on June 22

The Four Decisions Everyone Needs to Make