TL;DR
Rewriting prompts and skills for Fable 5: what changes when you migrate agents from Opus 4.x, how effort interplay works, and which old workarounds now hurt.
Read next
Task budgets give Claude a token countdown for the whole agentic loop, so the model paces itself instead of discovering the limit when max_tokens truncates it. Here is how the beta works on Fable 5, what it does not enforce, and where it fits next to effort and the Usage API.
8 min readFable 5 posts an 80.3% SWE-Bench Pro score and costs 2x Opus 4.8 - here is the task-profile scoring guide that tells you when the premium pays off.
7 min readA practical playbook for running Claude Fable 5 as the orchestrator over Sonnet and Haiku workers, with verified cost math on when the premium pays off.
10 min readLast updated: June 11, 2026
Swapping claude-opus-4-8 for claude-fable-5 takes thirty seconds. Anthropic's own migration guide calls the move "mostly drop-in" at the API level. The actual work is everything on top of the API: system prompts tuned to push Opus into action, skills written as step-by-step procedures, scaffolding that forced progress updates and suppressed subagents. Anthropic is unusually blunt in the Fable 5 prompting guide: "Skills developed for prior models are often too prescriptive for Claude Fable 5 and can degrade output quality."
That sentence is the thesis here. Your Opus 4.x workarounds compensated for specific model weaknesses. Fable 5 fixed many of them, so the compensations are now liabilities. Here is what maps to what, what to delete, and an honest case for when not to migrate.
The API-level changes from Opus 4.8 are short, per the migration guide:
thinking: {type: "disabled"} returns a 400, and omitting the thinking field now means thinking runs, where on Opus 4.8 it meant off. Depth is controlled with output_config.effort.budget_tokens and assistant prefill stay removed. Both already 400 on Opus 4.7 and 4.8.refusal stop reason. The classifiers target offensive cyber, biology, and reasoning-extraction requests. A declined request returns HTTP 200 with stop_reason: "refusal" and a stop_details.category. Retry on Opus 4.8 via the beta fallbacks parameter, SDK middleware, or manually with fallback credit.count_tokens.For the code-level walkthrough with SDK examples, see our Fable 5 migration guide. This post is about the layer the checklist cannot automate.
The translation table for prompt and skill patterns that were correct on Opus 4.x and need rework on Fable 5, grounded in Anthropic's prompting guide and migration notes.
| Opus 4.x pattern | Fable 5 replacement | Why |
|---|---|---|
thinking: {type: "adaptive"} set explicitly | Omit the thinking field entirely (explicit adaptive still accepted) | Thinking is always on; disabled returns 400 |
effort: "xhigh" as the coding default | Start at high, reserve xhigh for capability-sensitive work | Lower effort on Fable 5 often exceeds xhigh on prior models |
| "CRITICAL: You MUST use this tool" escalation | Plain, brief instruction stating when to use the tool | Instruction following is strong enough that aggressive language overtriggers |
| Step-by-step procedural skills | Goal plus constraints; let the model plan the steps | Prescriptive skills "can degrade output quality" per Anthropic |
| "Explain your reasoning in the response" | Delete; read summarized thinking blocks instead | Can trigger the reasoning_extraction refusal category |
| "After every 3 tool calls, summarize progress" | Grounded progress instruction: audit claims against tool results | Forced cadence adds noise; grounding nearly eliminated fabricated status reports in Anthropic's testing |
| "Do not spawn subagents" guardrails | Encourage delegation with explicit when-to-delegate guidance | Fable 5 dispatches parallel subagents dependably, asynchronously |
| Remaining-token countdowns surfaced to the model | Hide the count, or add a reassurance line | Visible countdowns can trigger premature wrap-up in long sessions |
| No memory surface | A plain Markdown notes file with a lesson format | Fable 5 "performs particularly well" when it can record and reread lessons |
The pattern across every row: state the goal and the boundary, not the steps.
Get the weekly deep dive
Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.
From the archive
Jun 11, 2026 • 8 min read
Jun 11, 2026 • 10 min read
Jun 11, 2026 • 10 min read
Jun 10, 2026 • 8 min read
This is the most common silent regression, because the effort docs give different recommendations per model and most configs were tuned once and frozen.
On Opus 4.7 and 4.8, Anthropic's guidance was to start coding and agentic workloads at xhigh. On Fable 5 the guidance flips: start at high (the default), including workloads that ran at xhigh on Opus 4.8, and step down for routine work. The docs are explicit that "lower effort settings on Claude Fable 5 still perform well and often exceed xhigh performance on prior models."
Three interactions to watch:
thinking field) thinks adaptively on Fable 5, changing both latency and spend on routes you considered "cheap."max_tokens is a hard ceiling on thinking plus response text. At high and xhigh, set it generously or the model truncates mid-task. The migration guide says to revisit it specifically for workloads that previously ran without thinking.Run an effort sweep as part of migration, including low and medium. On a model this capable, paying xhigh prices for a routine route is the new overprovisioning. For whether the model itself is worth 2x Opus pricing, our Fable 5 vs Opus 4.8 decision guide and cost-per-task analysis cover that math.
Some Opus-era patterns do not just become unnecessary on Fable 5. They actively hurt.
The sharpest example: prompts or skills telling the model to "show your thinking" or "explain your internal reasoning in the response." On Fable 5 these can fire the reasoning_extraction refusal classifier, causing elevated fallbacks to Opus 4.8 on traffic that has nothing to do with safety. Anthropic's scaffolding guidance says to audit existing skills and system prompts for reflection instructions when migrating. If your application needs reasoning visibility, set thinking: {type: "adaptive", display: "summarized"} and read the structured thinking blocks instead. The broader transparency questions around Fable's classifiers are their own topic, covered in the silent guardrails post.
Skills written as numbered procedures were a rational response to models that improvised badly. Fable 5 plans well, and the official guidance is to "review and consider removing older instructions if default performance is better." The practical method: pick your two highest-traffic skills, write a de-prescribed variant stating the goal, constraints, and definition of done, and A/B it against the original on real tasks. The deletable lines are usually mid-procedure steps ("then run X, then check Y"); the keepers are boundaries ("never push to main") and facts ("the staging DB resets nightly").
Anti-delegation rules, forced progress cadence, and "do not explore, just answer" instructions all cap behaviors Fable 5 is now good at. Replace suppression with direction. The doc-recommended shape for delegation is one line: delegate independent subtasks and keep working while they run, intervening only if a subagent goes off track.
As a community observation rather than vendor guidance: Simon Willison's release-day testing (June 9, 2026) found the model "slow" and "expensive" but with remarkably strong instruction following on multi-step tasks, and noted the guardrails "trigger often enough" that the new fallback mechanisms exist for a reason. Steer with brief instructions, and treat refusal handling as a real code path.
For Claude Code users, the mechanics matter too. Per the Claude Code skills docs, a skill's body loads only when used, and custom commands have merged into skills. Three Fable-specific passes:
One more doc-backed behavior worth exploiting: Fable 5 "does a good job of updating skills on the fly based on what it learns from the task at hand." Give it permission to edit its own skills and review the diffs. That loop is the practical version of what we argued in why skills beat prompts for coding agents.
thinking config, prefills, effort values, and reasoning-echo phrases. Confirm your org is not on zero data retention.thinking: {type: "disabled"}, add stop_reason: "refusal" handling with stop_details.category logging, and configure fallback to Opus 4.8.low, medium, and high on routine routes; reserve xhigh for routes where evals show headroom. Raise max_tokens wherever thinking is new.Honest cases for not migrating, or not yet:
No. The API swap works with existing prompts. But Anthropic's guidance says prompts and skills written for prior models are often too prescriptive and can degrade Fable 5 output quality, so A/B your highest-traffic prompts with the scaffolding removed before calling the migration done.
Nothing directly. Thinking is always on and adaptive; output_config.effort (low, medium, high, xhigh, max) is the depth and spend control. thinking: {type: "disabled"} and budget_tokens both return 400 errors.
Probably not as a default. Anthropic recommends starting at high on Fable 5, including for workloads that ran at xhigh on Opus 4.8, because lower effort levels often exceed xhigh performance on prior models. Sweep low and medium on routine routes too.
The most common self-inflicted cause is an instruction asking the model to reproduce its internal reasoning in the response, which can trigger the reasoning_extraction refusal category. Audit prompts and skills for show-your-thinking language, and read summarized thinking blocks via thinking.display: "summarized" instead.
Technical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
Anthropic's agentic coding CLI. Runs in your terminal, edits files autonomously, spawns sub-agents, and maintains memory...
View ToolGives AI agents access to 250+ external tools (GitHub, Slack, Gmail, databases) with managed OAuth. Handles the auth and...
View ToolAnthropic's first generally available Mythos-class model, released June 9, 2026. 1M context, 128K max output, $10/$50 pe...
View ToolTurn a one-liner into a working Claude Code skill. From idea to installed in a minute.
View AppUnlock pro skills and share private collections with your team.
View AppBrowse 150+ ready-to-install skills for every kind of knowledge work.
View AppInstall the dd CLI and scaffold your first AI-powered app in under a minute.
Getting StartedWhat MCP servers are, how they work, and how to build your own in 5 minutes.
AI AgentsInstall Ollama and LM Studio, pull your first model, and run AI locally for coding, chat, and automation - with zero cloud dependency.
Getting StartedA practical playbook for running Claude Fable 5 as the orchestrator over Sonnet and Haiku workers, with verified cost ma...
Task budgets give Claude a token countdown for the whole agentic loop, so the model paces itself instead of discovering...
Fable 5 posts an 80.3% SWE-Bench Pro score and costs 2x Opus 4.8 - here is the task-profile scoring guide that tells you...
Claude agents vs skills, untangled: agents are workers with their own context window, skills are instructions loaded on...
Claude Code dynamic workflows turn orchestration into a JavaScript script that runs up to 1,000 agents per run - here is...
Anthropic's docs say the tokenizer introduced with Opus 4.7 can use up to 35% more tokens for the same text. Here is wha...

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.