TL;DR
Twelve documented Claude Fable 5 use patterns - agent orchestration, overnight runs, 1M-context refactors, effort tuning - each with a how-to seed and doc link.
Read next
How to use Claude Fable 5 across every access path: claude.ai plans through June 22, the Claude API, Amazon Bedrock, Vertex AI, and Microsoft Foundry, with setup effort and first-prompt tips.
8 min readUltracode is two documented things: a prompt keyword that turns one task into a dynamic workflow, and an /effort setting that pairs xhigh reasoning with automatic orchestration. Here is exactly what the docs say.
8 min readFable 5 posts an 80.3% SWE-Bench Pro score and costs 2x Opus 4.8 - here is the task-profile scoring guide that tells you when the premium pays off.
7 min readLast updated: June 11, 2026
Claude Fable 5 went GA on June 9, 2026 at $10 per million input tokens and $50 per million output tokens - double Opus 4.8 on every token type. Two days in, the question is not "is it good" but "what are people actually doing with it that justifies the rate." The answer is visible in Anthropic's docs, the Claude Code features that shipped alongside it, and the model's own prompting guide. Here are the twelve best-documented patterns, ranked by leverage, each with a starting point and the doc to read next.
Still deciding whether it belongs in your stack? Start with the Fable 5 vs Opus 4.8 decision guide and come back.
The headline use case. In the launch post, Stripe reported that Fable 5 "compressed months of engineering into days," completing a codebase-wide migration in a single day that Anthropic says would have taken a whole team over two months by hand. A vendor-published claim, but it matches the documented profile: the official prompting guide calls out "first-shot correctness on complex, well-specified problems," with early testers reporting single-pass implementations that previously took days of iteration.
Best for: API version upgrades, dependency replacements, schema changes spanning hundreds of files.
How to start: write the full migration spec up front - source pattern, target pattern, what counts as done - and hand it over as one well-specified task. Our migration walkthrough covers the prompt-level changes.
Fable 5's defining behavioral shift is turn length. Per the prompting guide, individual requests on hard tasks can run for many minutes and autonomous runs can extend for hours - Anthropic tells teams to adjust client timeouts and check on runs asynchronously rather than blocking. The recoverable-harness recipe is in Anthropic's engineering post on long-running agents: an initializer agent for one-time setup, a coding agent for incremental progress, and four artifacts carrying state between sessions - a JSON feature list with pass/fail flags, a progress file, an init.sh script, and meaningful git history.
Best for: greenfield builds and large feature backlogs you want worked while you sleep.
How to start: one feature at a time, verified before marked complete - detailed in our overnight agents workflow guide.
Running Fable 5 on every leg of a pipeline doubles your bill for work that does not need it. The cost-sane configuration keeps Fable 5 at the planning and integration layer and routes mechanical work to cheaper models. Claude Code subagent definitions support this directly: the model frontmatter field accepts sonnet, opus, haiku, or fable aliases per subagent, and the effort docs name subagents as the typical use case for low effort. Fable 5 is also a better dispatcher - the prompting guide calls it "significantly more dependable at dispatching and sustaining parallel subagents."
Best for: any multi-step pipeline where exploration, file edits, and test runs outnumber planning decisions.
How to start: define subagents with model: haiku or model: sonnet plus effort: low, keep the session model on Fable 5, and let it delegate.
When a task needs more agents than one conversation can coordinate, Claude Code dynamic workflows move the plan into a JavaScript script that Claude writes and a runtime executes - up to 16 concurrent agents and 1,000 agents total per run, with intermediate results in script variables instead of the context window. Trigger one with the ultracode keyword, or set /effort ultracode to let Claude plan a workflow for every substantive task. Good runs save to .claude/workflows/ as reusable slash commands.
Best for: codebase-wide bug sweeps, 500-file migrations, research questions needing cross-checked sources.
How to start: run the bundled /deep-research workflow once to see the shape, then try a workflow on a small slice of your real task first. For how the parallelism primitives compare, see subagents vs agent teams vs workflows.
Get the weekly deep dive
Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.
From the archive
Jun 11, 2026 • 10 min read
Jun 10, 2026 • 8 min read
Jun 10, 2026 • 9 min read
Jun 10, 2026 • 9 min read
The most interesting fleet pattern is not more workers - it is workers that check each other. The workflows doc describes runs where "independent agents adversarially review each other's findings before they're reported," and /deep-research votes on claims and filters the ones that fail cross-checking. The prompting guide adds a key finding: separate, fresh-context verifier subagents tend to outperform self-critique. The agent teams docs document the debugging variant - teammates on competing hypotheses actively trying to disprove each other, because "the theory that survives is much more likely to be the actual root cause."
Best for: root-cause debugging, research synthesis, any output where one confident wrong answer is expensive.
How to start: add a verifier agent to your pipeline that critiques output without seeing the reasoning that produced it.
Fable 5 ships with a 1M token context window and 128K max output, per the models overview. That is enough to load a medium-sized codebase and reason across it holistically - dependency tracing through full type definitions, cross-file refactors without retrieval plumbing. One honest caveat from the same page: the Opus 4.7 tokenizer means the same text produces roughly 30% more tokens than pre-4.7 models. The 1M window holds fewer words than it used to, and token budgets need re-baselining.
Best for: architecture reviews and refactors where the answer depends on seeing everything at once.
How to start: exclude generated code, lockfiles, and vendored dependencies before loading; put stable content first so prompt caching can do its work.
Effort is the primary cost and latency dial on Fable 5, set via output_config.effort in the API. The documented guidance: high is the default, xhigh is for long-horizon work over 30 minutes with token budgets in the millions, and medium or low cover routine work. The detail most teams miss: lower effort settings on Fable 5 "still perform well and often exceed xhigh performance on prior models" - so Fable 5 at medium can be a legitimate replacement for Opus 4.8 at xhigh, not just a degraded Fable.
Best for: controlling spend without switching models mid-pipeline.
How to start: run your eval suite at medium first; only climb to xhigh where you measure real headroom. The cost math is in our cost-per-task analysis.
Agent teams (experimental, enabled with CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1) run peer Claude Code sessions with a shared task list and inter-agent messaging. The leverage feature is the plan approval gate: a teammate works in read-only plan mode until the lead approves its approach; give the lead criteria like "only approve plans that include test coverage." Quality-gate hooks (TeammateIdle, TaskCreated, TaskCompleted) block actions with exit code 2. Official sizing guidance: 3-5 teammates, 5-6 tasks each, token costs scaling linearly with teammate count.
Best for: cross-layer changes where frontend, backend, and tests each need an owner.
How to start: begin with research or review tasks, not parallel implementation, and keep teammates off shared files.
Fable 5 "performs particularly well when it can record lessons from previous runs and reference them," per the prompting guide - the recommended implementation is one Markdown file per lesson. Claude Code wires this in natively: the subagent memory frontmatter field gives an agent a persistent directory (user, project, or local scope) that survives across conversations, with the first 200 lines (or 25KB, whichever comes first) of its MEMORY.md injected into the system prompt.
Best for: recurring agents - reviewers, triagers, deploy bots - that should get smarter every run.
How to start: have Fable 5 review past sessions with subagents and store the core lessons it finds - the bootstrap prompt is in the guide.
Code review splits into two documented tiers. Hosted Code Review (research preview, Team and Enterprise plans) runs a fleet of specialized agents per PR with a verification step that filters false positives, posts severity-tagged inline comments, and averages $15-25 per review, completing in 20 minutes on average. A REVIEW.md at repo root is injected into every reviewer agent as the highest-priority instruction block - use it to cap nit volume, set skip rules, and require file:line evidence. The lighter path is the local /code-review command in any Claude Code session, with --comment to post PR comments and --fix to apply findings.
Best for: teams whose review queue, not code generation, is the bottleneck.
How to start: run /code-review high locally on your next branch before paying for the hosted fleet.
The longer an agent runs unattended, the more a fabricated "all tests passing" report costs you. Anthropic's documented fix is simple: have the model audit each progress claim against an actual tool result from the session before reporting it. Per the prompting guide, this "nearly eliminated fabricated status reports even on tasks designed to elicit them" in Anthropic's testing. Pair it with the send-to-user tool pattern so a long-running agent can surface verbatim progress updates mid-task without ending its turn.
Best for: every autonomous pipeline.
How to start: paste the grounding instruction from the prompting guide into your long-run system prompt today. It costs nothing.
The counterintuitive pattern: leveraging Fable 5 sometimes means deleting instructions. The prompting guide states that skills developed for prior models "are often too prescriptive for Claude Fable 5 and can degrade output quality," and recommends removing older instructions where default performance is better. One hard gotcha: prompts that tell the model to echo or explain its internal reasoning can trigger the reasoning_extraction refusal category, causing elevated fallbacks to Opus 4.8. Mechanics in the safeguards and refusal architecture post.
Best for: anyone migrating an existing Claude Code setup or skill library.
How to start: audit skills and system prompts for show-your-thinking instructions and step-by-step micromanagement, then A/B the trimmed versions.
Through June 22, 2026, it is included at no extra cost on Pro, Max, Team, and seat-based Enterprise plans, per the launch post. From June 23 it requires usage credits. Details in our June 22 deadline breakdown.
$10 per million input tokens and $50 per million output tokens - exactly double Opus 4.8's $5/$25. And the tokenizer produces roughly 30% more tokens than pre-Opus-4.7 models, so per-token price alone understates the jump.
No, but it is the fastest path. Workflows, agent teams, subagent definitions, and /code-review are Claude Code features. Effort levels, memory patterns, grounded progress reporting, and the send-to-user tool all work through the raw API and the Agent SDK.
Anthropic reports the safeguards trigger in under 5% of sessions on average, targeting offensive cybersecurity, biology and life sciences content, and reasoning extraction. Benign work near those domains can still trip them, so configure fallback to Opus 4.8 if your workload sits close to the boundary.
Grounded progress reporting (entry 11). It is one paragraph of prompt text, applies to every autonomous run on any stack, and Anthropic's testing showed it nearly eliminated fabricated status reports.
Technical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
Anthropic's first generally available Mythos-class model, released June 9, 2026. 1M context, 128K max output, $10/$50 pe...
View ToolAnthropic's AI. Opus 4.6 for hard problems, Sonnet 4.6 for speed, Haiku 4.5 for cost. 200K context window. Best coding m...
View ToolAnthropic's smallest Claude 4.5 model. Near-frontier coding performance at one-third the cost of Sonnet 4 and up to 4-5x...
View ToolAnthropic's flagship reasoning model. Best-in-class for coding, long-context analysis, and agentic workflows. 1M token c...
View ToolEvery coding agent in one window. Stop alt-tabbing between Claude, Codex, and Cursor.
View AppTurn a one-liner into a working Claude Code skill. From idea to installed in a minute.
View AppUnlock pro skills and share private collections with your team.
View AppA practical walk-through of how to design, write, and ship a Claude Code skill - from choosing when to trigger, through allowed-tools, to the steps the agent will actually follow.
Getting StartedManaged scheduling on Anthropic infrastructure with API and GitHub triggers.
Claude CodeThe primary command-line entry point for Claude Code sessions.
Claude CodeFable 5 posts an 80.3% SWE-Bench Pro score and costs 2x Opus 4.8 - here is the task-profile scoring guide that tells you...
Claude agents vs skills, untangled: agents are workers with their own context window, skills are instructions loaded on...
Claude Code dynamic workflows turn orchestration into a JavaScript script that runs up to 1,000 agents per run - here is...
Anthropic's docs say the tokenizer introduced with Opus 4.7 can use up to 35% more tokens for the same text. Here is wha...
Fable 5 long-running requests can run for many minutes per turn and hours per autonomous run. Here is how to configure c...
Anthropic says persistent file-based memory improved Fable 5 three times more than it improved Opus 4.8. Here is the ful...

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.