Fast Mode - Claude Code
2.5x faster Opus at a higher token cost (research preview).
Fast mode runs Opus with an accelerated inference path - roughly 2.5x the throughput at a higher per-token price.
What it does
When fast mode is enabled, Claude Code routes Opus calls through a lower-latency backend. You pay more per token, but turns complete faster. Quality matches standard Opus. It's a straight speed-for-cost tradeoff for sessions where wall-clock time matters more than spend.
When to use it
- Interactive work where latency hurts flow.
- Pair-programming sessions where Claude needs to keep up with you.
- Time-critical debugging or incident response.
- Demos and recordings where dead air looks bad.
Gotchas
- Fast mode is a research preview. Availability and pricing can change.
- Cost can balloon on long sessions. Watch your
/statusregularly. - Only Opus is accelerated. Sonnet and Haiku ignore the flag.
Official docs: https://code.claude.com/docs/en/fast-mode.md
Technical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
Get the weekly deep dive
Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.
Was this helpful?
Related Guides
Related Tools
Zed
High-performance code editor built in Rust with native AI integration. Sub-millisecond input latency. Built-in assistant...
View ToolClaude Haiku 4.5
Anthropic's smallest Claude 4.5 model. Near-frontier coding performance at one-third the cost of Sonnet 4 and up to 4-5x...
View ToolClaude Code
Anthropic's agentic coding CLI. Runs in your terminal, edits files autonomously, spawns sub-agents, and maintains memory...
View ToolCodeburn
Interactive TUI dashboard that shows exactly where your Claude Code and Cursor tokens are going, in real time.
View ToolRelated Videos

Composio: Connect OpenClaw & Claude Code to 1,000+ Apps via CLI
Composio: Connect AI Agents to 1,000+ Apps via CLI (Gmail, Google Docs/Sheets, Hacker News Workflows) Check out Composio here: http://dashboard.composio.dev/?utm_source=Youtube&utm_channel=0426&utm_...

Claude Code Channels in 8 Minutes
Anthropic has released Channels for Claude Code, enabling external events (CI alerts, production errors, PR comments, Discord/Telegram messages, webhooks, cron jobs, logs, and monitoring signals) to b...

Claude Code Loops in 7 Minutes
Claude Code “Loop” Scheduling: Recurring AI Tasks in Your Session The script explains Claude Code’s new “Loop” feature (an evolution of the Ralph Wiggins technique) for running recurring prompts that...
Related Posts

Anthropic Sonnet 4.5 in Claude Code
Anthropic's Claude Sonnet 4.5 isn't just another model increment. The company claims they've observed it maintaining foc...

12 Tools in One Night: An Honest Overnight Agent Report
I told an agent to improve the site every 10 minutes and went to sleep. Here is what 12 new repos, 60 PRs, and three goo...

Agent Replays with TraceTrail: Loom for Agent Runs
Agent runs are opaque. TraceTrail turns a Claude Code JSONL into a public share link with a stepped timeline of messages...
