TL;DR
Anthropic has released Claude Opus 4.5, positioning it as their most capable model yet for coding agents and computer use. The release brings significant price cuts, efficiency gains, and enough au...
Read next
Million-token context, agent teams that coordinate without an orchestrator, and benchmark scores that push the frontier. Opus 4.6 is Anthropic's biggest model drop yet.
8 min readAnthropic's Sonnet 4.6 narrows the gap to Opus on agentic tasks, leads computer use benchmarks, and ships with a beta million-token context window. Here's what actually changed.
6 min readClaude Code is Anthropic's terminal-based AI agent that ships code autonomously. Complete guide: install, CLAUDE.md memory, MCP, sub-agents, pricing, and workflows.
6 min readAnthropic has released Claude Opus 4.5, positioning it as their most capable model yet for coding agents and computer use. The release brings significant price cuts, efficiency gains, and enough autonomous capability to outscore human candidates on the company's notoriously difficult technical assessment.
Opus 4.5 drops to $5 per million input tokens and $25 per million output tokens - three times cheaper than its predecessor. The model is available across Anthropic's web app, Claude Code, and all major cloud providers. This price reduction makes high-performance agentic workflows economically viable at scale.
For model-selection context, compare this with What Is Claude Code? The Complete Guide for 2026 and 60 Claude Code Tips and Tricks for Power Users; the useful question is not only benchmark quality, but where the model fits in a real developer workflow.
On software engineering benchmarks, Opus 4.5 leads across the board. It tops SWE-bench Verified, TerminalBench, and shows strong performance on multilingual coding tasks with an 89.4% on Polyglot. Browser automation scores hit 72.9% on BrowserComp, and the model achieved $4,967 on VendingBench - though still trailing Gemini 3 Pro on that specific metric.

The headline metric, however, is token efficiency. Opus 4.5 matched Sonnet 4.5's best SWE-bench Verified score using 76% fewer output tokens. At maximum effort, it exceeds Sonnet 4.5 by 4.3 percentage points while consuming 48% fewer tokens. Raw performance is easy when you burn unlimited compute - efficiency at the frontier is what matters for production deployments.
The model introduces an effort parameter in the API, letting developers control how much compute to allocate per task. This pairs with new features including tool search, programmatic tool calling, tool use examples, and context compaction.

Anthropic emphasizes Opus 4.5's ability to manage teams of sub-agents and build complex multi-agent systems without constant intervention. The model handles ambiguous tasks, reasons through trade-offs, and operates autonomously without the handholding earlier models required. Early testers consistently report that Opus 4.5 "just gets it" when handed open-ended technical tasks.
Get the weekly deep dive
Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.
From the archive
Claude Code now ships as a desktop application alongside the existing CLI and web interfaces. The release adds Microsoft Office integrations for PowerPoint, Excel, and Word, plus expanded Chrome extension support. Conversation limits have increased, and the system supports longer-running agentic workflows.

Perhaps the most striking claim: Opus 4.5 is the first model to outperform human candidates on Anthropic's technical take-home exam. The assessment tests technical ability and judgment under time pressure - areas where the model now exceeds the strongest human applicants.
This result raises concrete questions about how AI reshapes engineering as a profession. Anthropic acknowledges their exam doesn't measure collaboration, communication, or the instincts developed over years of experience. But on core technical skills, the machine has crossed the threshold.
In a demo building a glassmorphism-themed SaaS landing page with Next.js, Opus 4.5 completed the task in approximately five minutes with minimal instruction. The model handled design decisions, component structure, and styling autonomously. Image understanding capabilities suggest it can interpret Figma screenshots and other visual references to match specific design requirements.

The shift is clear: less time prompting, more time reviewing. Opus 4.5 operates as a system you delegate to rather than direct step-by-step.
Claude Opus 4.5 is Anthropic's flagship AI model released in November 2025, optimized for coding agents and autonomous computer use. It represents a significant upgrade over Opus 4, with improved token efficiency (76% fewer output tokens for equivalent performance), lower pricing ($5/$25 per million input/output tokens), and the ability to manage multi-agent workflows without constant supervision.
Opus 4.5 exceeds Sonnet 4.5 by 4.3 percentage points on SWE-bench Verified while consuming 48% fewer tokens. The key difference is reasoning depth: Opus handles ambiguous, open-ended tasks where Sonnet would need more explicit guidance. Use Opus for complex autonomous work and Sonnet for faster, more straightforward tasks where cost matters more than maximum capability.
The effort parameter lets you control how much compute the model allocates to a task. Higher effort levels enable deeper reasoning and better results on complex problems, while lower effort saves tokens for simpler tasks. This gives developers fine-grained control over the cost-quality tradeoff per API call.
As of May 2026, Opus 4.6 and Opus 4.7 have been released with additional capabilities including adaptive thinking and agent teams. However, Opus 4.5 remains highly capable and more cost-effective for many use cases. The effort parameter and pricing make it a strong choice for high-volume autonomous workloads where the newest features are not required.
Context compaction is a feature that allows the model to summarize and compress its conversation history during long-running sessions. This prevents the context window from filling up and lets agents run for extended periods without losing track of earlier work. It is particularly useful for multi-hour coding sessions.
Yes. Anthropic reported that Opus 4.5 outperformed human candidates on their technical take-home exam, which tests coding ability and judgment under time pressure. However, the assessment does not measure collaboration, communication, or engineering intuition developed through years of experience. The result demonstrates strong autonomous technical capability, not full replacement of human engineers.
Opus 4.5 is available through the Anthropic API (model ID: claude-opus-4-5-20251101), Claude Code, the Claude web app, and major cloud providers including AWS Bedrock and Google Cloud Vertex AI. Claude Code on the Max plan ($200/month) includes Opus 4.5 access with high usage limits.
Three factors: token efficiency, autonomous judgment, and sub-agent management. The model completes SWE-bench tasks using far fewer tokens than competitors, handles ambiguous instructions without constant clarification, and can coordinate multiple sub-agents for parallel work. This combination makes it practical to run long-running autonomous coding workflows at scale.
Technical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
Anthropic's AI. Opus 4.6 for hard problems, Sonnet 4.6 for speed, Haiku 4.5 for cost. 200K context window. Best coding m...
View ToolAnthropic's flagship reasoning model. Best-in-class for coding, long-context analysis, and agentic workflows. 1M token c...
View ToolAnthropic's agentic coding CLI. Runs in your terminal, edits files autonomously, spawns sub-agents, and maintains memory...
View ToolAnthropic's smallest Claude 4.5 model. Near-frontier coding performance at one-third the cost of Sonnet 4 and up to 4-5x...
View ToolUnlock pro skills and share private collections with your team.
View AppPro hooks for Claude Code. Private bundles, team sync, one-click install.
View AppEvery coding agent in one window. Stop alt-tabbing between Claude, Codex, and Cursor.
View AppUse opus, sonnet, haiku, and best to switch models easily.
Claude CodeHybrid mode: Opus for planning, Sonnet for execution.
Claude CodeExtended context window for Opus and Sonnet on supported plans.
Claude Code
Anthropic Releases Claude Opus 4.7: Benchmarks, Vision Upgrades, Memory, Pricing & New Claude Code Features Anthropic has released Opus 4.7, and the video covers the announcement, benchmark results, ...

Nimbalyst Demo: A Visual Workspace for Codex + Claude Code with Kanban, Plans, and AI Commits Try it: https://nimbalyst.com/ Star Repo Here: https://github.com/Nimbalyst/nimbalyst This video demos N...

Claude Design by Anthropic: Generate a Design System From Your Repo + Build High-Fidelity UI Fast The video reviews Claude Design by Anthropic, calling it a highly differentiated product, and demonst...

Million-token context, agent teams that coordinate without an orchestrator, and benchmark scores that push the frontier....

Anthropic's Sonnet 4.6 narrows the gap to Opus on agentic tasks, leads computer use benchmarks, and ships with a beta mi...

Claude Code is Anthropic's terminal-based AI agent that ships code autonomously. Complete guide: install, CLAUDE.md memo...

Two platforms, two philosophies. Here is how Anthropic and OpenAI compare on APIs, SDKs, documentation, pricing, and the...

The definitive collection of Claude Code tips - sub-agents, hooks, worktrees, MCP, custom agents, keyboard shortcuts, an...

Anthropic's Claude Haiku 4.5 delivers Sonnet 4-level coding performance at one-third the cost and twice the speed. Here...

New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.