TL;DR
Anthropic has released Claude Opus 4.5, positioning it as their most capable model yet for coding agents and computer use. The release brings significant price cuts, efficiency gains, and enough au...
Anthropic has released Claude Opus 4.5, positioning it as their most capable model yet for coding agents and computer use. The release brings significant price cuts, efficiency gains, and enough autonomous capability to outscore human candidates on the company's notoriously difficult technical assessment.
Opus 4.5 drops to $5 per million input tokens and $25 per million output tokens - three times cheaper than its predecessor. The model is available across Anthropic's web app, Claude Code, and all major cloud providers. This price reduction makes high-performance agentic workflows economically viable at scale.
On software engineering benchmarks, Opus 4.5 leads across the board. It tops SWE-bench Verified, TerminalBench, and shows strong performance on multilingual coding tasks with an 89.4% on Polyglot. Browser automation scores hit 72.9% on BrowserComp, and the model achieved $4,967 on VendingBench - though still trailing Gemini 3 Pro on that specific metric.

The headline metric, however, is token efficiency. Opus 4.5 matched Sonnet 4.5's best SWE-bench Verified score using 76% fewer output tokens. At maximum effort, it exceeds Sonnet 4.5 by 4.3 percentage points while consuming 48% fewer tokens. Raw performance is easy when you burn unlimited compute - efficiency at the frontier is what matters for production deployments.
Get the weekly deep dive
Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.
The model introduces an effort parameter in the API, letting developers control how much compute to allocate per task. This pairs with new features including tool search, programmatic tool calling, tool use examples, and context compaction.

Anthropic emphasizes Opus 4.5's ability to manage teams of sub-agents and build complex multi-agent systems without constant intervention. The model handles ambiguous tasks, reasons through trade-offs, and operates autonomously without the handholding earlier models required. Early testers consistently report that Opus 4.5 "just gets it" when handed open-ended technical tasks.
Claude Code now ships as a desktop application alongside the existing CLI and web interfaces. The release adds Microsoft Office integrations for PowerPoint, Excel, and Word, plus expanded Chrome extension support. Conversation limits have increased, and the system supports longer-running agentic workflows.

Perhaps the most striking claim: Opus 4.5 is the first model to outperform human candidates on Anthropic's technical take-home exam. The assessment tests technical ability and judgment under time pressure - areas where the model now exceeds the strongest human applicants.
This result raises concrete questions about how AI reshapes engineering as a profession. Anthropic acknowledges their exam doesn't measure collaboration, communication, or the instincts developed over years of experience. But on core technical skills, the machine has crossed the threshold.
In a demo building a glassmorphism-themed SaaS landing page with Next.js, Opus 4.5 completed the task in approximately five minutes with minimal instruction. The model handled design decisions, component structure, and styling autonomously. Image understanding capabilities suggest it can interpret Figma screenshots and other visual references to match specific design requirements.

The shift is clear: less time prompting, more time reviewing. Opus 4.5 operates as a system you delegate to rather than direct step-by-step.
Technical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
Anthropic's AI. Opus 4.6 for hard problems, Sonnet 4.6 for speed, Haiku 4.5 for cost. 200K context window. Best coding m...
View ToolAnthropic's agentic coding CLI. Runs in your terminal, edits files autonomously, spawns sub-agents, and maintains memory...
View Tool
New tutorials, open-source projects, and deep dives on coding agents - delivered weekly.
Anthropic's Python SDK for building production agent systems. Tool use, guardrails, agent handoffs, and orchestration. R...
Configure Claude Code for maximum productivity -- CLAUDE.md, sub-agents, MCP servers, and autonomous workflows.
AI AgentsInstall Claude Code, configure your first project, and start shipping code with AI in under 5 minutes.
Getting StartedInstall Ollama, pull your first model, and run AI locally for coding, chat, and automation - with zero cloud dependency.
Getting Started
In this video, we dive into Anthrop's latest release, Claude Opus 4.5, touted as the best model for coding agents and computer use. We review the blog post and significant announcements, such...

Exploring Claude Opus 4.6: Features, Benchmarks, Anthropic's Latest Frontier Model In this video, I delve into the details of Claude Opus 4.6, highlighting key features and performance benchmarks. Th

In this video, we dive into Anthropic's newly launched Cowork, a user-friendly extension of Claude Code designed to streamline work for both developers and non-developers. This discussion includes an

Million-token context, agent teams that coordinate without an orchestrator, and benchmark scores that push the frontier....

Anthropic's Sonnet 4.6 narrows the gap to Opus on agentic tasks, leads computer use benchmarks, and ships with a beta mi...

Two platforms, two philosophies. Here is how Anthropic and OpenAI compare on APIs, SDKs, documentation, pricing, and the...