MCP Tool Search - Claude Code
Deferred tool loading reduces context overhead for large MCP suites.
MCP tool search solves the "my MCP server has 80 tools" problem. Tools are loaded on demand instead of all at once.
What it does
When a server has tool search enabled, Claude Code sees a searchable index rather than every tool loaded into context. The model queries the index when it needs a tool, loads the matching tool's schema, and calls it. You keep the expressive power of large tool suites without paying tokens for every tool, every turn.
When to use it
- Any MCP server exposing more than a handful of tools.
- Multi-server setups where combined tool count bloats context.
- Cost-sensitive workflows where tool schemas were eating the budget.
- Large internal platforms with hundreds of operations.
Gotchas
- Tool search adds a small latency for the first call to each tool.
- Search queries affect tool selection quality. Servers should provide good descriptions.
- Not every MCP server supports deferred loading yet - check the server docs.
Official docs: https://code.claude.com/docs/en/mcp.md#scale-with-mcp-tool-search
Technical content at the intersection of AI and development. Building with AI agents, Claude Code, and modern dev tools - then showing you exactly how it works.
Get the weekly deep dive
Tutorials on Claude Code, AI agents, and dev tools - delivered free every week.
Was this helpful?
Related Guides
Related Tools
Claude Code
Anthropic's agentic coding CLI. Runs in your terminal, edits files autonomously, spawns sub-agents, and maintains memory...
View ToolCodeburn
Interactive TUI dashboard that shows exactly where your Claude Code and Cursor tokens are going, in real time.
View Toolv0
Vercel's generative UI tool. Describe a component, get production-ready React code with shadcn/ui and Tailwind. Iterate...
View ToolZed
High-performance code editor built in Rust with native AI integration. Sub-millisecond input latency. Built-in assistant...
View ToolRelated Videos

Composio: Connect OpenClaw & Claude Code to 1,000+ Apps via CLI
Composio: Connect AI Agents to 1,000+ Apps via CLI (Gmail, Google Docs/Sheets, Hacker News Workflows) Check out Composio here: http://dashboard.composio.dev/?utm_source=Youtube&utm_channel=0426&utm_...

Claude Code Channels in 8 Minutes
Anthropic has released Channels for Claude Code, enabling external events (CI alerts, production errors, PR comments, Discord/Telegram messages, webhooks, cron jobs, logs, and monitoring signals) to b...

Claude Code Loops in 7 Minutes
Claude Code “Loop” Scheduling: Recurring AI Tasks in Your Session The script explains Claude Code’s new “Loop” feature (an evolution of the Ralph Wiggins technique) for running recurring prompts that...
Related Posts

Anthropic Sonnet 4.5 in Claude Code
Anthropic's Claude Sonnet 4.5 isn't just another model increment. The company claims they've observed it maintaining foc...

12 Tools in One Night: An Honest Overnight Agent Report
I told an agent to improve the site every 10 minutes and went to sleep. Here is what 12 new repos, 60 PRs, and three goo...

Agent Replays with TraceTrail: Loom for Agent Runs
Agent runs are opaque. TraceTrail turns a Claude Code JSONL into a public share link with a stepped timeline of messages...
