// MCP TOKEN COST

Your MCP servers spend tokens before the agent starts.

Every connected server's tool definitions — names, descriptions, JSON schemas — are loaded into the model's context on every request. GitHub + Linear + Supabase together: 24,116 tokens, 12% of a 200k window, before the first message.

MEASURED CORPUS 5,253 servers · 111,074 tool schemas · median 2,145 tokens · max 147,411 Method →

// CALCULATOR

Measure your stack.

Pick servers from the catalogue or paste your mcpServers config. Untick the tools you don't use to see what a scoped grant saves. Want the risk view instead? Check your stack →

Nothing selected yet. Add servers above — the totals update live.

// BY SERVER

Measured server by server.

Every measured server has its own breakdown — headline cost, per-tool table, and what a scoped grant saves. The popular names and the heaviest of the catalogue:

UnClick 1658 tools · 147,411 tokens SmartBear MCP 240 tools · 137,725 tokens Ainumbers Mcp Apps 358 tools · 124,023 tokens Gapup Mcp 271 tools · 102,080 tokens Mcp Knowledge 271 tools · 102,080 tokens VaultPilot MCP 189 tools · 82,778 tokens Google Super 200 tools · 77,058 tokens GPT 5 5 x402 Low Cost Agent Tools 212 tools · 72,665 tokens Civilquants 56 tools · 70,581 tokens Contaí — Calculadoras Fiscais Brasileiras 139 tools · 68,378 tokens Yaver 780 tools · 65,739 tokens Leaper Vision Toolkit 169 tools · 65,642 tokens AdButler 622 tools · 65,582 tokens Trello 200 tools · 63,945 tokens Pulsenetwork 74 tools · 61,430 tokens Alibabacloud Dataworks 186 tools · 54,307 tokens GoCreative Agent API 546 tools · 52,033 tokens Dialogbrain 220 tools · 51,410 tokens Intervals Icu Api 146 tools · 48,993 tokens Stable Baseline 196 tools · 46,634 tokens GitHub 86 tools · 14,406 tokens Linear 66 tools · 7,149 tokens Supabase 29 tools · 2,561 tokens Filesystem 14 tools · 1,666 tokens

5,253 servers measured in total — search the full set in the calculator above, or browse the tool catalogue.

// WHY THIS HAPPENS

Tool definitions are context overhead, paid on every request.

Every schema rides along

An MCP client sends each connected server's tools/list — name, description, full JSON input schema per tool — to the model so it knows what it can call. Connecting a server means paying for all of it.

It compounds across servers

Stacks accumulate: a few popular servers reach tens of thousands of tokens of definitions. That space competes directly with your code, documents and conversation history.

Most of it is never used

Agents typically call a handful of tools per session, but pay for every definition on every request. The fix is structural: expose only the tools you actually grant.

// DEFERRED LOADING

Some clients now defer loading. It reduces the bill — it doesn't remove it.

Deferral trades upfront cost for per-use cost: tool searches and schema reloads are themselves charged against the window. And several major clients don't defer at all.

Client	Tool definition loading
Claude Code	Defers MCP schemas behind tool search by default — you pay per search and per reload
VS Code (Copilot)	Experimental tool grouping — partial deferral, off by default
Cursor	Loads all tool definitions upfront, every request
Windsurf	Loads all tool definitions upfront, every request
Gemini CLI	Loads all tool definitions upfront, every request

Client behaviour verified 04-06-2026. A scoped grant cuts the cost either way: fewer definitions to load upfront, fewer to search and reload when deferred.

// THE CORPUS

The median server is cheap. The ones you actually run aren't.

Across 5,253 servers with complete schema coverage in the PolicyLayer scan database, definition cost is heavily long-tailed.

2,145

MEDIAN TOKENS / SERVER

11,440

P90

27,687

P99

147,411

HEAVIEST — 74% OF A 200K WINDOW

The heaviest measured server (UnClick) consumes 74% of a 200k context window with tool definitions alone. Popular productivity servers cluster well above the median — GitHub + Linear + Supabase average 8,039 tokens each.

//FAQ

MCP token-cost questions.

How much of my context window do MCP servers use?+

It depends entirely on the servers. Across 5,253 measured servers the median is 2,145 tokens of tool definitions, but the heavy hitters dominate: GitHub + Linear + Supabase together consume 24,116 tokens — 12% of a 200k window — before the first message.

Why do MCP tool definitions consume tokens?+

MCP clients send every connected server's tool definitions — name, description, and JSON input schema — to the model so it knows what it can call. That payload counts against the context window on every request, whether or not any tool is used.

Does deferred tool loading (Claude Code tool search) solve this?+

It helps in clients that support it, but it is not free: each tool search and schema reload costs tokens, and definitions still enter context once loaded. Cursor, Windsurf and Gemini CLI load all definitions upfront. Cutting the exposed tool set reduces cost in every client.

How do I reduce MCP token usage?+

Expose fewer tools. Routing servers through a PolicyLayer grant means only the tools you explicitly allow are visible to the client — ungranted definitions never enter the context window, and every call that does run is policy-checked.

How are these token counts measured?+

Each tool is serialised the way a tools/list response carries it ({name, description, inputSchema}) using schemas from the PolicyLayer scan database, then counted with tiktoken o200k_base. Clients vary slightly in serialisation, so treat counts as close estimates rather than exact invoices.

// METHOD & REVIEW

How these numbers were measured.

Serialisation

Each tool is serialised as a tools/list entry — name, description, input schema — from the schemas in the PolicyLayer scan database. Clients differ slightly in framing, so treat counts as close estimates.

Tokeniser

tiktoken o200k_base (GPT-4o/o-series). Anthropic's current tokeniser isn't published, so Claude's exact counts will differ; for English text and JSON schemas the totals are close enough to treat these as estimates.

Coverage

Only servers with (near-)complete schema coverage are measured — 5,253 of the catalogue. Partial coverage is disclosed per page rather than estimated away.

Computed 15-07-2026 from the PolicyLayer scan database. Counts refresh with every site build. Sources: the MCP specification (tools/list), tiktoken, and our State of MCP research.

Pay for the tools you use — not every tool on every server.

A PolicyLayer grant exposes only the tools you allow. Ungranted definitions never enter your context window, and every call that does run is checked against policy first.

CUT YOUR TOKEN COST →

Instant setup, no code required.

46,500+ MCP servers and 515,000+ tools scanned and risk-classified.