// GLOSSARY -- MCP & TOOL INFRASTRUCTURE

What is MCP Token Cost?

2 min read Updated Jun 11, 2026

MCP token cost is the context-window overhead incurred by connecting MCP servers: every connected server's tool definitions — names, descriptions, and input schemas — are injected into the model's prompt, consuming tokens on every request whether or not the tools are used.

WHY IT MATTERS

For the model to know what it can call, the client serialises each server's tools/list output into the prompt. A single verbose tool — long description, large JSON schema — can cost hundreds of tokens; a server exposing dozens of tools can cost thousands; a typical multi-server setup can consume a five-figure token count before the user types anything. That overhead recurs on every request in the session.

The costs compound in three ways. Money: those tokens are billed as input on every call. Latency: larger prompts take longer to process. Quality: tool definitions compete with actual work for context window space, and models pick tools less reliably as the catalogue grows — the tool sprawl problem measured in tokens.

Mitigation options, roughly in order of effort:

Connect fewer servers — disconnect servers a project does not use; the cheapest token is one never injected.
Tool filtering — expose only the subset of a server's tools you actually call, where the client or an intermediary supports it.
Virtual servers — aggregate many upstreams behind one curated facade exposing a minimal tool set (see MCP virtual server).
Measurement — count the serialised definition size per server so trimming decisions are data-driven rather than guesswork.

HOW POLICYLAYER USES THIS

PolicyLayer publishes per-server token-cost data at policylayer.com/token-cost, measured from each scanned server's actual tool definitions, so you can see what a server costs your context window before connecting it. The gateway's virtual server support lets teams expose a filtered tool subset from registered upstreams, cutting injected definitions to the tools policy actually permits.

Look up any server’s token cost →

IN THE CATALOGUE

Measured across 5,494 MCP servers (117,877 tools): connecting a server loads its full tool definitions into the context window on every request.

2,149 tokens — median server

11,752 tokens — 90th percentile

183,480 tokens — largest measured (Ainumbers Mcp Apps)

Server	Tool definitions	Tokens per request
GitHub	86	14,406
Linear	66	7,149
Supabase	29	2,561
Filesystem	14	1,666

Look up any server’s token cost →

FREQUENTLY ASKED QUESTIONS

Do MCP tools cost tokens even when not used?

Yes. Tool definitions are injected into the prompt so the model knows they exist, and that happens on every request regardless of whether any tool is called.

How many tokens does a typical MCP server cost?

It varies enormously — from under a hundred tokens for a small, tersely documented server to several thousand for servers with many tools and large schemas. Measuring the serialised tools/list output is the only reliable answer.

What is the most effective way to reduce MCP token cost?

Reduce what gets injected: disconnect unused servers, filter to the tools you call, or front multiple upstreams with a virtual server exposing a minimal curated set.

What is MCP Token Cost?

WHY IT MATTERS

HOW POLICYLAYER USES THIS

IN THE CATALOGUE

FREQUENTLY ASKED QUESTIONS

FURTHER READING

Take your agents live. Without losing control.

What is MCP Token Cost?

WHY IT MATTERS

HOW POLICYLAYER USES THIS

IN THE CATALOGUE

FREQUENTLY ASKED QUESTIONS

RELATED TERMS

FURTHER READING

Take your agents live. Without losing control.