// MCP TOKEN COST

The GoldenMatch MCP server costs 3,644 tokens before the first call.

Connect GoldenMatch and its 42 tool definitions are loaded into the model's context on every request — 1.8% of a 200k window spent before your agent does anything.

QUICK ANSWER The GoldenMatch MCP server's tool definitions consume 3,644 tokens — around the median MCP server (1,905 tokens). A scoped grant exposing only the tools you use cuts that roughly in proportion.

MEASURED FROM SCHEMAS 42 tools · 3,644 tokens · 1.8% of 200k · 0.4% of 1M Method →

// CONTEXT WINDOW SHARE

What that buys before your agent starts working.

Tool definitions are overhead: they occupy context on every request and compete with your code, documents and conversation history for the same window.

200K WINDOW 1.8%

1M WINDOW 0.4%

Corpus context: GoldenMatch ranks #1198 of 3,213 measured MCP servers by definition cost. The median is 1,905 tokens, p90 is 7,952, and the heaviest (Fusionauth) is 183,337 — 92% of a 200k window on its own.

// PER-TOOL BREAKDOWN

Where the 3,644 tokens go.

Each row is one tool definition as a tools/list entry — name, description and input schema — counted with o200k_base. Average: 87 tokens per tool.

Tool	Category	Tokens	% of server
create_domain	Write	304	8.3%
add_correction	Write	178	4.9%
pprl_link	Execute	177	4.9%
fix_quality	Execute	165	4.5%
match_record	Read	152	4.2%
suggest_config	Read	149	4.1%
run_transforms	Execute	117	3.2%
learn_thresholds	Read	113	3.1%
pprl_auto_config	Read	111	3.0%
scan_quality	Read	102	2.8%
list_corrections	Read	101	2.8%
find_duplicates	Read	97	2.7%
auto_configure	Write	94	2.6%
agent_approve_reject	Write	93	2.6%
identity_resolve	Write	91	2.5%
test_domain	Read	88	2.4%
identity_merge	Write	87	2.4%
identity_split	Read	84	2.3%
list_clusters	Read	83	2.3%
export_results	Write	78	2.1%
unmerge_record	Destructive	76	2.1%
memory_export	Write	75	2.1%
explain_match	Read	70	1.9%
agent_explain_pair	Read	67	1.8%
identity_list	Read	67	1.8%
controller_telemetry	Read	66	1.8%
memory_stats	Read	66	1.8%
shatter_cluster	Read	62	1.7%
agent_match_sources	Read	58	1.6%
identity_history	Read	57	1.6%
get_cluster	Read	54	1.5%
agent_deduplicate	Execute	50	1.4%
get_golden_record	Read	50	1.4%
agent_compare_strategies	Read	49	1.3%
agent_explain_cluster	Read	42	1.2%
analyze_data	Read	42	1.2%
identity_conflicts	Read	42	1.2%
suggest_pprl	Read	42	1.2%
agent_review_queue	Read	38	1.0%
profile_data	Read	37	1.0%
get_stats	Read	36	1.0%
list_domains	Read	34	0.9%

// SCOPED GRANT

Most agents use a handful of these tools. They pay for all 42.

A PolicyLayer grant exposes only the tools you allow — ungranted definitions are filtered out of the tool list, so they never enter the context window. Estimates below assume typical-weight tools (87 tokens each).

Grant scope	Definition cost	Reduction
All 42 tools (no gateway)	3,644 tokens	—
3 granted tools	~260 tokens	−93%
5 granted tools	~434 tokens	−88%
10 granted tools	~868 tokens	−76%

Model your own stack in the token-cost calculator, or see the GoldenMatch policy for what a sensible grant looks like.

//FAQ

GoldenMatch token-cost questions.

How many tokens does the GoldenMatch MCP server use?+

Its 42 tool definitions total 3,644 tokens — 1.8% of a 200k context window — measured with tiktoken o200k_base over the serialised tools/list payload. Exact counts vary slightly by client and model.

Why does GoldenMatch consume tokens before I send a message?+

MCP clients load every connected server's tool definitions — name, description, and input schema — into the model's context so it knows what it can call. That payload is charged against your context window on every request, whether or not a tool is used.

How do I reduce GoldenMatch's token usage?+

Expose fewer tools. A PolicyLayer grant scopes GoldenMatch to only the tools you allow — ungranted definitions are filtered out of the tool list, so they never enter the context window. A grant of 3 typical tools costs roughly 260 tokens, a 93% reduction.

Does deferred tool loading fix this?+

Partially, in some clients. Claude Code defers MCP tool schemas behind a tool-search step by default, and VS Code has experimental grouping — but you still pay tokens per search and reload, and Cursor, Windsurf and Gemini CLI load definitions upfront. Reducing the exposed tool set cuts the cost in every client.

// METHOD & REVIEW

How these numbers were measured.

Serialisation

Each tool is serialised as a tools/list entry — name, description, input schema — from the schemas in the PolicyLayer scan database. Clients differ slightly in framing, so treat counts as close estimates.

Tokeniser

tiktoken o200k_base (GPT-4o/o-series). Anthropic's current tokeniser isn't published, so Claude's exact counts will differ; for English text and JSON schemas the totals are close enough to treat these as estimates.

Deferred loading

Some clients now defer schema loading (Claude Code's tool search; VS Code experimental grouping). You still pay per search and reload — and Cursor, Windsurf and Gemini CLI load everything upfront.

Computed 07-06-2026 from the PolicyLayer scan database over all 42 catalogued GoldenMatch tools. Counts refresh with every site build.

Expose only the tools you use — the rest never enter your context.

A PolicyLayer grant scopes GoldenMatch to the tools you actually allow. Ungranted definitions never load, and every call that does run is checked against policy first.

SCOPE YOUR GRANT →

Free to start. No card required.

4,600+ MCP servers and 31,000+ tools scanned and risk-classified.