Home / Token cost / Mila

The Mila MCP server costs 3,441 tokens before the first call.

Connect Mila and its 23 tool definitions are loaded into the model's context on every request — 1.7% of a 200k window spent before your agent does anything.

QUICK ANSWER The Mila MCP server's tool definitions consume 3,441 tokens — around the median MCP server (1,905 tokens). A scoped grant exposing only the tools you use cuts that roughly in proportion.

MEASURED FROM SCHEMAS 23 tools · 3,441 tokens · 1.7% of 200k · 0.3% of 1M Method →

What that buys before your agent starts working.

Tool definitions are overhead: they occupy context on every request and compete with your code, documents and conversation history for the same window.

200K WINDOW 1.7%
1M WINDOW 0.3%

Corpus context: Mila ranks #1233 of 3,213 measured MCP servers by definition cost. The median is 1,905 tokens, p90 is 7,952, and the heaviest (Fusionauth) is 183,337 — 92% of a 200k window on its own.

Where the 3,441 tokens go.

Each row is one tool definition as a tools/list entry — name, description and input schema — counted with o200k_base. Average: 150 tokens per tool.

ToolCategoryTokens% of server
update_sheet_tab Write 337 9.8%
create_sheet Write 334 9.7%
append_slides Read 230 6.7%
create_slide_presentation Write 221 6.4%
append_rows Read 210 6.1%
list_documents Read 210 6.1%
list_sheets Read 202 5.9%
list_slides Read 191 5.6%
update_slide_presentation Write 189 5.5%
create_sheet_tab Write 179 5.2%
create_document Write 140 4.1%
update_document Write 110 3.2%
get_sheet_tab Read 100 2.9%
delete_sheet_tab Destructive 97 2.8%
append_to_document Read 93 2.7%
update_sheet Write 90 2.6%
get_sheet Read 83 2.4%
get_document Read 82 2.4%
get_slide_presentation Read 77 2.2%
delete_sheet Destructive 76 2.2%
delete_slide_presentation Destructive 73 2.1%
delete_document Destructive 70 2.0%
list_servers Read 47 1.4%

Most agents use a handful of these tools. They pay for all 23.

A PolicyLayer grant exposes only the tools you allow — ungranted definitions are filtered out of the tool list, so they never enter the context window. Estimates below assume typical-weight tools (150 tokens each).

Grant scopeDefinition costReduction
All 23 tools (no gateway) 3,441 tokens
3 granted tools ~449 tokens −87%
5 granted tools ~748 tokens −78%
10 granted tools ~1,496 tokens −57%

Mila token-cost questions.

How many tokens does the Mila MCP server use?+

Its 23 tool definitions total 3,441 tokens — 1.7% of a 200k context window — measured with tiktoken o200k_base over the serialised tools/list payload. Exact counts vary slightly by client and model.

Why does Mila consume tokens before I send a message?+

MCP clients load every connected server's tool definitions — name, description, and input schema — into the model's context so it knows what it can call. That payload is charged against your context window on every request, whether or not a tool is used.

How do I reduce Mila's token usage?+

Expose fewer tools. A PolicyLayer grant scopes Mila to only the tools you allow — ungranted definitions are filtered out of the tool list, so they never enter the context window. A grant of 3 typical tools costs roughly 449 tokens, a 87% reduction.

Does deferred tool loading fix this?+

Partially, in some clients. Claude Code defers MCP tool schemas behind a tool-search step by default, and VS Code has experimental grouping — but you still pay tokens per search and reload, and Cursor, Windsurf and Gemini CLI load definitions upfront. Reducing the exposed tool set cuts the cost in every client.

How these numbers were measured.

01
Serialisation

Each tool is serialised as a tools/list entry — name, description, input schema — from the schemas in the PolicyLayer scan database. Clients differ slightly in framing, so treat counts as close estimates.

02
Tokeniser

tiktoken o200k_base (GPT-4o/o-series). Anthropic's current tokeniser isn't published, so Claude's exact counts will differ; for English text and JSON schemas the totals are close enough to treat these as estimates.

03
Deferred loading

Some clients now defer schema loading (Claude Code's tool search; VS Code experimental grouping). You still pay per search and reload — and Cursor, Windsurf and Gemini CLI load everything upfront.

Computed 07-06-2026 from the PolicyLayer scan database over all 23 catalogued Mila tools. Counts refresh with every site build.

Expose only the tools you use — the rest never enter your context.

A PolicyLayer grant scopes Mila to the tools you actually allow. Ungranted definitions never load, and every call that does run is checked against policy first.

Free to start. No card required.

4,600+ MCP servers and 31,000+ tools scanned and risk-classified.

// GET IN TOUCH

Have a question or want to learn more? Send us a message.

Message sent.

We'll get back to you soon.