// COMPLIANCE

EU AI Act compliance for MCP deployments

Where the AI Act actually lands on AI-agent fleets — which duties are live today, which are conditional on a high-risk use case, and the deployer-controlled surface a gateway gives the assigned human overseer.

QUICK ANSWER

The AI Act regulates AI systems, not protocols, and most agent fleets (coding, DevOps, back-office) are not high-risk. For ordinary non-high-risk deployments the broadly applicable live duty is Art. 4 AI literacy (the Art. 5 prohibited-practice rules are also live where a use case falls into them). The heavy Art. 26 deployer duties bite only for Annex III high-risk uses, now dated 2 December 2027 under the Digital Omnibus. A gateway’s part: the call-level control surface and the deployer-held evidence behind those duties.

POLICYLAYER SCAN DATA 577,564 tools · 53,229 servers · 29,439 destructive · 50,046 execute code

// WHY MCP TRAFFIC IS IN SCOPE

The Act regulates AI systems. Most agent fleets are not high-risk.

The Act binds providers and deployers of AI systems — your duties attach to your agents and, above all, to your risk tier, so get the tier right first: it decides which duties apply at all. Three things frame where an MCP fleet lands.

01

You are almost certainly a deployer

Building or branding an AI system makes you a provider; using one under your own authority makes you a deployer. Companies wiring agents to MCP servers are deployers. Routing through a gateway will not usually amount to a substantial modification by itself — though high-risk deployments should still assess whether any change alters intended purpose, performance or risk (Art. 25).

02

High-risk is a closed list

For agent fleets, high-risk in practice means Annex III: biometrics, critical infrastructure, education, employment, essential services, law enforcement, migration, justice. Coding assistants, DevOps agents and back-office automation are not on it. For them the applicable duties are Art. 4 literacy, plus Art. 50 transparency and the Art. 5 prohibitions where relevant.

03

Only the high-risk slice carries Art. 26

If an agent is pointed at an Annex III decision — screening applicants, scoring credit — Art. 26 deployer duties engage, now from 2 December 2027 under the Digital Omnibus (provisional agreement 7 May 2026, pending formal adoption — dates may shift). For that slice, call-level logging and an intervention surface are exactly what the deployer needs.

// OBLIGATION MAPPING

Which obligations touch agent traffic, and how.

For each obligation: the question it raises for an agent fleet, the gap when there is no call-level control, and where the gateway fits. Read the tiers honestly — for non-high-risk fleets the broadly applicable live duty is Art. 4 (with the Art. 5 prohibitions applying where relevant); the Art. 26 family is conditional on a high-risk use case and now dated 2 December 2027.

Art. 4 AI literacy (live now, every deployer)

WHAT IT REQUIRES

Providers and deployers take measures to ensure, to their best extent, a sufficient level of AI literacy among staff operating AI systems. In force since February 2025. The Digital Omnibus would soften the wording to “take measures to support”, but that amendment is not yet formally adopted.

THE QUESTION FOR AGENT FLEETS

Do the staff running MCP-connected agents actually know what those agents can do — the real action surface, not just the read paths?

THE GAP WITHOUT CALL-LEVEL CONTROL

Teams wire agents to server bundles without an inventory of the tools they have just enabled, so literacy measures have nothing concrete to point at.

WHERE THE GATEWAY FITS

The catalogue enumerates every connected tool and its risk class — a concrete artefact that supports the literacy duty. It does not discharge it; training and process remain yours.

For ecosystem context: across the public catalogue, 577,564 tools on 53,229 servers — 130,712 write, 29,439 destructive, 50,046 execute, 10,076 financial. Connecting a server routinely grants far more than read.

Art. 26(2) Human oversight assignment (high-risk, from Dec 2027)

WHAT IT REQUIRES

Deployers assign human oversight of a high-risk system to natural persons with the necessary competence, training and authority, and with support.

THE QUESTION FOR AGENT FLEETS

Does the person you assign to oversee a high-risk agent have an actual surface to oversee — something to see and something to pull?

THE GAP WITHOUT CALL-LEVEL CONTROL

Raw MCP gives the overseer nothing: no live view of what the agent invoked, no lever to stop it short of editing configs.

WHERE THE GATEWAY FITS

The gateway equips the assigned human — live call visibility, deny rules, per-person revocation. It supports oversight designed under Art. 14; it is not the oversight itself.

Art. 26(5) Monitoring & suspension (high-risk, from Dec 2027)

WHAT IT REQUIRES

Deployers monitor operation of the high-risk system against its instructions for use, inform the provider or authorities where risks arise, and suspend use where appropriate.

THE QUESTION FOR AGENT FLEETS

Can you see what a high-risk agent actually invoked — and stop it without ripping out configs across your fleet?

THE GAP WITHOUT CALL-LEVEL CONTROL

There is no deployer-side record of what the agent called, and no suspend lever short of tearing down the connection by hand.

WHERE THE GATEWAY FITS

Every call is evaluated live; disabling a grant or flipping a tool to deny centrally is the practical suspension mechanism the duty asks for.
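
As an illustration of what a central suspend lever means in practice, here is a minimal Python sketch. It is not PolicyLayer's implementation: the `ControlPlane` class, its method names, the grant names and the overseer address are all hypothetical, and the store is held in memory purely for demonstration. The point it shows is that one central change alters the verdict for every later call, and that each intervention is itself recorded.

```python
from dataclasses import dataclass, field


@dataclass
class ControlPlane:
    """Hypothetical in-memory stand-in for a central policy store."""
    allowed_tools: set
    active_grants: set
    suspended: bool = False
    # Overseer interventions, kept as (who, action, target) for later evidence.
    actions: list = field(default_factory=list)

    def revoke_grant(self, grant: str, by: str) -> None:
        self.active_grants.discard(grant)
        self.actions.append((by, "revoke_grant", grant))

    def deny_tool(self, tool: str, by: str) -> None:
        self.allowed_tools.discard(tool)
        self.actions.append((by, "deny_tool", tool))

    def suspend(self, by: str) -> None:
        self.suspended = True
        self.actions.append((by, "suspend", "*"))

    def check(self, grant: str, tool: str) -> str:
        """Evaluate one call against the current central state."""
        if self.suspended or grant not in self.active_grants:
            return "deny"
        return "allow" if tool in self.allowed_tools else "deny"


plane = ControlPlane({"get_record", "update_record"}, {"grant-alice", "grant-bob"})
before = plane.check("grant-alice", "update_record")
plane.deny_tool("update_record", by="overseer@example.com")
after = plane.check("grant-alice", "update_record")
```

Because `check` reads the shared state on every call, no client configuration has to change for the intervention to take effect.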

Art. 26(6) Log retention (high-risk, from Dec 2027)

WHAT IT REQUIRES

Deployers keep logs automatically generated by the high-risk system, to the extent the logs are under their control, for at least six months.

THE QUESTION FOR AGENT FLEETS

When the authority asks for six months of logs under your control, do you hold anything to retain?

THE GAP WITHOUT CALL-LEVEL CONTROL

In a default setup the deployer holds nothing — the agent-to-server traffic is ephemeral and unrecorded.

WHERE THE GATEWAY FITS

The audit log (grant, tool, argument keys, deciding rule, verdict) is a deployer-controlled automatic log, with retention configurable to cover the six-month minimum.
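
To make the record shape and the retention floor concrete, the following Python sketch models one audit entry with the five fields named above and a purge check that refuses any retention window shorter than the minimum. Everything here is an assumption for illustration: the `AuditRecord` class, the helper names, and especially the 184-day constant, which is simply the longest span of six consecutive calendar months and is not a figure taken from the Act. The real export format and retention settings may differ.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone

# Assumption: 184 days (the longest run of six calendar months) stands in for
# "at least six months". Confirm the period that applies to you with counsel.
MIN_RETENTION = timedelta(days=184)


@dataclass(frozen=True)
class AuditRecord:
    timestamp: datetime
    grant: str
    tool: str
    argument_keys: tuple  # keys only, so argument values are not stored
    deciding_rule: str
    verdict: str


def make_record(grant, tool, args, deciding_rule, verdict, now):
    """Build a record from a call, keeping only the argument keys."""
    return AuditRecord(now, grant, tool, tuple(sorted(args)), deciding_rule, verdict)


def may_purge(record, now, retention=MIN_RETENTION):
    """True only once the record is older than the configured retention."""
    if retention < MIN_RETENTION:
        raise ValueError("retention is shorter than the assumed six-month floor")
    return now - record.timestamp > retention


rec = make_record(
    "grant-alice", "update_record", {"status": "final", "id": "r1"},
    "update_record.deny_if[0]", "deny",
    datetime(2026, 1, 1, tzinfo=timezone.utc),
)
```

Rejecting a too-short window at the purge step, rather than trusting configuration, is one way to keep a misconfigured retention setting from silently deleting evidence.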

Art. 12 Record-keeping (high-risk; chiefly a provider duty)

WHAT IT REQUIRES

High-risk systems must technically allow automatic recording of events over their lifetime, enabling traceability — including the monitoring required under Art. 26(5). This is primarily a duty on the provider who builds the system.

THE QUESTION FOR AGENT FLEETS

Where high-risk applies, can the events your agent generated be traced — and does the deployer hold a usable call-level record?

THE GAP WITHOUT CALL-LEVEL CONTROL

Without a mediation point there is no call-level record to make traceability or the Art. 26(5) monitoring practical.

WHERE THE GATEWAY FITS

The gateway produces the call-level record that makes Art. 12-style traceability and Art. 26(6) retention practical for the deployer. It does not itself satisfy Art. 12, which sits with the system’s provider.

Art. 14 Human oversight (high-risk; a design requirement on the system)

WHAT IT REQUIRES

High-risk systems are designed so natural persons can effectively oversee them — intervene, interrupt via a stop control, override or disregard output — guarding against automation bias.

THE QUESTION FOR AGENT FLEETS

Can the assigned human actually intervene in a running agent, or only watch after the fact?

THE GAP WITHOUT CALL-LEVEL CONTROL

A purely automated gate is a technical control, not human oversight — and on its own it gives the human no place to step in.

WHERE THE GATEWAY FITS

The gateway adds a real intervention surface (a deny rule, a revoked grant, a central stop) that operationalises the human’s oversight. The automated policy gate on its own remains a technical control, not the oversight itself.

// EXAMPLE POLICIES

Policies that give the deployer a control surface.

Illustrative policies — not complete compliance controls on their own.

Operating envelope — per the instructions for use Art. 26(1) · Art. 26(5)

Encode the reviewed tool set as the allowed envelope; everything outside it stays denied by default. This keeps the agent operating within its instructions for use — and the deny verdicts are themselves deployer-held records.

policy.json

{
  "version": "1",
  "default": "deny",
  "tools": {
    "list_candidates": {},
    "get_candidate_profile": {},
    "search_records": {}
  }
}
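
A rough Python sketch of how a deny-by-default envelope like this could be evaluated. It assumes, based only on the description above, that a tool listed with an empty rule object is allowed and that anything unlisted falls through to `default`. The `decide` function and the `delete_candidate` tool name are hypothetical and are not part of PolicyLayer's API.

```python
import json

# The policy from the example above, parsed as-is.
POLICY = json.loads("""
{
  "version": "1",
  "default": "deny",
  "tools": {
    "list_candidates": {},
    "get_candidate_profile": {},
    "search_records": {}
  }
}
""")


def decide(policy: dict, tool: str) -> str:
    """Return "allow" for a listed tool, otherwise the policy default."""
    if tool in policy.get("tools", {}):
        return "allow"
    # Assumption: a policy with no "default" key is treated as deny.
    return policy.get("default", "deny")


reviewed = decide(POLICY, "list_candidates")
unreviewed = decide(POLICY, "delete_candidate")  # hypothetical tool, not in the envelope
```

Under these assumptions the reviewed tool is allowed and the unreviewed one is denied without any rule having to name it.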

Containment — high-impact tools withheld Art. 26(5) · Art. 14

Routine reads and updates run; updates to finalised records are denied, and destructive or payment tools are simply not granted. The deny verdicts and the withheld grants are both deployer-held records — and the policy is the overseer’s intervention surface.

policy.json

{
  "version": "1",
  "default": "deny",
  "tools": {
    "get_record": {},
    "update_record": {
      "deny_if": [
        {
          "conditions": [
            { "path": "args.status", "op": "eq", "value": "final" }
          ]
        }
      ]
    }
  }
}

See Writing policies for the policy format, operators, and quota shapes.
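
For readers who want to see the containment policy's logic spelled out, here is a small Python sketch of one plausible evaluation. Several things are assumptions rather than documented behaviour: that `path` is a dotted lookup into the call, that conditions inside one block must all hold, that any matching block in `deny_if` is enough to deny, and that only the `eq` operator is handled. The function names and the rule labels it returns are hypothetical.

```python
from typing import Any

POLICY = {
    "version": "1",
    "default": "deny",
    "tools": {
        "get_record": {},
        "update_record": {
            "deny_if": [
                {"conditions": [{"path": "args.status", "op": "eq", "value": "final"}]}
            ]
        },
    },
}

_MISSING = object()


def resolve(call: dict, path: str) -> Any:
    """Walk a dotted path such as "args.status"; return _MISSING if absent."""
    node: Any = call
    for key in path.split("."):
        if not isinstance(node, dict) or key not in node:
            return _MISSING
        node = node[key]
    return node


def condition_holds(call: dict, cond: dict) -> bool:
    if cond["op"] != "eq":
        raise ValueError(f"operator not covered by this sketch: {cond['op']}")
    return resolve(call, cond["path"]) == cond["value"]


def evaluate(policy: dict, tool: str, args: dict) -> tuple:
    """Return (verdict, deciding_rule) for one call."""
    rule = policy["tools"].get(tool)
    if rule is None:
        return policy["default"], "default"
    call = {"tool": tool, "args": args}
    for index, block in enumerate(rule.get("deny_if", [])):
        # Assumption: all conditions in a block must hold for it to match.
        if all(condition_holds(call, c) for c in block["conditions"]):
            return "deny", f"{tool}.deny_if[{index}]"
    return "allow", f"tools.{tool}"
```

Returning the deciding rule alongside the verdict mirrors the audit fields described earlier, so each decision can be traced back to the policy line that produced it.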

// AUDIT EVIDENCE

What a market-surveillance authority could ask a deployer for.

Where a high-risk use case applies, an authority can request deployer-held proof. The artefact a gateway deployment hands over for each:

What the auditor asks for	What the gateway exports
Six months of system logs under deployer control (Art. 26(6))	Audit log export (grant, tool, argument keys, deciding rule, verdict), retained for the required period.
The oversight assignment and the surface the overseer uses (Art. 26(2))	The dashboard, deny rules and revocation records the assigned human acts through.
Monitoring records and a suspension capability (Art. 26(5))	The live verdict stream plus central disable — the evidence that monitoring and suspension exist.
Operation kept within the instructions for use (Art. 26(1))	The versioned policy encoding the reviewed operating envelope, with its change history.
Tool and system inventory supporting literacy measures (Art. 4)	The catalogue of connected tools with risk classes — the artefact your literacy programme references.

// FAQ

EU AI Act and MCP questions.

Does the EU AI Act apply to MCP or AI agents?

The Act regulates AI systems, not protocols. MCP is the wire; the agent is the AI system, and you are its deployer. Which duties apply depends on your risk tier: AI literacy (Art. 4) and the Art. 5 prohibited-practice rules are live now, transparency duties (Art. 50) apply from 2 August 2026, and the Art. 26 high-risk deployer duties bite only for Annex III use cases, now dated 2 December 2027 under the Digital Omnibus.

Am I a provider or a deployer under the AI Act?

If you build, brand or place an AI system on the market you are a provider; if you use one under your own authority you are a deployer — which is most companies running agents. You can become a provider via Art. 25 by rebranding, substantially modifying a high-risk system, or re-purposing a system into a high-risk use. Routing your traffic through a gateway will not usually be a substantial modification by itself, but high-risk deployments should assess whether any change alters intended purpose, performance or risk.

Is my coding or DevOps agent high-risk?

Almost certainly not. High-risk is the closed Annex III list — biometrics, critical infrastructure, education, employment, essential services, law enforcement, migration, justice. A coding assistant, DevOps agent or back-office bot is not on it. It becomes high-risk only if pointed at an Annex III decision such as screening candidates or scoring credit — and even Annex III systems can escape high-risk under Art. 6(3) where they perform narrow procedural tasks without materially influencing the outcome.

What logs does the AI Act actually require?

Mandated logging applies only to high-risk systems: automatic event logging by the system (Art. 12, a provider duty) and deployer retention of at least six months (Art. 26(6)). For everything else there is no mandated logging. A call-level audit trail is still the practical evidence base — and the thing you would already hold if your risk tier ever changed.

Does the AI Act apply outside the EU?

Yes — it is extraterritorial. Art. 2 catches third-country providers and deployers where the output of the AI system is used in the Union. GPAI model-provider duties under Chapter V, by contrast, sit with the model providers — OpenAI, Anthropic, Google — not with deployers and not with PolicyLayer.

// BEFORE & AFTER

Raw MCP versus gateway-mediated MCP.

Default setup	Through the gateway
One shared upstream API key on every laptop	Per-person scoped grant tokens, revocable individually
No record of what agents called	Per-call audit log: grant, tool, argument keys, rule, verdict
Every tool on a server is callable	Deny-by-default — each tool and argument explicitly granted
Access rules scattered across client configs	One central, version-controlled policy

PolicyLayer doesn’t certify your organisation — it gives your compliance team enforceable controls and exportable evidence for the MCP slice of the audit.

// SOURCES & REVIEW

Primary sources.

Last reviewed 04-06-2026 by the PolicyLayer research team. This guide maps how the framework intersects with MCP deployments — it is not legal advice.
