// GLOSSARY -- AI AGENT SECURITY

What is a Compositional Fragment Trap?

1 min read Updated Apr 5, 2026

A systemic trap that partitions a malicious payload into semantically benign fragments distributed across multiple agents, which only reconstitute into a full attack when the fragments are aggregated through multi-agent collaboration.

WHY IT MATTERS

Each fragment passes safety checks individually — 'retrieve this data,' 'format this output,' 'send this message.' None is malicious alone. But combined in sequence across agents, they form an attack: retrieve sensitive data, format it for exfiltration, send it to an external endpoint.

This exploits the gap between per-agent safety checks and system-level security. No individual agent violates its constraints, but the emergent multi-agent workflow does.

HOW POLICYLAYER USES THIS

PolicyLayer's per-agent scoping limits what each agent can do independently. Combined with category restrictions (blocking exfiltration-pattern tool calls), it makes fragment assembly harder even across collaborating agents.

See the MCP Security reference →

FREQUENTLY ASKED QUESTIONS

How do you detect this?

It requires system-level analysis of multi-agent workflows, not just per-agent monitoring. Cross-agent audit trails that track data flow across agent boundaries can reveal compositional attacks.

What is a Compositional Fragment Trap?

WHY IT MATTERS

HOW POLICYLAYER USES THIS

FREQUENTLY ASKED QUESTIONS

FURTHER READING

Take your agents live. Without losing control.

What is a Compositional Fragment Trap?

WHY IT MATTERS

HOW POLICYLAYER USES THIS

FREQUENTLY ASKED QUESTIONS

RELATED TERMS

FURTHER READING

Take your agents live. Without losing control.