// GLOSSARY -- POLICY ENFORCEMENT

What is an Agent Sandbox?

2 min read Updated Mar 8, 2026

An agent sandbox is a controlled environment that constrains which tools an AI agent can access and how it can use them — preventing the agent from affecting systems outside its authorised scope.

WHY IT MATTERS

You would not deploy untested code to production without constraints. You should not deploy untested agents with unrestricted tool access. Sandboxes provide boundaries where agents can operate without the ability to cause unintended harm.

Agent sandboxing via policy enforcement is more flexible than traditional isolation. Rather than running agents in isolated containers (which limits their usefulness), policy-based sandboxing lets agents connect to real systems but restricts what they can do with those systems.

For example, a sandboxed coding agent might have read access to the entire codebase but write access only to a feature branch directory, with shell execution limited to test commands. The agent interacts with real systems but within tightly controlled boundaries.

HOW POLICYLAYER USES THIS

PolicyLayer creates tool-level sandboxes through YAML policies. Define which tools the agent can access, constrain arguments to safe values, and enforce rate limits — all without modifying the agent or the MCP server. A restrictive policy effectively sandboxes the agent: it can only do what the policy permits, nothing more. Fail-closed by default means any tool not explicitly allowed is denied.

Read the policy-writing guide →

FREQUENTLY ASKED QUESTIONS

How does policy-based sandboxing differ from container isolation?

Container isolation restricts the agent's environment (filesystem, network). Policy-based sandboxing via PolicyLayer restricts the agent's tool access — which is more granular and preserves the agent's ability to interact with real systems within defined boundaries.

Can I use PolicyLayer to create a 'dry run' sandbox?

Yes. You can configure PolicyLayer to log tool calls without forwarding them to the server — letting you observe what the agent would do without any real-world effects. This is useful for testing new agents before deploying them.

When is an agent ready to leave the sandbox?

When it consistently respects policy boundaries, handles denied tool calls gracefully, and has been tested against adversarial scenarios (prompt injection, unexpected server responses). Progressively relax the policy as confidence grows.

What is an Agent Sandbox?

WHY IT MATTERS

HOW POLICYLAYER USES THIS

FREQUENTLY ASKED QUESTIONS

FURTHER READING

Take your agents live. Without losing control.

What is an Agent Sandbox?

WHY IT MATTERS

HOW POLICYLAYER USES THIS

FREQUENTLY ASKED QUESTIONS

RELATED TERMS

FURTHER READING

Take your agents live. Without losing control.