Medium Risk

flagpost_add_flag

Adds a new feature flag definition to an existing flagpost.config.ts file.

How to control flagpost_add_flag ↓

What flagpost_add_flag does on Clocktower

AI agents use flagpost_add_flag to create or update resources in Clocktower — usually the action step of a workflow, after the agent has gathered context. Every call changes real data in your Clocktower environment.

Medium Risk

Why flagpost_add_flag needs a policy

This tool creates or modifies configuration data (feature flag definitions) in a reversible manner. It does not delete data (Destructive), execute arbitrary operations (Execute), or involve financial transactions (Financial). The severity is medium because misconfigured feature flags could affect application behavior and user experience, but changes are reversible via updates or deletion of the flag definition.

From the tool's definition Tool description states it 'Adds a new feature flag definition' to a configuration file. The verb 'Adds' indicates creation of new data within a file.

Documented attack patterns abuse exactly the kind of access flagpost_add_flag gives an agent:

How to control flagpost_add_flag

PolicyLayer is an MCP gateway — it sits between your AI agents and Clocktower, and nothing reaches the server without passing your rules. This is the rule we recommend for flagpost_add_flag:

policy.json
{
  "version": "1",
  "default": "deny",
  "tools": {
    "flagpost_add_flag": {
      "limits": [
        {
          "counter": "flagpost_add_flag_rate",
          "window": "minute",
          "max": 30,
          "scope": "grant"
        }
      ]
    }
  }
}

flagpost_add_flag stays usable, but capped — an agent stuck in a loop can't make hundreds of changes a minute. Everything else on the server is denied unless you say otherwise.

  1. Create a free account and register Clocktower — nothing to install.
  2. Add this policy — paste it, or build it visually.
  3. Point your MCP client (Claude, Cursor, anything) at your gateway URL.
LIMIT THIS TOOL →

Free to start. No card required.

Related tools and policies

Go deeper

Questions about flagpost_add_flag

What does the flagpost_add_flag tool do? +

Adds a new feature flag definition to an existing flagpost.config.ts file. It is categorised as a Write tool in the Clocktower MCP Server, which means it can create or modify data. Consider rate limits to prevent runaway writes.

How do I enforce a policy on flagpost_add_flag? +

Register the Clocktower MCP server in PolicyLayer and add a rule for flagpost_add_flag: allow, deny, rate-limit, or require approval. Point your MCP client at the PolicyLayer proxy URL and the rule is enforced on every call, before it reaches Clocktower. Nothing to install.

What risk level is flagpost_add_flag? +

flagpost_add_flag is a Write tool with medium risk. Write tools should be rate-limited to prevent accidental bulk modifications.

Can I rate-limit flagpost_add_flag? +

Yes. Add a rate_limit block to the flagpost_add_flag rule in your PolicyLayer policy. For example, setting max: 10 and window: 60 limits the tool to 10 calls per minute. Rate limits are tracked per agent session and reset automatically.

How do I block flagpost_add_flag completely? +

Set action: deny in the PolicyLayer policy for flagpost_add_flag. The AI agent will receive a policy violation error and cannot call the tool. You can also include a reason field to explain why the tool is blocked.

What MCP server provides flagpost_add_flag? +

flagpost_add_flag is provided by the Clocktower MCP server (sathergate/sathergate-toolkit). PolicyLayer sits as a proxy in front of this server to enforce policies before tool calls reach the server.

Enforce policy on every Clocktower tool call.

Start from Clocktower, add the rest of your stack, and see everything your agents can call. Then put policy on all of it.

Free to start. No card required.

26 Clocktower tools catalogued and risk-classified — across an index of 43,000+ MCP servers.

// GET IN TOUCH

Have a question or want to learn more? Send us a message.

Message sent.

We'll get back to you soon.