synthesize_text

Convert text to speech using the configured TTS engine (Chatterbox Turbo or Kokoro). For Chatterbox Turbo, use paralinguistic tags directly in text for expressive speech: [laugh], [sigh], [cough], [chuckle], [gasp], [groan], [clear throat], [sniff], [shush]. Supports voice cloning via reference a...

Part of the Local Voice MCP server. Enforce policies on this tool with Intercept, the open-source MCP proxy.

@codecraftersllc/local-voice-mcp Destructive Risk 4/5

AI agents may call synthesize_text to permanently remove or destroy resources in Local Voice. Without a policy, an autonomous agent could delete critical data in a loop with no way to undo the damage. Intercept blocks destructive tools by default and requires explicit human approval before enabling them.

Destructive tools permanently remove data. Block by default. Only enable with explicit approval workflows.

io-github-codecraftersllc-local-voice-mcp.yaml
tools:
  synthesize_text:
    rules:
      - action: deny
        reason: "Blocked by default — enable with approval"

See the full Local Voice policy for all 3 tools.

Tool Name: synthesize_text
Category: Destructive
Risk Level: Critical

synthesize_text is one of the critical-risk operations in Local Voice. For the full severity-focused view — only the critical-risk tools with their recommended policies — see the breakdown for this server, or browse all critical-risk tools across every MCP server.

What does the synthesize_text tool do?

Convert text to speech using the configured TTS engine (Chatterbox Turbo or Kokoro). For Chatterbox Turbo, use paralinguistic tags directly in text for expressive speech: [laugh], [sigh], [cough], [chuckle], [gasp], [groan], [clear throat], [sniff], [shush]. Supports voice cloning via reference audio. It is categorised as a Destructive tool in the Local Voice MCP Server, which means it can permanently delete or destroy data. Block it by default and require explicit approval.

How do I enforce a policy on synthesize_text?

Add a rule in your Intercept YAML policy under the tools section for synthesize_text. You can allow, deny, rate-limit, or validate arguments. Then run Intercept as a proxy in front of the Local Voice MCP server.
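
To make the "validate arguments" option more concrete, here is a minimal Python sketch of the kind of check a proxy could run before forwarding a call. It is not Intercept's implementation or policy syntax. The argument name `text`, the helper `validate_text_arg`, and the 500-character limit are all assumptions made for illustration.

```python
def validate_text_arg(args, max_chars=500):
    """Return a list of problems found in a call's arguments; empty means OK.

    Assumption: the tool takes a string argument named "text". The real
    argument names for synthesize_text are not given on this page.
    """
    problems = []
    text = args.get("text")  # hypothetical argument name
    if not isinstance(text, str) or not text.strip():
        problems.append("text must be a non-empty string")
    elif len(text) > max_chars:
        problems.append(f"text exceeds {max_chars} characters")
    return problems
```

A proxy built along these lines would forward the call only when the returned list is empty, and would otherwise report the problems back to the agent.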

What risk level is synthesize_text?

synthesize_text is a Destructive tool with critical risk. Critical-risk tools should be blocked by default and only enabled with explicit human approval.

Can I rate-limit synthesize_text?

Yes. Add a rate_limit block to the synthesize_text rule in your Intercept policy. For example, setting max: 10 and window: 60 limits the tool to 10 calls per minute. Rate limits are tracked per agent session and reset automatically.
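
The behaviour described above (10 calls per 60 seconds, tracked per session, resetting on its own) can be sketched in Python as a fixed-window counter. This illustrates the semantics only. Whether Intercept uses a fixed or a sliding window is not stated here, and the class name `FixedWindowLimiter` and its parameters are hypothetical.

```python
import time
from collections import defaultdict


class FixedWindowLimiter:
    """Allow at most `max_calls` per `window` seconds for each session.

    Assumption: a fixed window that starts at a session's first call.
    """

    def __init__(self, max_calls=10, window=60, clock=time.monotonic):
        self.max_calls = max_calls
        self.window = window
        self.clock = clock  # injectable so the limiter can be tested
        # session id -> [window_start, calls_in_window]
        self._state = defaultdict(lambda: [None, 0])

    def allow(self, session_id):
        now = self.clock()
        state = self._state[session_id]
        # Start a fresh window on the first call or once the old one expires.
        if state[0] is None or now - state[0] >= self.window:
            state[0], state[1] = now, 0
        if state[1] >= self.max_calls:
            return False
        state[1] += 1
        return True
```

With the defaults, an eleventh call from the same session inside one window is refused, while a different session keeps its own independent count.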

How do I block synthesize_text completely?

Set action: deny in the Intercept policy for synthesize_text. The AI agent will receive a policy violation error and cannot call the tool. You can also include a reason field to explain why the tool is blocked.
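
As a rough illustration of how a deny rule with a reason might be evaluated, the Python sketch below mirrors the shape of the YAML policy shown earlier on this page as a plain dictionary. The function `evaluate`, the fallback reason string, and the choice to allow tools that have no rules are assumptions, not Intercept's actual behaviour.

```python
# Same structure as the YAML policy on this page: tools -> name -> rules.
POLICY = {
    "tools": {
        "synthesize_text": {
            "rules": [
                {"action": "deny", "reason": "Blocked by default, enable with approval"},
            ]
        }
    }
}


def evaluate(policy, tool_name):
    """Return (allowed, reason) for a tool call under the given policy."""
    rules = policy.get("tools", {}).get(tool_name, {}).get("rules", [])
    for rule in rules:
        if rule.get("action") == "deny":
            # The reason is what the agent would see in the violation error.
            return False, rule.get("reason", "denied by policy")
    # Assumption: a tool with no deny rule is allowed.
    return True, None
```

Calling `evaluate(POLICY, "synthesize_text")` yields a refusal together with the configured reason, which is the information a proxy would return to the agent instead of forwarding the call.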

What MCP server provides synthesize_text?

synthesize_text is provided by the Local Voice MCP server (@codecraftersllc/local-voice-mcp). Intercept sits as a proxy in front of this server to enforce policies before tool calls reach the server.

Enforce policies on Local Voice

Open source. One binary. Zero dependencies.

npx -y @policylayer/intercept
github.com/policylayer/intercept →