ugc_moderation_classifier

SERVERGapup Mcp SOURCEhttps://mcp.gapup.io/mcp

High RISK CLASS

Category Execute

Parameters 51 required

Recommended Rate-limitedsee the rule below

Registry record Grade F, identity unverified Pull the record →

This record as markdown: /tools/io-github-getgapup-gapup-mcp/ugc-moderation-classifier.md

WHAT IT DOES

What ugc_moderation_classifier does on Gapup Mcp

AI agents invoke ugc_moderation_classifier to trigger actions in Gapup Mcp. What it does depends on the arguments the agent supplies, and its effects often reach beyond the immediate call: builds kicked off, notifications sent, workflows started.

Parameter	Type	Required	Description
`lang`	string	—	Language override. If omitted, language is auto-detected.
`async`	boolean	—	If true, returns a job_id immediately (<200ms) instead of waiting for the result. Poll the result with job_result(job_id). Use for slow tools to avoid client ti
`content`	string	Yes	Text content to moderate (comment, review, post, chat message).
`policies`	array	—	Policies to check. Default: all 9 policies.
`content_type`	string	—	Type of content. Affects recommended_action heuristic. Default: comment.

Parameters from the server's own tool schema.

RISK

Why ugc_moderation_classifier is rated High

This tool processes and analyzes input text content to classify it against multiple policy categories. It executes a classification/analysis pipeline on provided content (running detection logic across hate, sexual, violence, self_harm, harassment, scam policies).

From the tool's definition Detects policy violations in text content across 9 policies and 12 languages without external API calls

Risk signalsAccepts raw HTML/template content (content)

Attacks that exploit this kind of access

RECOMMENDED RULE

The rule that runs ugc_moderation_classifier safely

PolicyLayer is an MCP gateway: it sits between your AI agents and Gapup Mcp, and checks every tool call against a rule you set before the call runs. Nothing changes on the server itself. For ugc_moderation_classifier, this is the rule to start with:

ugc_moderation_classifier Rate-limited

ugc_moderation_classifier stays usable, but rate-capped: a runaway agent can't fire it dozens of times a minute. Everything else on the server is denied unless you say otherwise.

View as policy code

policy.json

{
  "version": "1",
  "default": "deny",
  "tools": {
    "ugc_moderation_classifier": {
      "limits": [
        {
          "counter": "ugc_moderation_classifier_rate",
          "window": "minute",
          "max": 10,
          "scope": "grant"
        }
      ]
    }
  }
}

RATE-LIMIT THIS TOOL → Instant setup, no code required.

The button opens the PolicyLayer dashboard: create your workspace, connect Gapup Mcp, apply this rule, and every ugc_moderation_classifier call is checked against it from then on.

FAQ

Questions about ugc_moderation_classifier

What does the ugc_moderation_classifier tool do? +

Multi-language UGC content moderation for marketplaces, social platforms and comment systems. Detects policy violations in text content across 9 policies and 12 languages without external API calls. Policies checked: • hate — hate speech, slurs, dehumanization (50+ terms × 12 languages) • sexual — explicit sexual content, pornography references, nudity solicitation • violence — threats, weapon references, graphic violence • self_harm — suicidal ideation, self-injury, eating disorder promotion • harassment — doxxing, stalking, cyberbullying, blackmail • scam — phishing, investment fraud, romance scam, lottery fraud • spam — bots, keyword stuffing, excessive caps, emoji storms, suspicious URLs • copyright — piracy, leaked content, serial keys, streaming fraud • minor_safety — grooming signals, CSAM references, minor + adult content combos Languages: en / fr / de / es / it / pt / nl / zh / ja / ko / ar / ru (auto-detected) Output includes severity (low/medium/high/severe), confidence (0-100), matched patterns, excerpt, recommended action, age appropriateness (adult/teen/child), and signals. No API key required. Stateless — no content is stored or logged. It is categorised as a Execute tool in the Gapup Mcp MCP Server, which means it can trigger actions or run processes. Use rate limits and argument validation.

What parameters does ugc_moderation_classifier accept? +

ugc_moderation_classifier accepts 5 parameters: lang, async, content, policies, content_type. Required: content. The full parameter table on this page comes from the server's own tool schema.

How do I enforce a policy on ugc_moderation_classifier? +

Register the Gapup MCP server in PolicyLayer and add a rule for ugc_moderation_classifier: allow, deny, rate-limit, or require approval. Point your MCP client at the PolicyLayer proxy URL and the rule is enforced on every call, before it reaches Gapup Mcp. Nothing to install.

What risk level is ugc_moderation_classifier? +

ugc_moderation_classifier is a Execute tool with high risk. Execute tools should be rate-limited and have argument validation enabled.

Can I rate-limit ugc_moderation_classifier? +

Yes. Add a rate_limit block to the ugc_moderation_classifier rule in your PolicyLayer policy. For example, setting max: 10 and window: 60 limits the tool to 10 calls per minute. Rate limits are tracked per agent session and reset automatically.

How do I block ugc_moderation_classifier completely? +

Set action: deny in the PolicyLayer policy for ugc_moderation_classifier. The AI agent will receive a policy violation error and cannot call the tool. You can also include a reason field to explain why the tool is blocked.

What MCP server provides ugc_moderation_classifier? +

ugc_moderation_classifier is provided by the Gapup MCP server (https://mcp.gapup.io/mcp). PolicyLayer sits as a proxy in front of this server to enforce policies before tool calls reach the server.

KEEP EXPLORING

More on Gapup, and thousands of servers like it.

This server

All 271 Gapup tools → High-risk tools on this server →

Across the catalogue

All high-risk MCP tools → The MCP Attack Database →

Guides

Govern code and CI agents →Agent sandbox →Least privilege for MCP →