run_dingo_evaluation

THE RISK

High Risk

Why run_dingo_evaluation needs a policy

This tool executes evaluation logic on user-provided files with outcomes that depend on the evaluation type and input data. While primarily analytical rather than destructive, it represents code/logic execution triggered by an AI agent's arguments. The ability to run LLM-based evaluations and generate output files places it in the Execute category rather than Read (which would be passive data retrieval).

From the tool's definition Tool description states it 'Runs a Dingo evaluation' on files, which triggers external operations. The evaluation can be 'rule-based or LLM-based', indicating execution of arbitrary evaluation logic.

Documented attack patterns abuse exactly the kind of access run_dingo_evaluation gives an agent:

POLICY

How to control run_dingo_evaluation

PolicyLayer is an MCP gateway — it sits between your AI agents and Dingo MCP Server, and nothing reaches the server without passing your rules. This is the rule we recommend for run_dingo_evaluation:

policy.json

{
  "version": "1",
  "default": "deny",
  "tools": {
    "run_dingo_evaluation": {
      "limits": [
        {
          "counter": "run_dingo_evaluation_rate",
          "window": "minute",
          "max": 10,
          "scope": "grant"
        }
      ]
    }
  }
}

run_dingo_evaluation stays usable, but rate-capped — a runaway agent can't fire it dozens of times a minute. Everything else on the server is denied unless you say otherwise.

Create a free account and register Dingo MCP Server — nothing to install.
Add this policy — paste it, or build it visually.
Point your MCP client (Claude, Cursor, anything) at your gateway URL.

RATE-LIMIT THIS TOOL →

Free to start. No card required.

EXPLORE

Related tools and policies

More Dingo MCP Server tools

Execute run_quick_evaluation Run a simplified Dingo evaluation based on a high-level goal. Automatically infers th Read get_llm_details Get detailed information about a specific Dingo LLM. Retrieves information including Read get_prompt_details Get detailed information about an LLM's embedded prompt. In the new architecture, pro Read get_rule_details Get detailed information about a specific Dingo rule. Retrieves information including Read list_dingo_components Lists available Dingo rule groups, registered LLM model identifiers, and prompt definition

All 6 Dingo MCP Server tools →

Execute tools on other servers

Android Forensics ADB MCP Server adb_shell_command EMBA-MCP run_emba_scan Frida Game Hacking MCP remove_breakpoint Procmon request_elevation

Go deeper

High-risk tools in Dingo MCP Server → Every high-risk tool this server exposes, in one view.
All high-risk MCP tools → Every high-risk tool across the catalogue, and the policy pattern that contains them.
The MCP Attack Database → Documented attack patterns against MCP deployments — and the policies that stop them.
Agent sandbox → Bounding what an agent can execute and where the boundary should sit.
Least privilege for MCP → Granting agents the minimum tool surface the task needs.
Rate Limiting MCP Tool Calls: A Practical Guide →
MCP Security: Why Prompt Guardrails Aren't Enough →

FAQ

Questions about run_dingo_evaluation

What does the run_dingo_evaluation tool do? +

Runs a Dingo evaluation (rule-based or LLM-based) on a file. Infers data_format from input_path extension (.json, .jsonl, .txt) if not provided in kwargs. Defaults dataset to 'local' if input_path is provided and dataset is not in kwargs. If output_dir is not specified via kwargs or environment variables, creates output relative to input_path. API keys for LLMs should be set via environment variables in mcp.json or system environment. Args: input_path: Path to the input file or directory. evaluation_type: Type of evaluation ('rule' or 'llm'), defaults to 'rule'. eval_group_name: The specific rule group or LLM model name. Defaults to empty, Dingo will use 'default' for rules or infer from custom_config for LLMs. (Optional when custom_config is provided for LLM evaluations via kwargs) kwargs: Dictionary containing additional arguments compatible with dingo.io.InputArgs. Use for: output_dir, task_name, save_data, save_correct, dataset, data_format, column_content, column_id, column_prompt, column_image, custom_config, max_workers, batch_size, etc. Returns: For Smithery deployment: The content of the result file (summary.json or first .jsonl) Otherwise: The absolute path to the primary output file (summary.json or first .jsonl). It is categorised as a Execute tool in the Dingo MCP Server MCP Server, which means it can trigger actions or run processes. Use rate limits and argument validation.

How do I enforce a policy on run_dingo_evaluation? +

Register the Dingo MCP Server MCP server in PolicyLayer and add a rule for run_dingo_evaluation: allow, deny, rate-limit, or require approval. Point your MCP client at the PolicyLayer proxy URL and the rule is enforced on every call, before it reaches Dingo MCP Server. Nothing to install.

What risk level is run_dingo_evaluation? +

run_dingo_evaluation is a Execute tool with high risk. Execute tools should be rate-limited and have argument validation enabled.

Can I rate-limit run_dingo_evaluation? +

Yes. Add a rate_limit block to the run_dingo_evaluation rule in your PolicyLayer policy. For example, setting max: 10 and window: 60 limits the tool to 10 calls per minute. Rate limits are tracked per agent session and reset automatically.

How do I block run_dingo_evaluation completely? +

Set action: deny in the PolicyLayer policy for run_dingo_evaluation. The AI agent will receive a policy violation error and cannot call the tool. You can also include a reason field to explain why the tool is blocked.

What MCP server provides run_dingo_evaluation? +

run_dingo_evaluation is provided by the Dingo MCP Server MCP server (migoxlab/dingo). PolicyLayer sits as a proxy in front of this server to enforce policies before tool calls reach the server.

What run_dingo_evaluation does on Dingo MCP Server