High Risk →

firecrawl_generate_llmstxt

Generate a standardized llms.txt (and optionally llms-full.txt) file for a given domain. This file defines how large language models should interact with the site. Best for: Creating machine-readable permission guidelines for AI models. Not recommended for: General content extraction or research....

How to control firecrawl_generate_llmstxt ↓

What firecrawl_generate_llmstxt does on Firecrawl Web Scraping Server

AI agents invoke firecrawl_generate_llmstxt to trigger actions in Firecrawl Web Scraping Server. What it does depends on the arguments the agent supplies, and its effects often reach beyond the immediate call — builds kicked off, notifications sent, workflows started.

ParameterTypeRequiredDescription
url string The URL to generate LLMs.txt from
maxUrls number Maximum number of URLs to process (1-100, default: 10)
showFullText boolean Whether to show the full LLMs-full.txt in the response

Parameters from the server's own tool schema.

High Risk

Why firecrawl_generate_llmstxt needs a policy

This tool analyzes a domain and generates/produces a file defining AI interaction permissions. It triggers an external operation (crawling/analyzing the domain) and produces an output artifact. It goes beyond simple reading since it synthesizes and generates a structured output file, making Execute the most appropriate category.

From the tool's definition Generate a standardized llms.txt (and optionally llms-full.txt) file for a given domain. This file defines how large language models should interact with the site.

Risk signalsAccepts URL/endpoint input (url)

Documented attack patterns abuse exactly the kind of access firecrawl_generate_llmstxt gives an agent:

How to control firecrawl_generate_llmstxt

PolicyLayer is an MCP gateway — it sits between your AI agents and Firecrawl Web Scraping Server, and nothing reaches the server without passing your rules. This is the rule we recommend for firecrawl_generate_llmstxt:

policy.json
{
  "version": "1",
  "default": "deny",
  "tools": {
    "firecrawl_generate_llmstxt": {
      "limits": [
        {
          "counter": "firecrawl_generate_llmstxt_rate",
          "window": "minute",
          "max": 10,
          "scope": "grant"
        }
      ]
    }
  }
}

firecrawl_generate_llmstxt stays usable, but rate-capped — a runaway agent can't fire it dozens of times a minute. Everything else on the server is denied unless you say otherwise.

  1. Create a free account and register Firecrawl Web Scraping Server — nothing to install.
  2. Add this policy — paste it, or build it visually.
  3. Point your MCP client (Claude, Cursor, anything) at your gateway URL.
RATE-LIMIT THIS TOOL →

Free to start. No card required.

Related tools and policies

Go deeper

Questions about firecrawl_generate_llmstxt

What does the firecrawl_generate_llmstxt tool do? +

Generate a standardized llms.txt (and optionally llms-full.txt) file for a given domain. This file defines how large language models should interact with the site. Best for: Creating machine-readable permission guidelines for AI models. Not recommended for: General content extraction or research. Arguments: - url (string, required): The base URL of the website to analyze. - maxUrls (number, optional): Max number of URLs to include (default: 10). - showFullText (boolean, optional): Whether to include llms-full.txt contents in the response. Prompt Example: "Generate an LLMs.txt file for example.com." Usage Example: json { "name": "firecrawl_generate_llmstxt", "arguments": { "url": "https://example.com", "maxUrls": 20, "showFullText": true } } Returns: LLMs.txt file contents (and optionally llms-full.txt). It is categorised as a Execute tool in the Firecrawl Web Scraping Server MCP Server, which means it can trigger actions or run processes. Use rate limits and argument validation.

What parameters does firecrawl_generate_llmstxt accept? +

firecrawl_generate_llmstxt accepts 3 parameters: url, maxUrls, showFullText. The full parameter table on this page comes from the server's own tool schema.

How do I enforce a policy on firecrawl_generate_llmstxt? +

Register the Firecrawl Web Scraping Server MCP server in PolicyLayer and add a rule for firecrawl_generate_llmstxt: allow, deny, rate-limit, or require approval. Point your MCP client at the PolicyLayer proxy URL and the rule is enforced on every call, before it reaches Firecrawl Web Scraping Server. Nothing to install.

What risk level is firecrawl_generate_llmstxt? +

firecrawl_generate_llmstxt is a Execute tool with high risk. Execute tools should be rate-limited and have argument validation enabled.

Can I rate-limit firecrawl_generate_llmstxt? +

Yes. Add a rate_limit block to the firecrawl_generate_llmstxt rule in your PolicyLayer policy. For example, setting max: 10 and window: 60 limits the tool to 10 calls per minute. Rate limits are tracked per agent session and reset automatically.

How do I block firecrawl_generate_llmstxt completely? +

Set action: deny in the PolicyLayer policy for firecrawl_generate_llmstxt. The AI agent will receive a policy violation error and cannot call the tool. You can also include a reason field to explain why the tool is blocked.

What MCP server provides firecrawl_generate_llmstxt? +

firecrawl_generate_llmstxt is provided by the Firecrawl Web Scraping Server MCP server (0xzapata/firecrawl-mcp-server). PolicyLayer sits as a proxy in front of this server to enforce policies before tool calls reach the server.

Enforce policy on every Firecrawl Web Scraping Server tool call.

Start from Firecrawl Web Scraping Server, add the rest of your stack, and see everything your agents can call. Then put policy on all of it.

Free to start. No card required.

8 Firecrawl Web Scraping Server tools catalogued and risk-classified — across an index of 43,000+ MCP servers.

// GET IN TOUCH

Have a question or want to learn more? Send us a message.

Message sent.

We'll get back to you soon.