Generate a standardized llms.txt (and optionally llms-full.txt) file for a given domain. This file defines how large language models should interact with the site. Best for: Creating machine-readable permission guidelines for AI models. Not recommended for: General content extraction or research....
AI agents invoke firecrawl_generate_llmstxt to trigger actions in Firecrawl Web Scraping Server. What it does depends on the arguments the agent supplies, and its effects often reach beyond the immediate call — builds kicked off, notifications sent, workflows started.
| Parameter | Type | Required | Description |
|---|---|---|---|
url | string | — | The URL to generate LLMs.txt from |
maxUrls | number | — | Maximum number of URLs to process (1-100, default: 10) |
showFullText | boolean | — | Whether to show the full LLMs-full.txt in the response |
Parameters from the server's own tool schema.
This tool analyzes a domain and generates/produces a file defining AI interaction permissions. It triggers an external operation (crawling/analyzing the domain) and produces an output artifact. It goes beyond simple reading since it synthesizes and generates a structured output file, making Execute the most appropriate category.
From the tool's definition Generate a standardized llms.txt (and optionally llms-full.txt) file for a given domain. This file defines how large language models should interact with the site.
Risk signalsAccepts URL/endpoint input (url)
Documented attack patterns abuse exactly the kind of access firecrawl_generate_llmstxt gives an agent:
PolicyLayer is an MCP gateway — it sits between your AI agents and Firecrawl Web Scraping Server, and nothing reaches the server without passing your rules. This is the rule we recommend for firecrawl_generate_llmstxt:
{
"version": "1",
"default": "deny",
"tools": {
"firecrawl_generate_llmstxt": {
"limits": [
{
"counter": "firecrawl_generate_llmstxt_rate",
"window": "minute",
"max": 10,
"scope": "grant"
}
]
}
}
} firecrawl_generate_llmstxt stays usable, but rate-capped — a runaway agent can't fire it dozens of times a minute. Everything else on the server is denied unless you say otherwise.
Free to start. No card required.
Generate a standardized llms.txt (and optionally llms-full.txt) file for a given domain. This file defines how large language models should interact with the site. Best for: Creating machine-readable permission guidelines for AI models. Not recommended for: General content extraction or research. Arguments: - url (string, required): The base URL of the website to analyze. - maxUrls (number, optional): Max number of URLs to include (default: 10). - showFullText (boolean, optional): Whether to include llms-full.txt contents in the response. Prompt Example: "Generate an LLMs.txt file for example.com." Usage Example: json { "name": "firecrawl_generate_llmstxt", "arguments": { "url": "https://example.com", "maxUrls": 20, "showFullText": true } } Returns: LLMs.txt file contents (and optionally llms-full.txt). It is categorised as a Execute tool in the Firecrawl Web Scraping Server MCP Server, which means it can trigger actions or run processes. Use rate limits and argument validation.
firecrawl_generate_llmstxt accepts 3 parameters: url, maxUrls, showFullText. The full parameter table on this page comes from the server's own tool schema.
Register the Firecrawl Web Scraping Server MCP server in PolicyLayer and add a rule for firecrawl_generate_llmstxt: allow, deny, rate-limit, or require approval. Point your MCP client at the PolicyLayer proxy URL and the rule is enforced on every call, before it reaches Firecrawl Web Scraping Server. Nothing to install.
firecrawl_generate_llmstxt is a Execute tool with high risk. Execute tools should be rate-limited and have argument validation enabled.
Yes. Add a rate_limit block to the firecrawl_generate_llmstxt rule in your PolicyLayer policy. For example, setting max: 10 and window: 60 limits the tool to 10 calls per minute. Rate limits are tracked per agent session and reset automatically.
Set action: deny in the PolicyLayer policy for firecrawl_generate_llmstxt. The AI agent will receive a policy violation error and cannot call the tool. You can also include a reason field to explain why the tool is blocked.
firecrawl_generate_llmstxt is provided by the Firecrawl Web Scraping Server MCP server (0xzapata/firecrawl-mcp-server). PolicyLayer sits as a proxy in front of this server to enforce policies before tool calls reach the server.
Start from Firecrawl Web Scraping Server, add the rest of your stack, and see everything your agents can call. Then put policy on all of it.
Free to start. No card required.
8 Firecrawl Web Scraping Server tools catalogued and risk-classified — across an index of 43,000+ MCP servers.