Generate a standardized llms.txt (and optionally llms-full.txt) file for a given domain. This file defines how large language models should interact with the site. Best for: Creating machine-readable permission guidelines for AI models. Not recommended for: General content extraction or research....
Risk signalsAccepts URL/endpoint input (url)
Part of the Firecrawl Web Scraping Server server.
Free to start. No card required.
AI agents use firecrawl_generate_llmstxt to create or modify resources in Firecrawl Web Scraping Server. Write operations carry medium risk because an autonomous agent could trigger bulk unintended modifications. Rate limits prevent a single agent session from making hundreds of changes in rapid succession. Argument validation ensures the agent passes expected values.
Without a policy, an AI agent could call firecrawl_generate_llmstxt repeatedly, creating or modifying resources faster than any human could review. PolicyLayer's rate limiting ensures write operations happen at a controlled pace, and argument validation catches malformed or unexpected inputs before they reach Firecrawl Web Scraping Server.
Write tools can modify data. A rate limit prevents runaway bulk operations from AI agents.
{
"version": "1",
"default": "deny",
"tools": {
"firecrawl_generate_llmstxt": {
"limits": [
{
"counter": "firecrawl_generate_llmstxt_rate",
"window": "minute",
"max": 30,
"scope": "grant"
}
]
}
}
} See the full Firecrawl Web Scraping Server policy for all 8 tools.
These attack patterns abuse exactly the kind of access firecrawl_generate_llmstxt gives an agent. Each links to the full case and the policy that stops it:
Other write tools across the catalogue. The same approach applies to each: rate-limit and validate the arguments.
Generate a standardized llms.txt (and optionally llms-full.txt) file for a given domain. This file defines how large language models should interact with the site. Best for: Creating machine-readable permission guidelines for AI models. Not recommended for: General content extraction or research. Arguments: - url (string, required): The base URL of the website to analyze. - maxUrls (number, optional): Max number of URLs to include (default: 10). - showFullText (boolean, optional): Whether to include llms-full.txt contents in the response. Prompt Example: "Generate an LLMs.txt file for example.com." Usage Example: json { "name": "firecrawl_generate_llmstxt", "arguments": { "url": "https://example.com", "maxUrls": 20, "showFullText": true } } Returns: LLMs.txt file contents (and optionally llms-full.txt).. It is categorised as a Write tool in the Firecrawl Web Scraping Server MCP Server, which means it can create or modify data. Consider rate limits to prevent runaway writes.
Register the Firecrawl Web Scraping Server MCP server in PolicyLayer and add a rule for firecrawl_generate_llmstxt: allow, deny, rate-limit, or require approval. Point your MCP client at the PolicyLayer proxy URL and the rule is enforced on every call, before it reaches Firecrawl Web Scraping Server. Nothing to install.
firecrawl_generate_llmstxt is a Write tool with medium risk. Write tools should be rate-limited to prevent accidental bulk modifications.
Yes. Add a rate_limit block to the firecrawl_generate_llmstxt rule in your PolicyLayer policy. For example, setting max: 10 and window: 60 limits the tool to 10 calls per minute. Rate limits are tracked per agent session and reset automatically.
Set action: deny in the PolicyLayer policy for firecrawl_generate_llmstxt. The AI agent will receive a policy violation error and cannot call the tool. You can also include a reason field to explain why the tool is blocked.
firecrawl_generate_llmstxt is provided by the Firecrawl Web Scraping Server MCP server (NYO2008/firecrawl-mcp-server). PolicyLayer sits as a proxy in front of this server to enforce policies before tool calls reach the server.
Deterministic rules across all 8 Firecrawl Web Scraping Server tools. Per-identity grants. Full audit log. Live in minutes. Nothing to install.
Free to start. No card required.
4,600+ MCP servers and 31,000+ tools scanned and risk-classified.