What is RAG Knowledge Poisoning?
A cognitive state attack that injects fabricated statements into retrieval corpora so agents treat attacker-authored content as verified fact, corrupting downstream reasoning and decisions.
WHY IT MATTERS
RAG-powered agents trust their retrieval corpus as a source of truth. If an attacker can insert content into that corpus — through compromised documents, poisoned web pages, or manipulated knowledge bases — the agent will retrieve and act on false information.
The attack is effective because RAG retrieval feels authoritative. The agent doesn't distinguish between legitimate knowledge and injected content — both arrive through the same retrieval pipeline.
HOW POLICYLAYER USES THIS
Intercept's policy enforcement catches the downstream effects of RAG poisoning. Even if an agent's knowledge is corrupted, the tool calls it makes are still evaluated against deterministic rules.