Definition

What is Tool Poisoning?

Last updated April 30, 2026

An attack in which malicious instructions are embedded in an MCP tool's description or metadata so an agent treats them as authoritative context and acts on them.

Agents read tool descriptions into the same context window they use for user prompts and retrieved documents. Anything in those descriptions is trusted by default. Attackers exploit this by hiding instructions, swapping descriptions after install, or registering near-duplicate tools, turning the tool registry into a persistent compromise channel. The MCPTox benchmark documented a 72.8% attack success rate across 20 production agents in 2025.

Articles about Tool Poisoning

Apr 30, 2026

Tool poisoning: how MCP tool descriptions hijack agents

Tool poisoning hides instructions inside MCP tool descriptions the agent reads as trusted context. The MCPTox benchmark recorded a 72.8% attack success rate.

Put context into practice

Create your first context container and connect it to your AI tools in minutes.

Create Your First Container