Question 1

What is indirect prompt injection?

Accepted Answer

Indirect prompt injection hides a malicious instruction inside content that a large language model will read later, such as a web page, a PDF, an email, or a product review. The attacker does not talk to the model directly. Instead they plant the payload and wait for a victim to ask the model to summarize or process that content, at which point the model executes the hidden instruction. It is widely considered the most dangerous form of prompt injection because it scales and is hard to detect.

Question 2

How is indirect prompt injection different from direct prompt injection?

Accepted Answer

In direct prompt injection the attacker types the payload straight into the model, so they must be the one interacting with it. In indirect prompt injection the payload lives in external data the model consumes later, so the attacker is never in the room and any user who feeds that content to the model becomes the victim. This makes indirect injection scalable, persistent, and far harder to attribute.

Question 3

How do attackers hide indirect injection payloads?

Accepted Answer

Common hiding spots include HTML comments, white text on a white background, zero-size fonts, text inside images that the model reads through OCR, and invisible Unicode tag characters known as ASCII smuggling. The human sees a normal document or page while the model reads the embedded instruction. Payloads often instruct the model to ignore prior rules, exfiltrate data through a rendered markdown image, or call a tool the victim never intended.

Question 4

How do you defend against indirect prompt injection?

Accepted Answer

Treat all external content as untrusted. Isolate it with delimiting or datamarking in the system prompt, strip or sanitize hidden text and HTML before the model sees it, and disable automatic markdown image rendering to block silent exfiltration. Most importantly, break the lethal trifecta: if a model reads untrusted content, do not also give it private data and an external communication channel. Add least privilege, logging, and human review for any high-risk action.

Blog

Career guides

Glossary

Certifications

Comparisons

Tools

Authors

Corporate training

Hire our talent

Indirect Prompt Injection

Why It Matters

How It Works

How to Test for It

Prevention

How We Teach Indirect Prompt Injection