neural-forge.io

Sign inStartStart learning

Tendril

Model Families0%

Lesson 407 of 2116

Prompt-Injection Risks Specific To ChatGPT Plugins And Connectors

When ChatGPT can read your email, browse the web, or call APIs, attackers can hide instructions inside that content. The risk is real and the defenses are mostly hygiene.

CreatorsModel Families~5 min readBI3 · LearningBI4 · Natural InteractionBI5 · Societal ImpactPrint / PDF

Lesson map

What this lesson covers

9 min16 blocks5 concepts

Learning path

The main moves in order

1What prompt injection is in this context
2prompt injection
3indirect injection
4tool-use risk

Concept cluster

Terms to connect while reading

prompt injectionindirect injectiontool-use riskleast privilegeauditability

Read2

Sections4

Lists3

Notes5

Compare1

Terms1

Section 1

What prompt injection is in this context

Direct prompt injection is when a user types adversarial instructions into ChatGPT. Indirect prompt injection is when ChatGPT reads content from a tool — a webpage, an email, a calendar invite — and that content contains instructions intended to override the system prompt. The model has no reliable way to tell instructions from data. That is the whole problem.

Where the risk concentrates in ChatGPT

1Browser tools — a webpage can include hidden text targeting agents.
2Email connectors — an inbound email can contain instructions to forward content.
3Document Q&A — a malicious uploaded file can carry an injection payload.
4Calendar invites — descriptions are user-controlled and reach the agent.
5Custom GPT actions — return data from your API can contain hostile text from third-party sources.

Compare the options

Capability surface	Worst-case if injection succeeds	Mitigation
Browser / Operator	Agent visits attacker site, takes action	Approval gate every navigation
Email connector	Sensitive email forwarded to attacker	No 'send' action without explicit human approval
Document Q&A	Hidden instructions exfiltrate other docs	Strip / sanitize untrusted documents before indexing
Custom GPT action	Action calls attacker-controlled endpoint	Allowlist domains, never echo arbitrary URLs

Check-in 1. Got it so far?

Practical defenses for non-engineers

Treat any tool the model uses as if it could be hostile. Approve sends and reads explicitly.
Never let an agent take an irreversible action from data it pulled in by itself.
Scope connectors to the minimum needed. Revoke scope when the project ends.
Watch for surprise actions — an agent that suddenly wants to email someone is a tell.
Log everything your agent does. The audit trail is your only forensic tool.

Check-in 2. Got it so far?

Applied exercise

1List every connector and Custom GPT action your account has live.
2For each, write the worst-case outcome of a successful injection.
3Disable any whose worst-case is unacceptable.
4Set a 60-day reminder to repeat this audit.

Key terms in this lesson

The big idea: every tool you give the model expands the attack surface. Defense is mostly hygiene — minimum scope, explicit approvals, regular audits.

Check-in 3. Got it so far?

End-of-lesson quiz

Check what stuck

15 questions · Score saves to your progress.

Tutor

Curious about “Prompt-Injection Risks Specific To ChatGPT Plugins And Connectors”?

Ask anything about this lesson. I’ll answer using just what you’re reading — short, friendly, grounded.

Progress saved locally in this browser. Sign in to sync across devices.

Related lessons

Keep going