返回 2026-06-07
🔒 安全

OpenAI 帮助文档:锁定模式 (Lockdown Mode)OpenAI Help: Lockdown Mode

simonwillison.net·2026-06-05 节选正文

OpenAI 宣布其“锁定模式”(Lockdown Mode)现已正式上线,并开始向符合条件的个人账户(包括 Free、Go、Plus 和 Pro)以及自助服务的 ChatGPT Business 账户推出。该功能的主要设计初衷是帮助用户有效防止数据泄露的最后阶段。此前该功能在 2 月份首次预告,如今已进入实质性的全面部署阶段。这标志着 OpenAI 在提升企业级和个人数据安全防护方面迈出了重要一步。

Simon Willison

5th June 2026 - Link Blog

OpenAI Help: Lockdown Mode. OpenAI first teased this in February, but now it's live and "rolling out to eligible personal accounts, including Free, Go, Plus, and Pro, and self-serve ChatGPT Business accounts":

Lockdown Mode is designed to help prevent the final stage of data exfiltration from a prompt injection attack by limiting outbound network requests that could transfer sensitive data to an attacker. Lockdown Mode does not prevent prompt injections from appearing in the content ChatGPT processes. For example, a prompt injection could appear in cached web content or in an uploaded file, and could still affect the behavior or accuracy of a response.

This looks really good to me.

The Lethal Trifecta occurs when an LLM system has access to all three of access to private data, exposure to untrusted content and a way to steal data and transmit it back to the attacker.

The only way to solve the trifecta is to cut off one of the three legs, and by far the easiest leg to restrict without making your LLM systems far less useful is the exfiltration vectors to steal data.

It looks to me like lockdown mode directly attacks that leg, using mechanisms that are deterministic and, crucially, are not evaluated by AI systems that themselves can be subverted by sufficiently devious attacks.

The existence of lockdown mode does however imply that ChatGPT, in its default settings, does not provide robust protection against sufficiently determined data exfiltration attacks!

需要完整排版与评论请前往来源站点阅读。