OpenAI Rolls Out ChatGPT Lockdown Mode to Prevent Exfiltration

AI-Agents Research Tools

TL;DR: OpenAI has launched Lockdown Mode to block data exfiltration from prompt injection attacks by limiting outbound network requests.

Summary: OpenAI is rolling out Lockdown Mode to eligible personal and self-serve ChatGPT Business accounts. The security feature limits outbound network requests to prevent the final stage of data exfiltration when a prompt injection occurs. While it stops attackers from transferring sensitive data, it does not prevent prompt injections from appearing in cached web content or uploaded files.

Why it matters: This reduces the risk of sensitive user data being leaked via malicious prompt injections in third-party files or web content. Developers building on top of ChatGPT or advising clients on AI safety should note this new mitigation layer and its limitations.

Source: r/stablediffusion