OpenAI Flags High Cybersecurity Risks in Future AI Models

OpenAI (the maker of ChatGPT) expects its upcoming artificial intelligence (AI) models to reach high levels of cybersecurity capability, potentially enabling the exploitation of previously unknown software vulnerabilities or assisting in complex intrusions against enterprise or industrial targets. The company emphasized that these risks stem from dual-use capabilities that may also benefit defenders.

To address these concerns, OpenAI is investing in defensive measures, including tools for auditing code, patching vulnerabilities, and supporting security workflows. The company is implementing a layered, defense-in-depth approach that combines "access controls, infrastructure hardening, egress controls" with monitoring, detection systems, and threat intelligence programs.

OpenAI plans to introduce a trusted access program that will allow qualified cyberdefense professionals to use advanced capabilities for defensive purposes. It is also establishing the Frontier Risk Council, an advisory group of cybersecurity professionals that will guide safe deployment and evaluate potential misuse.

Additional initiatives include Aardvark, an agentic security researcher currently in private beta that is designed to identify and help patch software vulnerabilities, and collaboration with the Frontier Model Forum to develop a shared understanding of threat models across the AI industry. OpenAI frames these measures as ongoing, long-term investments to strengthen defenses and mitigate risks associated with increasingly capable AI systems.


Notes: This post was drafted with the assistance of AI tools and reviewed, edited, and published by humans. Image: DIW-AIgen
