What is OpenAI Codex Security and how does it work?

Codex Security is an AI-powered application security agent that builds a project-specific threat model, searches for vulnerabilities using agentic reasoning, validates findings in sandboxed environments, and proposes fixes aligned with the system's architecture.

Who can access OpenAI Codex Security?

Codex Security is currently available in research preview to ChatGPT Enterprise, Business, and Edu customers, with free usage included for the first month.

How accurate is Codex Security at finding real vulnerabilities?

During beta testing, false positive rates dropped by over 50% and over-reported severity findings fell by more than 90%. The agent has already helped report 14 CVEs across major open-source projects.

OpenAI Launches Codex Security, an AI Agent That Finds and Fixes Code Vulnerabilities

OpenAI has entered the application security market with Codex Security, an autonomous AI agent designed to discover, validate, and patch code vulnerabilities that traditional scanning tools miss. The tool launched on March 6 in research preview for enterprise customers.

How Codex Security Works

Unlike conventional static analysis tools that rely on pattern matching, Codex Security takes an agentic approach to security auditing. The system first builds a deep, project-specific threat model by analyzing the codebase's architecture, dependencies, and data flows. It then uses OpenAI's frontier reasoning models to search for complex vulnerabilities — including logic flaws and multi-step attack chains — that rule-based scanners typically overlook.

Critically, Codex Security validates its findings by attempting to exploit discovered vulnerabilities in sandboxed environments before reporting them. This dramatically reduces the false positive noise that has long plagued automated security tools.

Impressive Early Results

The numbers from beta testing tell a compelling story. False positive rates on detections fell by more than 50% across all tested repositories, while over-reported severity findings — a persistent pain point for security teams drowning in alerts — dropped by more than 90%.

In the 30 days leading up to launch, the agent scanned over 1.2 million commits, identifying 792 critical findings and 10,561 high-severity issues. Perhaps most notably, Codex Security discovered and helped report 14 CVEs across major open-source projects including OpenSSH, GnuTLS, PHP, and Chromium.

Availability and Pricing

Codex Security is available now to ChatGPT Enterprise, Business, and Edu customers. OpenAI is offering free usage for the first month, after which it will be bundled into existing enterprise pricing tiers. The research preview designation means the product is still being refined based on real-world feedback.

Market Implications

The launch positions OpenAI as a direct competitor to established application security vendors like Snyk, Veracode, and Checkmarx. Cybersecurity stocks showed mixed reactions following the announcement, with some incumbents dipping while the broader security sector remained stable.

For development teams, the promise is significant: a security tool that understands code intent rather than just code patterns, integrated directly into the workflows they already use. If Codex Security delivers on its beta results at scale, it could reshape how organizations approach application security — shifting from periodic audits and noisy scanners to continuous, context-aware vulnerability management powered by AI.

OpenAI Launches Codex Security, an AI Agent That Finds and Fixes Code Vulnerabilities

How Codex Security Works

Impressive Early Results

Availability and Pricing

Market Implications

More in Tools

Google Turns Chrome Into an AI Coworker With Auto Browse, Powered by Gemini 3

OpenAI Launches Workspace Agents, Retires Custom GPTs for Teams

Cloudflare Launches Agent Memory Private Beta to Give AI Agents Persistent Recall