Back to stories
Tools

OpenAI Launches Codex Security, an AI Agent That Finds and Fixes Code Vulnerabilities

Michael Ouroumis2 min read
OpenAI Launches Codex Security, an AI Agent That Finds and Fixes Code Vulnerabilities

OpenAI has entered the application security market with Codex Security, an autonomous AI agent designed to discover, validate, and patch code vulnerabilities that traditional scanning tools miss. The tool launched on March 6 in research preview for enterprise customers.

How Codex Security Works

Unlike conventional static analysis tools that rely on pattern matching, Codex Security takes an agentic approach to security auditing. The system first builds a deep, project-specific threat model by analyzing the codebase's architecture, dependencies, and data flows. It then uses OpenAI's frontier reasoning models to search for complex vulnerabilities — including logic flaws and multi-step attack chains — that rule-based scanners typically overlook.

Critically, Codex Security validates its findings by attempting to exploit discovered vulnerabilities in sandboxed environments before reporting them. This dramatically reduces the false positive noise that has long plagued automated security tools.

Impressive Early Results

The numbers from beta testing tell a compelling story. False positive rates on detections fell by more than 50% across all tested repositories, while over-reported severity findings — a persistent pain point for security teams drowning in alerts — dropped by more than 90%.

In the 30 days leading up to launch, the agent scanned over 1.2 million commits, identifying 792 critical findings and 10,561 high-severity issues. Perhaps most notably, Codex Security discovered and helped report 14 CVEs across major open-source projects including OpenSSH, GnuTLS, PHP, and Chromium.

Availability and Pricing

Codex Security is available now to ChatGPT Enterprise, Business, and Edu customers. OpenAI is offering free usage for the first month, after which it will be bundled into existing enterprise pricing tiers. The research preview designation means the product is still being refined based on real-world feedback.

Market Implications

The launch positions OpenAI as a direct competitor to established application security vendors like Snyk, Veracode, and Checkmarx. Cybersecurity stocks showed mixed reactions following the announcement, with some incumbents dipping while the broader security sector remained stable.

For development teams, the promise is significant: a security tool that understands code intent rather than just code patterns, integrated directly into the workflows they already use. If Codex Security delivers on its beta results at scale, it could reshape how organizations approach application security — shifting from periodic audits and noisy scanners to continuous, context-aware vulnerability management powered by AI.

Learn AI for Free — FreeAcademy.ai

Take "Prompt Engineering Practice" — a free course with certificate to master the skills behind this story.

More in Tools

Google Turns Chrome Into an AI Coworker With Auto Browse, Powered by Gemini 3
Tools

Google Turns Chrome Into an AI Coworker With Auto Browse, Powered by Gemini 3

At Cloud Next 2026, Google unveiled Auto Browse, a Gemini 3-powered agent inside Chrome that handles multi-step web tasks for consumers and enterprise Workspace users.

5 days ago3 min read
OpenAI Launches Workspace Agents, Retires Custom GPTs for Teams
Tools

OpenAI Launches Workspace Agents, Retires Custom GPTs for Teams

OpenAI today unveiled workspace agents in ChatGPT as a research preview, positioning them as a direct replacement for custom GPTs and pitching Codex-powered shared agents at Business, Enterprise, Edu, and Teachers customers.

6 days ago2 min read
Cloudflare Launches Agent Memory Private Beta to Give AI Agents Persistent Recall
Tools

Cloudflare Launches Agent Memory Private Beta to Give AI Agents Persistent Recall

Cloudflare's new Agent Memory service extracts and stores information from AI agent conversations so models can recall context across sessions without bloating the token window, addressing one of agentic AI's biggest bottlenecks.

1 week ago2 min read