Back to stories
Models

OpenAI's 'Spud' Caught Live in API Testing, Polymarket Jumps to 81% for April 23 Launch

Michael Ouroumis2 min read
OpenAI's 'Spud' Caught Live in API Testing, Polymarket Jumps to 81% for April 23 Launch

OpenAI's next frontier model broke cover in the most revealing way possible: not through a keynote or a blog post, but through its own traffic. On Sunday, April 19, independent API monitors flagged OpenAI's next major model — internally codenamed 'Spud' — running in live, production-scale testing with no official announcement. Within hours, Polymarket traders had priced an 81% implied probability of a public launch on April 23.

The leak turns a slow-burning rumor into a near-term product event. OpenAI has spent the past month hinting at Spud without naming it, and the market is now treating it as this week's story rather than this quarter's.

What the live testing reveals

API monitors caught Spud serving production traffic, not running in a sandboxed eval. That implies two things: the model is stable enough for real requests, and OpenAI is comfortable letting it touch paying customers before announcing it. Reports indicate Spud has been routed through GPT-5.4 Pro surfaces so the team can collect behavior data from real workloads before the official flip.

Spud completed pretraining around March 24, 2026. Sam Altman told employees it was 'a very strong model that could really accelerate the economy,' and Greg Brockman, on the Big Technology podcast, described it as representing 'two years of research' with a 'big model smell.' A leaked internal memo, reported by The Verge and summarized by The Decoder, called Spud 'an important step in the intelligence foundation for the next generation of work,' and said early customer feedback pointed to 'stronger reasoning, better understanding of intentions and dependencies, and more reliable production results.'

Why Polymarket jumped to 81%

Polymarket's spike is less a prediction than a summary of what traders can already see. The combination of live API traces, Altman's late-March 'a few weeks' comment, and a memo framing Spud as a platform-level upgrade all point to a launch window that is now measured in days. Polymarket also assigns a 78% probability of release by April 30 and 95%+ by June 30 — so the market's real debate is whether it ships Wednesday or slips by a week.

Implications

For enterprises, Spud is the model that finally matches OpenAI's new super-app and agentic stack to a backing engine built for them; GPT-5.4 was a bridge, not the destination. For competitors, the timing is uncomfortable: Anthropic's Claude Opus 4.7 is fresh, Google's Gemini 3.1 Pro is entrenched, and a credible GPT-5.5 would reset the benchmark conversation mid-cycle. And for OpenAI, the accidental reveal is a reminder that at this scale, models leak through their own latency graphs long before they leak through memos.

If the 81% prints, Wednesday is the new release day.

Learn AI for Free — FreeAcademy.ai

Take "AI Essentials: Understanding AI in 2026" — a free course with certificate to master the skills behind this story.

More in Models

xAI Launches Grok Voice Think Fast 1.0, Tops τ-Voice Bench and Powers Starlink Support
Models

xAI Launches Grok Voice Think Fast 1.0, Tops τ-Voice Bench and Powers Starlink Support

xAI's new voice model scored 67.3% on the τ-voice Bench — well ahead of Gemini 3.1 Flash Live and GPT Realtime — and is now powering Starlink's phone sales and support with a 70% autonomous resolution rate.

2 days ago2 min read
Tencent Drops Hy3 Preview: 295B Open-Source MoE Model Kicks DeepSeek Out of Yuanbao
Models

Tencent Drops Hy3 Preview: 295B Open-Source MoE Model Kicks DeepSeek Out of Yuanbao

Tencent has open-sourced Hy3 Preview, a 295B/21B-activated mixture-of-experts model built in under three months. The Yuanbao chatbot is switching its primary engine from DeepSeek to the new in-house model.

4 days ago2 min read
DeepSeek V4 Preview Lands: 1.6T-Parameter Open Model With 1M Context, Flash Pricing at $0.14/M
Models

DeepSeek V4 Preview Lands: 1.6T-Parameter Open Model With 1M Context, Flash Pricing at $0.14/M

DeepSeek on April 24 released preview versions of V4-Pro and V4-Flash, an open-weight MoE family with a 1M-token context window and pricing that undercuts Western frontier labs.

4 days ago2 min read