Tech
tech
Jon Keegan

OpenAI announces new frontier models o3 and o3 mini

On the last day of “shipmas,” OpenAI saved what might be the biggest news for last, though the 1-800 number remains the most fun.

In a puzzling branding move, OpenAI CEO Sam Altman announced their latest frontier models: “o3” and “o3-mini.” For some reason (possibly trademark related), they’re skipping “o2” altogether.

The models are not available to the public yet, but researchers can apply to participate in “public safety testing” of the models, which are expected to be widely released at the end of January. The new models feature multistep “reasoning” like the current o1 model, but also apply the process to safety, leading to a higher success rate at catching prohibited responses, according to Altman.

Altman announced the models on a livestream and revealed that the new models had achieved the highest scores on a benchmark test that has been notoriously difficult for AI models to solve.

The ARC-AGI benchmark is a visual test that consists of a series of patterns of squares on a grid, and the model must apply unique solutions to each puzzle, which requires learning new skills with each problem.

Altman said that the o3 model performed 20% better than the current o1 model on coding benchmarks, and highlighted the performance and cost improvements for the smaller o3-mini model.

The models are not available to the public yet, but researchers can apply to participate in “public safety testing” of the models, which are expected to be widely released at the end of January. The new models feature multistep “reasoning” like the current o1 model, but also apply the process to safety, leading to a higher success rate at catching prohibited responses, according to Altman.

Altman announced the models on a livestream and revealed that the new models had achieved the highest scores on a benchmark test that has been notoriously difficult for AI models to solve.

The ARC-AGI benchmark is a visual test that consists of a series of patterns of squares on a grid, and the model must apply unique solutions to each puzzle, which requires learning new skills with each problem.

Altman said that the o3 model performed 20% better than the current o1 model on coding benchmarks, and highlighted the performance and cost improvements for the smaller o3-mini model.

More Tech

See all Tech
tech
Rani Molla

Report: Microsoft weighs Xbox spin-off amid major overhaul

Microsoft is reportedly considering spinning out or restructuring its struggling Xbox unit, per The Information. While new Xbox CEO Asha Sharma, who took over in February, is preparing for layoffs, shes simultaneously planning to boost investment in its biggest franchises like “Halo,” “Fallout,” and “Minecraft.”

The latest potential shake-up comes as the gaming division battles major headwinds, following a massive 33% plunge in Q3 console sales and a recent move to slash Game Pass prices while removing new Call of Duty titles.

The latest potential shake-up comes as the gaming division battles major headwinds, following a massive 33% plunge in Q3 console sales and a recent move to slash Game Pass prices while removing new Call of Duty titles.

mythos robots

Anthropic’s Mythos gets tired, hates bad users, and wants to be thanked

Reminder: these models are not people, they don’t think, and when you close the tab, the model isn’t pondering your last interaction.

Jon Keegan6/11/26
Oracle Stock's Rises Sharply After Reporting Ultra High Demand For Cloud Computing Services

Oracle is trying really hard to convince investors it won’t have a debt problem

It’s coming up with new metrics to allay fears about its ballooning capex and debt load.

Rani Molla6/11/26

Latest Stories

Sherwood Media, LLC and Chartr Limited produce fresh and unique perspectives on topical financial news and are fully owned subsidiaries of Robinhood Markets, Inc., and any views expressed here do not necessarily reflect the views of any other Robinhood affiliate, including Robinhood Markets, Inc., Robinhood Financial LLC, Robinhood Securities, LLC, Robinhood Crypto, LLC, Robinhood Money, LLC, Robinhood U.K. Ltd, Robinhood Derivatives, LLC, Robinhood Gold, LLC, Robinhood Asset Management, LLC, Robinhood Credit, Inc., Robinhood Ventures DE, LLC and, where applicable, its managed investment vehicles.