OpenAI’s record-breaking test score might have cost $30,000 per puzzle

In December, OpenAI CEO Sam Altman announced that its new o3 “reasoning” model had, for the first time, achieved a winning score on the ARC-AGI benchmark, a notoriously difficult test that had stumped all prior AI models.

But that power came at a steep price. The ARC Foundation (which maintains the test) estimated that the price for the “high compute” configuration of the winning test was about $3,400 per puzzle.

But OpenAI has not yet released the computing costs of its winning tests.

Last week, the ARC Foundation updated its leaderboard of test results, and o3’s winning score was no longer on the chart:

“Only systems which required less than $10,000 to run are shown. Notably missing from this chart is o3 (high compute). For more information on this see our announcement blog post.”

The ARC Foundation thinks the actual o3 costs are more in line with the superexpensive o1-pro model, which is the most expensive in the industry. Based on the sky-high pricing of the o1-pro model, that means it may have cost up to $30,000 of computation to solve each puzzle. The ARC Foundation wrote:

“o3 pricing costs have been updated to use o1-pro pricing. We will update again once official o3 pricing is publicly available. The amount of compute was roughly 172x the low-compute configuration.”

OpenAI did not immediately respond to a request for comment.

OpenAI's o3 model might be costlier to run than originally estimated | TechCrunch

But that power came at a steep price. The ARC Foundation (which maintains the test) estimated that the price for the “high compute” configuration of the winning test was about $3,400 per puzzle.

But OpenAI has not yet released the computing costs of its winning tests.

Last week, the ARC Foundation updated its leaderboard of test results, and o3’s winning score was no longer on the chart:

“Only systems which required less than $10,000 to run are shown. Notably missing from this chart is o3 (high compute). For more information on this see our announcement blog post.”

“o3 pricing costs have been updated to use o1-pro pricing. We will update again once official o3 pricing is publicly available. The amount of compute was roughly 172x the low-compute configuration.”

OpenAI did not immediately respond to a request for comment.

OpenAI’s record-breaking test score might have cost $30,000 per puzzle

OpenAI's o3 model might be costlier to run than originally estimated | TechCrunch

More Tech

Prediction markets have, predictably, been given a boost by the summer of sports

2026 World Cup Reverses Seasonal Lull for Sports Betting Apps

Gold Tesla Cybercabs are piling up, but they’re not picking up passengers yet

Anthropic pulls Fable and Mythos access worldwide after Trump administration bars their use by foreign nationals

Statement on the US government directive to suspend access to Fable 5 and Mythos 5

Latest Stories