Tech
tech
Jon Keegan

OpenAI announces new frontier models o3 and o3 mini

On the last day of “shipmas,” OpenAI saved what might be the biggest news for last, though the 1-800 number remains the most fun.

In a puzzling branding move, OpenAI CEO Sam Altman announced their latest frontier models: “o3” and “o3-mini.” For some reason (possibly trademark related), they’re skipping “o2” altogether.

The models are not available to the public yet, but researchers can apply to participate in “public safety testing” of the models, which are expected to be widely released at the end of January. The new models feature multistep “reasoning” like the current o1 model, but also apply the process to safety, leading to a higher success rate at catching prohibited responses, according to Altman.

Altman announced the models on a livestream and revealed that the new models had achieved the highest scores on a benchmark test that has been notoriously difficult for AI models to solve.

The ARC-AGI benchmark is a visual test that consists of a series of patterns of squares on a grid, and the model must apply unique solutions to each puzzle, which requires learning new skills with each problem.

Altman said that the o3 model performed 20% better than the current o1 model on coding benchmarks, and highlighted the performance and cost improvements for the smaller o3-mini model.

The models are not available to the public yet, but researchers can apply to participate in “public safety testing” of the models, which are expected to be widely released at the end of January. The new models feature multistep “reasoning” like the current o1 model, but also apply the process to safety, leading to a higher success rate at catching prohibited responses, according to Altman.

Altman announced the models on a livestream and revealed that the new models had achieved the highest scores on a benchmark test that has been notoriously difficult for AI models to solve.

The ARC-AGI benchmark is a visual test that consists of a series of patterns of squares on a grid, and the model must apply unique solutions to each puzzle, which requires learning new skills with each problem.

Altman said that the o3 model performed 20% better than the current o1 model on coding benchmarks, and highlighted the performance and cost improvements for the smaller o3-mini model.

More Tech

See all Tech
tech

Epic scores two victories as “Fortnite” returns to Google Play and appeals court keeps injunction against Apple

“Fortnite” maker Epic Games notched two wins Thursday in its drawn-out battle against Big Tech’s app stores. “Fortnite” returned to the Google Play app store in the US, Reuters reports, as Epic continues working with Google to secure court approval for their settlement.

Meanwhile, a US appeals court partly reversed sanctions against Apple in Epic’s antitrust case, calling parts of the order overly broad, but upheld the contempt finding and left a sweeping injunction in place — keeping pressure on Apple to allow developers to steer users to outside payment options and reduce its tight control over how apps can communicate and monetize on iOS.

tech

Report: AI-powered toys tell kids where to find matches, parrot Chinese government propaganda

You may want to think twice before buying your kids a fancy AI-powered plush toy.

A new report from NBC News found that several AI-powered kids toys could easily be steered to dangerous as well as sexually explicit conversations in a shocking demonstration of the loose safety guardrails in this novel category of consumer electronics.

A report out by the Public Interest Research Group details what researchers found when they tested five AI-powered toys for kids bought from Amazon. Some of the toys offered instructions on where to find matches and how to start fires.

NBC News also bought some of these toys and found they parroted Chinese government propaganda and gave instructions for how to sharpen knives. Some of the toys also discussed inappropriate topics for kids, like sexual kinks.

The category of AI-powered kids toys is under scrutiny as major AI companies like OpenAI have announced partnerships with toy manufacturers like Mattel (which has yet to release an AI-powered toy).

A report out by the Public Interest Research Group details what researchers found when they tested five AI-powered toys for kids bought from Amazon. Some of the toys offered instructions on where to find matches and how to start fires.

NBC News also bought some of these toys and found they parroted Chinese government propaganda and gave instructions for how to sharpen knives. Some of the toys also discussed inappropriate topics for kids, like sexual kinks.

The category of AI-powered kids toys is under scrutiny as major AI companies like OpenAI have announced partnerships with toy manufacturers like Mattel (which has yet to release an AI-powered toy).

tech

OpenAI releases GPT-5.2, the “best model yet for real-world, professional use”

After feeling the heat from Google’s recent launch of its powerful Gemini 3 model, OpenAI’s response to its “code red” has been released, reportedly on an accelerated schedule to keep up with the competition.

The company’s new flagship model, GPT-5.2, is out, and the company is calling it “the most capable model series yet for professional knowledge work.”

OpenAI CEO Sam Altman called it the “smartest generally-available model in the world” and shared benchmarks that showed it achieving higher scores than Gemini 3 Pro and Anthopic’s Claude Opus 4.5 in some software engineering tests and abstract reasoning, math, and science problems.

In a press release announcing the new model, the company said: “Overall, GPT‑5.2 brings significant improvements in general intelligence, long-context understanding, agentic tool-calling, and vision — making it better at executing complex, real-world tasks end-to-end than any previous model.”

OpenAI CEO Sam Altman called it the “smartest generally-available model in the world” and shared benchmarks that showed it achieving higher scores than Gemini 3 Pro and Anthopic’s Claude Opus 4.5 in some software engineering tests and abstract reasoning, math, and science problems.

In a press release announcing the new model, the company said: “Overall, GPT‑5.2 brings significant improvements in general intelligence, long-context understanding, agentic tool-calling, and vision — making it better at executing complex, real-world tasks end-to-end than any previous model.”

tech

Google sinks on a string of bad news

Google is currently down nearly 2% amid a flurry of bad news for the tech giant:

  • OpenAI CEO Sam Altman said Google’s much-touted Gemini 3 model “had less of an impact on our metrics than maybe we feared.”

  • Disney sent Google a cease and desist letter accusing it of infringing Disney’s copyrights after announcing a $1 billion investment in competitor OpenAI.

  • Waymo recalled basically all of its vehicles — 3,067 — for a software update to fix a high-profile problem they had with driving past stopped school buses.

  • The AI trade generally is struggling today after Oracle posted underwhelming earnings results yesterday.

tech

Altman: Gemini 3 had less of an impact than we had feared

There have been a lot “code reds” flying around the AI world recently. But it turns out that the latest, declared by OpenAI CEO Sam Altman, may not be as dire as expected.

This morning Altman appeared on CNBC with Disney CEO Bob Iger to discuss Disney’s $1 billion investment in OpenAI. Altman told CNBC that Google’s Gemini 3 has “had less of an impact on our metrics than maybe we feared.”

Latest Stories

Sherwood Media, LLC produces fresh and unique perspectives on topical financial news and is a fully owned subsidiary of Robinhood Markets, Inc., and any views expressed here do not necessarily reflect the views of any other Robinhood affiliate, including Robinhood Markets, Inc., Robinhood Financial LLC, Robinhood Securities, LLC, Robinhood Crypto, LLC, or Robinhood Money, LLC.