Tech
DeepSeek App
(Greg Baker/Getty Images)

DeepSeek’s $6 million AI model just blew a $1 trillion hole in the market. Here’s the only explainer you’ll need on this “Sputnik moment”

A fast-moving story is shaking up the AI industry in many different ways.

Over the weekend, the DeepSeek AI story really exploded. There are a lot of different aspects to this story that strike right at the heart of the moment of this AI frenzy from the biggest tech companies in the world. Let’s break this complicated but fascinating story down.

To catch you up, Chinese startup DeepSeek released a group of new “DeepSeek R1” AI models, which have burst onto the scene and caused the entire AI industry (and the investors giving them billions to spend freely) to freak out in different ways. These models are free, mostly open-source, and appear to be beating the latest state-of-the-art models from OpenAI and Meta.

Faster, cheaper, better

What makes these models so noteworthy? Unlike OpenAI and Anthropic’s AI models, they are free for anyone to download, refine, and use for any purpose. Meta did a similar thing with its Llama 3 AI model, making it free for anyone to download, modify, and use. DeepSeek’s latest models were actually based off Llama. But there are lots of free models you can use today that are all pretty good.

The big thing that makes DeepSeek’s latest R1 models special is that they use multistep “reasoning,” just like OpenAI’s o1 models, which up until last week were considered best in class. The reasoning process is a bit slower, but it leads to better responses and reveals a “chain of thought” that shows the steps it takes.

DeepSeek is offering up models with the same secret sauce that OpenAI is charging a significant amount for. And OpenAI offers its models only on its own hosted platform, meaning companies can’t just download and host their own AI servers and control the data that flows to the model. With DeepSeek, you can host this on your own hardware and control your own stack, which obviously appeals to a lot of industries with sensitive data.

DeepSeek does offer hosted access to its models, too, but at a fraction of the cost of OpenAI. For example, OpenAI charges $15 per 1 million input “tokens” (pieces of text that get entered into a chat, which could be a word or letter in a sentence). But DeepSeek’s hosted model charges just $0.14 for 1 million input tokens. That’s a jaw-dropping difference if you’re running any kind of volume of AI queries.

Another crazy part of this story — and the one that’s likely moving the market today — is how this Chinese startup built this model. DeepSeek’s researchers said it cost only $5.6 million to train their foundational DeepSeek-V3 model, using just 2,048 Nvidia H800 GPUs (which were apparently acquired before the US slapped export restrictions on them).

For comparison, Meta has been hoarding more than 600,000 of the more powerful Nvidia H100 GPUs, and plans on ending the year with more than 1.3 million GPUs. DeepSeek’s V3 model was trained using 2.78 million GPU hours (a sum of the computing time required for training) while Meta’s Llama 3 took 30.8 million GPU hours.

And this faster, cheaper approach didn’t just result in a model that matched the leaders’ models; in some cases, it beat them. DeepSeek’s R1 models are beating OpenAI o1 in some math and coding benchmarks.

Did we bet on the wrong horse?

So a better, faster, cheaper Chinese AI model just dropped, and it could upend the industry’s big plans for the next generation of AI models. The biggest tech companies (Meta, Microsoft, Amazon, and Google) have been bracing their investors for years of massive capital expenditures because of the consensus that more GPUs and more data leads to exponential leaps in AI model capabilities. Recently, there are signs that this “AI scaling law” may have reached a plateau, and Nvidia’s place at the top of the AI food chain may be in peril.

A lot of the success DeepSeek had was a result of its using other AI models to generate “synthetic data” to train its models, rather than hunting for new stores of human-written texts.

If that bet on zillions of GPUs, Manhattan-size data centers, and hundreds of billions in AI infrastructure investment is wrong, what are we doing here? Cue the massive freak-out in the market today.

Top of the App Store

As if this story couldn’t get any crazier, this weekend the DeepSeek chatbot app soared to the top of the iOS App Store “Free Apps” list. Observers are calling this a “Sputnik moment” in the global race for AI dominance, but there are a lot of things we don’t know.

One thing we do know is that for all of Washington’s freak-out over TikTok leaking Americans’ personal data to China, this AI chatbot is absolutely sending your data to China, and is even subject to Chinese censorship policies. So don’t go asking DeepSeek about Tiananmen Square, the plight of Uyghurs in China, or Taiwan’s pro-democracy movement, and who knows what else.

Fallout

This weekend, The Information reported that inside Meta they’re indeed freaking out, setting up war rooms and rethinking AI strategy.

The new Trump administration is not going to like this, either, as it’s highlighted a vision of American domination of AI and plans to expedite approvals for new power plants and infrastructure to build massive data centers.

It’s unclear how the admin and lawmakers will react to these developments, but events are moving much faster than any branch of government can.

More Tech

See all Tech
tech

WSJ: Anduril’s weapons systems have failed during several tests

Autonomous drones by sea, land, and air. Futuristic AI-powered support fighter jets, and swarms of networked drones controlled by sophisticated software. These are some of the visions for the future of warfare pitched by defense tech startup Anduril. Cofounded by Oculus founder Palmer Luckey, the Peter Thiel-backed startup has landed some major national security contracts based on this futuristic outlook for battlefield AI.

But according to a report from The Wall Street Journal, the company’s tech is failing key tests in the real world, raising concerns about the viability and safety of Anduril’s systems within the military command.

Anduril’s Altius drones proved vulnerable to Russian jamming while deployed in Ukraine and have been pulled from the battlefield, per the report.

More than a dozen sea-based drone ships powered by Anduril’s Lattice command and control software recently shut down during a Navy test, creating a hazard for other vessels in the exercise.

And this summer, during a drone intercept test, Anduril’s counter-drone system crashed and caused a 22-acre fire at a California airport, the report found.

Anduril told the WSJ that the failures are just part of its rapid iterative development process:

“We recognize that our highly iterative model of technology development — moving fast, testing constantly, failing often, refining our work, and doing it all over again — can make the job of our critics easier. That is a risk we accept. We do fail… a lot.”

But according to a report from The Wall Street Journal, the company’s tech is failing key tests in the real world, raising concerns about the viability and safety of Anduril’s systems within the military command.

Anduril’s Altius drones proved vulnerable to Russian jamming while deployed in Ukraine and have been pulled from the battlefield, per the report.

More than a dozen sea-based drone ships powered by Anduril’s Lattice command and control software recently shut down during a Navy test, creating a hazard for other vessels in the exercise.

And this summer, during a drone intercept test, Anduril’s counter-drone system crashed and caused a 22-acre fire at a California airport, the report found.

Anduril told the WSJ that the failures are just part of its rapid iterative development process:

“We recognize that our highly iterative model of technology development — moving fast, testing constantly, failing often, refining our work, and doing it all over again — can make the job of our critics easier. That is a risk we accept. We do fail… a lot.”

tech

OpenAI’s partners shouldering $100 billion of debt, taking on all the risk

OpenAI’s ambitious plans for global AI infrastructure projects — like its series of massive Stargate AI data centers — will require tens of billions of dollars funded by debt, but you won’t find much of that on OpenAI’s balance sheet.

According to a new analysis by the Financial Times, OpenAI has somehow convinced its many partners to shoulder at least $100 billion in debt on its behalf, as well as the risks that come with it.

Partners Oracle, SoftBank, CoreWeave, Crusoe, and Blue Owl Capital are all taking on debt in the form of bonds, loans, and credit deals to meet their obligations with OpenAI for infrastructure and computing resources.

Having close ties with OpenAI has been an anchor for many publicly traded companies in recent weeks. The company’s cash burn and the rise of Gemini 3 have seemingly darkened its outlook and fostered guilt by association for many of its close partners and investors. Most notably, Oracle’s aggressive capital expenditure plans to support demand from OpenAI have sparked a sell-off in its stock while widening its credit default swap spreads.

A senior OpenAI executive told the FT: “That’s been kind of the strategy. How does [OpenAI] leverage other people’s balance sheets?”

Partners Oracle, SoftBank, CoreWeave, Crusoe, and Blue Owl Capital are all taking on debt in the form of bonds, loans, and credit deals to meet their obligations with OpenAI for infrastructure and computing resources.

Having close ties with OpenAI has been an anchor for many publicly traded companies in recent weeks. The company’s cash burn and the rise of Gemini 3 have seemingly darkened its outlook and fostered guilt by association for many of its close partners and investors. Most notably, Oracle’s aggressive capital expenditure plans to support demand from OpenAI have sparked a sell-off in its stock while widening its credit default swap spreads.

A senior OpenAI executive told the FT: “That’s been kind of the strategy. How does [OpenAI] leverage other people’s balance sheets?”

tech

Chinese tech giants are training their models offshore to sidestep US curbs on Nvidia’s chips

Nvidia can’t sell its best AI chips in the world’s second-largest economy. That’s an Nvidia problem. But it’s also a China problem — and it’s one that the region’s tech giants have resorted to solving by training their AI models overseas, according to a new report from the Financial Times.

Citing two people with direct knowledge of the matter, the FT reported that “Alibaba and ByteDance are among the tech groups training their latest large language models in data centers across south-east Asia.” Clusters of data centers have particularly boomed in Singapore and Malaysia, with many of the sites kitted out with Nvidia’s latest architecture.

One exception, per the FT, is DeepSeek, which continues to be trained domestically, having reportedly built up a stockpile of Nvidia chips before the US export ban came into effect.

Last week, Nvidia spiked on the news that the Trump administration was reportedly considering letting the tech giant sell its best Hopper chips — the generation of chips that preceded Blackwell — to China.

Citing two people with direct knowledge of the matter, the FT reported that “Alibaba and ByteDance are among the tech groups training their latest large language models in data centers across south-east Asia.” Clusters of data centers have particularly boomed in Singapore and Malaysia, with many of the sites kitted out with Nvidia’s latest architecture.

One exception, per the FT, is DeepSeek, which continues to be trained domestically, having reportedly built up a stockpile of Nvidia chips before the US export ban came into effect.

Last week, Nvidia spiked on the news that the Trump administration was reportedly considering letting the tech giant sell its best Hopper chips — the generation of chips that preceded Blackwell — to China.

tech
Millie Giles

Alibaba unveils its first AI glasses, taking on Meta directly in the wearables race

Retail and tech giant Alibaba launched its first consumer-ready, AI-powered smart glasses on Thursday, marking its entrance into the growing wearables market.

Announced back in July, the Quark AI glasses just went on sale in the Chinese retailer’s home market, with two versions currently available: the S1, starting at 3,799 Chinese yuan (~$536), and the G1, at 1,899 yuan (~$268) — a considerably lower price than Meta’s $799 Ray-Ban Display glasses, released in September.

tech
Jon Keegan

Musk: Tesla’s Austin Robotaxi fleet to “roughly double” next month, but falls well short of earlier goals

Yesterday, Elon Musk jumped onto a frustrated user’s post on X, who was complaining that they were unable to book a Robotaxi ride in Austin. Musk aimed to reassure the would-be customer that the company was expanding service in the city:

“The Tesla Robotaxi fleet in Austin should roughly double next month,” Musk wrote.

While that sounds impressive, there are reports that Austin has only 29 vehicles in service.

But last month, Musk said the Robotaxi goal was to have “probably 500 or more in the greater Austin area” by the end of the year.

Meanwhile, Google’s Waymo has more than 100 autonomous taxis running in Austin, and 1,000 more in the San Francisco Bay Area.

“The Tesla Robotaxi fleet in Austin should roughly double next month,” Musk wrote.

While that sounds impressive, there are reports that Austin has only 29 vehicles in service.

But last month, Musk said the Robotaxi goal was to have “probably 500 or more in the greater Austin area” by the end of the year.

Meanwhile, Google’s Waymo has more than 100 autonomous taxis running in Austin, and 1,000 more in the San Francisco Bay Area.

Latest Stories

Sherwood Media, LLC produces fresh and unique perspectives on topical financial news and is a fully owned subsidiary of Robinhood Markets, Inc., and any views expressed here do not necessarily reflect the views of any other Robinhood affiliate, including Robinhood Markets, Inc., Robinhood Financial LLC, Robinhood Securities, LLC, Robinhood Crypto, LLC, or Robinhood Money, LLC.