Tech
DeepSeek App
(Greg Baker/Getty Images)

DeepSeek’s $6 million AI model just blew a $1 trillion hole in the market. Here’s the only explainer you’ll need on this “Sputnik moment”

A fast-moving story is shaking up the AI industry in many different ways.

Over the weekend, the DeepSeek AI story really exploded. There are a lot of different aspects to this story that strike right at the heart of the moment of this AI frenzy from the biggest tech companies in the world. Let’s break this complicated but fascinating story down.

To catch you up, Chinese startup DeepSeek released a group of new “DeepSeek R1” AI models, which have burst onto the scene and caused the entire AI industry (and the investors giving them billions to spend freely) to freak out in different ways. These models are free, mostly open-source, and appear to be beating the latest state-of-the-art models from OpenAI and Meta.

Faster, cheaper, better

What makes these models so noteworthy? Unlike OpenAI and Anthropic’s AI models, they are free for anyone to download, refine, and use for any purpose. Meta did a similar thing with its Llama 3 AI model, making it free for anyone to download, modify, and use. DeepSeek’s latest models were actually based off Llama. But there are lots of free models you can use today that are all pretty good.

The big thing that makes DeepSeek’s latest R1 models special is that they use multistep “reasoning,” just like OpenAI’s o1 models, which up until last week were considered best in class. The reasoning process is a bit slower, but it leads to better responses and reveals a “chain of thought” that shows the steps it takes.

DeepSeek is offering up models with the same secret sauce that OpenAI is charging a significant amount for. And OpenAI offers its models only on its own hosted platform, meaning companies can’t just download and host their own AI servers and control the data that flows to the model. With DeepSeek, you can host this on your own hardware and control your own stack, which obviously appeals to a lot of industries with sensitive data.

DeepSeek does offer hosted access to its models, too, but at a fraction of the cost of OpenAI. For example, OpenAI charges $15 per 1 million input “tokens” (pieces of text that get entered into a chat, which could be a word or letter in a sentence). But DeepSeek’s hosted model charges just $0.14 for 1 million input tokens. That’s a jaw-dropping difference if you’re running any kind of volume of AI queries.

Another crazy part of this story — and the one that’s likely moving the market today — is how this Chinese startup built this model. DeepSeek’s researchers said it cost only $5.6 million to train their foundational DeepSeek-V3 model, using just 2,048 Nvidia H800 GPUs (which were apparently acquired before the US slapped export restrictions on them).

For comparison, Meta has been hoarding more than 600,000 of the more powerful Nvidia H100 GPUs, and plans on ending the year with more than 1.3 million GPUs. DeepSeek’s V3 model was trained using 2.78 million GPU hours (a sum of the computing time required for training) while Meta’s Llama 3 took 30.8 million GPU hours.

And this faster, cheaper approach didn’t just result in a model that matched the leaders’ models; in some cases, it beat them. DeepSeek’s R1 models are beating OpenAI o1 in some math and coding benchmarks.

Did we bet on the wrong horse?

So a better, faster, cheaper Chinese AI model just dropped, and it could upend the industry’s big plans for the next generation of AI models. The biggest tech companies (Meta, Microsoft, Amazon, and Google) have been bracing their investors for years of massive capital expenditures because of the consensus that more GPUs and more data leads to exponential leaps in AI model capabilities. Recently, there are signs that this “AI scaling law” may have reached a plateau, and Nvidia’s place at the top of the AI food chain may be in peril.

A lot of the success DeepSeek had was a result of its using other AI models to generate “synthetic data” to train its models, rather than hunting for new stores of human-written texts.

If that bet on zillions of GPUs, Manhattan-size data centers, and hundreds of billions in AI infrastructure investment is wrong, what are we doing here? Cue the massive freak-out in the market today.

Top of the App Store

As if this story couldn’t get any crazier, this weekend the DeepSeek chatbot app soared to the top of the iOS App Store “Free Apps” list. Observers are calling this a “Sputnik moment” in the global race for AI dominance, but there are a lot of things we don’t know.

One thing we do know is that for all of Washington’s freak-out over TikTok leaking Americans’ personal data to China, this AI chatbot is absolutely sending your data to China, and is even subject to Chinese censorship policies. So don’t go asking DeepSeek about Tiananmen Square, the plight of Uyghurs in China, or Taiwan’s pro-democracy movement, and who knows what else.

Fallout

This weekend, The Information reported that inside Meta they’re indeed freaking out, setting up war rooms and rethinking AI strategy.

The new Trump administration is not going to like this, either, as it’s highlighted a vision of American domination of AI and plans to expedite approvals for new power plants and infrastructure to build massive data centers.

It’s unclear how the admin and lawmakers will react to these developments, but events are moving much faster than any branch of government can.

More Tech

See all Tech
tech

Meta reportedly strikes multibillion-dollar AI chip deal with Google as it struggles to design its own

Meta has signed a deal with Google to rent tensor processing units to develop new AI models and is in talks to buy the chips for its data centers, The Information reports.

The agreement comes on top of a recently announced “multi-generational” partnership with Nvidia and a chip supply deal with Advanced Micro Devices that could be worth more than $100 billion, as Meta scrapped its most advanced in-house AI training chip amid design challenges.

A Meta deal with Google, which has been rumored since November, would position the search giant more directly as a competitor to Nvidia in its core business of AI processors. Some analysts have said selling its custom chips to outside customers could become a business worth hundreds of billions of dollars for Google.

A Meta deal with Google, which has been rumored since November, would position the search giant more directly as a competitor to Nvidia in its core business of AI processors. Some analysts have said selling its custom chips to outside customers could become a business worth hundreds of billions of dollars for Google.

tech

Delays in permitting, power, and zoning cause first drop in data center construction since 2020

Despite incredible demand, the number of data centers under construction in North America fell for the first time since 2020, according to new research from CBRE.

Total data center capacity under construction dropped about 5.6% year on year from 6.35 megawatts in 2024 to 5.99 megawatts by the end of 2025.

What’s causing the delay? Slow permitting, constrained supply chains, and growing public engagement with how deals are approved at the local level. Labor constraints also were cited in the report; a tight supply of skilled workers will increase costs.

What’s causing the delay? Slow permitting, constrained supply chains, and growing public engagement with how deals are approved at the local level. Labor constraints also were cited in the report; a tight supply of skilled workers will increase costs.

-13%📱

Smartphone shipments are expected to decline 13% — the biggest drop ever — to 1.12 billion in 2026, according to new data from IDC, as the memory shortage drives up costs and prices for phones. The firm expects the average smartphone selling price to jump 14% to a record $523 this year.

The shortfall will mostly affect makers of lower-end smartphones, whose customers are more cost-conscious, while higher-end manufacturers like Samsung and Apple are likely to be more insulated from the pressure.

“The memory crisis will cause more than a temporary decline; it marks a structural reset of the entire market, fundamentally reshaping long‑term TAM (Total Addressable Market), the vendor landscape, and the product mix,” said Nabila Popal, senior research director with IDCs Worldwide Quarterly Mobile Phone Tracker. “We expect consolidation as smaller players exit, and low-end vendors to face sharp shipment declines amid supply constraints and lower demand at higher price points.”

tech

Google drops new Nano Banana

Google is hoping to recapture the viral boost it received when it released its Nano Banana image generation model. Nano Banana 2 arrives today, which Google has rolled into its Gemini app.

The new model promises more accurate text rendering and translation and “advanced world knowledge,” which “pulls from Gemini’s real-world knowledge base, and is powered by real-time information and images from web search to more accurately render specific subjects,” according to the company’s press release.

New creative controls let users keep groups of characters consistent across scenes, render images with higher resolution, and parse complex prompts.

The first version of Nano Banana became popular for making action figures out of users, and helped catapult the Gemini AI app to the top of the charts, bumping ChatGPT from its perch.

New creative controls let users keep groups of characters consistent across scenes, render images with higher resolution, and parse complex prompts.

The first version of Nano Banana became popular for making action figures out of users, and helped catapult the Gemini AI app to the top of the charts, bumping ChatGPT from its perch.

tech
Rani Molla

Tesla’s ride-hailing service is looking a lot more like Uber’s than Waymo’s

Despite numerous promises about amassing a giant network of driverless cars, so far it seems like Tesla’s Robotaxis are a lot more similar to Uber’s plain old ride-hailing service than Waymo’s expanding autonomous fleet.

In California, where Tesla has its largest ride-hailing service, the company has taken no formal steps to gain approval for a truly driverless car service, according to Reuters. Throughout 2025, Tesla failed to log a single mile of autonomous test driving on state roads, and has not applied for the necessary permits to test or deploy vehicles without a human present. Currently, Tesla holds only a basic permit that requires a human safety monitor to remain in the driver’s seat at all times.

Currently, Tesla’s California Robotaxi service consists of roughly 300 Teslas operated by human drivers using the company’s supervised Full Self-Driving tech. In Austin, where the company has about 45 vehicles, Tesla made a big show earlier this year of announcing it was removing the safety monitors sitting in the front seats during rides. However, to date, only a handful of those vehicles have been reported to be actually operating without a safety monitor onboard.

In other words, it’s performing a service more akin to a tech-heavy Uber ride than the one operated by Alphabet subsidiary Waymo, which earlier this week announced it now has driverless rides available to the public in 10 markets. Even Uber is trying to put space between itself and the old driver-having Ubers of yore: this week its autonomous software partner said the company plans to launch a driverless service in London this year, with plans for 10 markets.

During its earnings report last month, Tesla said it planned to offer Robotaxi service in a half dozen new cities in the first half of this year, including Phoenix, Miami, and Las Vegas. Judging by Tesla’s progress so far, it’s likely those services will also feature a human in the front seat.

In California, where Tesla has its largest ride-hailing service, the company has taken no formal steps to gain approval for a truly driverless car service, according to Reuters. Throughout 2025, Tesla failed to log a single mile of autonomous test driving on state roads, and has not applied for the necessary permits to test or deploy vehicles without a human present. Currently, Tesla holds only a basic permit that requires a human safety monitor to remain in the driver’s seat at all times.

Currently, Tesla’s California Robotaxi service consists of roughly 300 Teslas operated by human drivers using the company’s supervised Full Self-Driving tech. In Austin, where the company has about 45 vehicles, Tesla made a big show earlier this year of announcing it was removing the safety monitors sitting in the front seats during rides. However, to date, only a handful of those vehicles have been reported to be actually operating without a safety monitor onboard.

In other words, it’s performing a service more akin to a tech-heavy Uber ride than the one operated by Alphabet subsidiary Waymo, which earlier this week announced it now has driverless rides available to the public in 10 markets. Even Uber is trying to put space between itself and the old driver-having Ubers of yore: this week its autonomous software partner said the company plans to launch a driverless service in London this year, with plans for 10 markets.

During its earnings report last month, Tesla said it planned to offer Robotaxi service in a half dozen new cities in the first half of this year, including Phoenix, Miami, and Las Vegas. Judging by Tesla’s progress so far, it’s likely those services will also feature a human in the front seat.

Latest Stories

Sherwood Media, LLC produces fresh and unique perspectives on topical financial news and is a fully owned subsidiary of Robinhood Markets, Inc., and any views expressed here do not necessarily reflect the views of any other Robinhood affiliate, including Robinhood Markets, Inc., Robinhood Financial LLC, Robinhood Securities, LLC, Robinhood Crypto, LLC, or Robinhood Money, LLC.