(Greg Baker/Getty Images)

DeepSeek’s $6 million AI model just blew a $1 trillion hole in the market. Here’s the only explainer you’ll need on this “Sputnik moment”

A fast-moving story is shaking up the AI industry in many different ways.

1/27/25 11:41AM

Over the weekend, the DeepSeek AI story really exploded. There are a lot of different aspects to this story that strike right at the heart of the moment of this AI frenzy from the biggest tech companies in the world. Let’s break this complicated but fascinating story down.

To catch you up, Chinese startup DeepSeek released a group of new “DeepSeek R1” AI models, which have burst onto the scene and caused the entire AI industry (and the investors giving them billions to spend freely) to freak out in different ways. These models are free, mostly open-source, and appear to be beating the latest state-of-the-art models from OpenAI and Meta.

Faster, cheaper, better

What makes these models so noteworthy? Unlike OpenAI and Anthropic’s AI models, they are free for anyone to download, refine, and use for any purpose. Meta did a similar thing with its Llama 3 AI model, making it free for anyone to download, modify, and use. DeepSeek’s latest models were actually based off Llama. But there are lots of free models you can use today that are all pretty good.

The big thing that makes DeepSeek’s latest R1 models special is that they use multistep “reasoning,” just like OpenAI’s o1 models, which up until last week were considered best in class. The reasoning process is a bit slower, but it leads to better responses and reveals a “chain of thought” that shows the steps it takes.

DeepSeek is offering up models with the same secret sauce that OpenAI is charging a significant amount for. And OpenAI offers its models only on its own hosted platform, meaning companies can’t just download and host their own AI servers and control the data that flows to the model. With DeepSeek, you can host this on your own hardware and control your own stack, which obviously appeals to a lot of industries with sensitive data.

DeepSeek does offer hosted access to its models, too, but at a fraction of the cost of OpenAI. For example, OpenAI charges $15 per 1 million input “tokens” (pieces of text that get entered into a chat, which could be a word or letter in a sentence). But DeepSeek’s hosted model charges just $0.14 for 1 million input tokens. That’s a jaw-dropping difference if you’re running any kind of volume of AI queries.

Another crazy part of this story — and the one that’s likely moving the market today — is how this Chinese startup built this model. DeepSeek’s researchers said it cost only $5.6 million to train their foundational DeepSeek-V3 model, using just 2,048 Nvidia H800 GPUs (which were apparently acquired before the US slapped export restrictions on them).

For comparison, Meta has been hoarding more than 600,000 of the more powerful Nvidia H100 GPUs, and plans on ending the year with more than 1.3 million GPUs. DeepSeek’s V3 model was trained using 2.78 million GPU hours (a sum of the computing time required for training) while Meta’s Llama 3 took 30.8 million GPU hours.

And this faster, cheaper approach didn’t just result in a model that matched the leaders’ models; in some cases, it beat them. DeepSeek’s R1 models are beating OpenAI o1 in some math and coding benchmarks.

Did we bet on the wrong horse?

So a better, faster, cheaper Chinese AI model just dropped, and it could upend the industry’s big plans for the next generation of AI models. The biggest tech companies (Meta, Microsoft, Amazon, and Google) have been bracing their investors for years of massive capital expenditures because of the consensus that more GPUs and more data leads to exponential leaps in AI model capabilities. Recently, there are signs that this “AI scaling law” may have reached a plateau, and Nvidia’s place at the top of the AI food chain may be in peril.

A lot of the success DeepSeek had was a result of its using other AI models to generate “synthetic data” to train its models, rather than hunting for new stores of human-written texts.

If that bet on zillions of GPUs, Manhattan-size data centers, and hundreds of billions in AI infrastructure investment is wrong, what are we doing here? Cue the massive freak-out in the market today.

Top of the App Store

As if this story couldn’t get any crazier, this weekend the DeepSeek chatbot app soared to the top of the iOS App Store “Free Apps” list. Observers are calling this a “Sputnik moment” in the global race for AI dominance, but there are a lot of things we don’t know.

One thing we do know is that for all of Washington’s freak-out over TikTok leaking Americans’ personal data to China, this AI chatbot is absolutely sending your data to China, and is even subject to Chinese censorship policies. So don’t go asking DeepSeek about Tiananmen Square, the plight of Uyghurs in China, or Taiwan’s pro-democracy movement, and who knows what else.

Fallout

This weekend, The Information reported that inside Meta they’re indeed freaking out, setting up war rooms and rethinking AI strategy.

The new Trump administration is not going to like this, either, as it’s highlighted a vision of American domination of AI and plans to expedite approvals for new power plants and infrastructure to build massive data centers.

It’s unclear how the admin and lawmakers will react to these developments, but events are moving much faster than any branch of government can.

Rani Molla2h

Elon Musk’s SpaceX reportedly in talks to merge with xAI

Tesla CEO Elon Musk is reportedly exploring a merger between SpaceX and his artificial intelligence startup xAI, a move that would bundle rockets, satellites, the social media site X, and AI under one company ahead of SpaceX’s long-anticipated IPO.

According to Reuters reporting, the deal would swap xAI shares for SpaceX stock, potentially valuing the combined operation north of $1 trillion.

Reuters reports:

Two entities have been set up in Nevada to facilitate the transaction, the person said.

Reuters could not determine the value of the deal, its ‌primary rationale, or its potential timing.

Corporate filings in Nevada show that those entities were set up on January 21. One of them, a limited liability company, lists SpaceX and Bret Johnsen, the company's chief financial officer, as managing members, while the other lists Johnsen as the company's only officer, the filings show.

The combined companies could also set the narrative groundwork for putting data centers in space — an idea that Musk and a number of other tech billionaires have been floating lately but that may not get off the ground.

In its earnings filings yesterday, Tesla disclosed that it recently made a $2 billion investment in xAI. Last year Musk’s xAI bought Musk’s X in an all-stock deal.

Exclusive: Musk's SpaceX in merger talks with xAI ahead of planned IPO, source says

Reuters reports:

Two entities have been set up in Nevada to facilitate the transaction, the person said.

Reuters could not determine the value of the deal, its ‌primary rationale, or its potential timing.

Corporate filings in Nevada show that those entities were set up on January 21. One of them, a limited liability company, lists SpaceX and Bret Johnsen, the company's chief financial officer, as managing members, while the other lists Johnsen as the company's only officer, the filings show.

In its earnings filings yesterday, Tesla disclosed that it recently made a $2 billion investment in xAI. Last year Musk’s xAI bought Musk’s X in an all-stock deal.

Translating Microsoft’s CEO Satya Nadella

Translating Nadella’s jargon to understand his strategy for meeting intense demand for AI computing.

Jon Keegan2h

Rani Molla3h

Driverless Waymo struck a child near school in California

A Google Waymo struck a child near a Santa Monica elementary school during morning drop-off last week, as self-driving cars by Waymo, Tesla, and others continue their expansion across the country. In a blog post, Waymo said the fully driverless car detected the child as they emerged from behind a parked SUV, braked sharply, and reduced speed from approximately 17 mph to under 6 mph before striking the child. The child suffered minor injuries and walked away.

The company reported the incident to the National Highway Traffic Safety Administration, which is currently investigating, adding fresh scrutiny to how robotaxis perform in the wild.

Jon Keegan4h

Digging into Microsoft’s cloud backlog

Microsoft’s Azure cloud computing unit is seeing huge demand. In yesterday’s second-quarter earnings call, Microsoft CFO Amy Hood said the company’s commercial bookings increased 230% thanks to large commitments from OpenAI and Anthropic and healthy demand for its Azure cloud computing platform.

Hood said that the company’s “remaining performance obligations” (RPO) ballooned to a staggering $625 billion, up 110% from the same period last year. How long will it take for Microsoft to fulfill these booked services? Hood said the weighted average duration was “approximately two and a half years,” but a quarter of that will be recognized in revenue in the next 12 months.

Shares of Microsoft tanked today, down over 11%, despite the strong beat on revenue and earnings. The drop puts the stock on track to have its worst single-day drop since March of 2020.

Investors may be concerned that while huge, that extra demand was coming only from OpenAI, an issue that Oracle recently experienced.

But Hood said the non-OpenAI RPO still grew 28% year on year, which reflects “ongoing broad customer demand across the portfolio.”

Meta and Tesla are funding the future with their core businesses — but only one of them is still growing

The two tech giants, on back-to-back earnings calls, made it sound like they’re selling the same AI-powered future. But the picture of the underlying businesses, and how they’re using AI to furnish current sales, couldn’t be more different.

Rani Molla5h