(VCG/Getty Images)

The trillion-dollar mystery surrounding DeepSeek’s Nvidia GPUs

There’s a cloud of suspicion hanging over the type and number of Nvidia GPUs DeepSeek used to train its R1 models.

1/29/25 11:23AM

At the center of the story of DeepSeek’s breakthrough achievement with its R1 models lies the Nvidia hardware that powered the servers that trained those models.

In December 2024, DeepSeek researchers released a paper that outlined the development and capabilities of the new DeepSeek-V3 large language model. In the paper, the researchers said they were able to train their powerful, efficient model over 2.78 million GPU hours of computing time on a cluster of only 2,048 Nvidia H800 GPUs. That is a very small number of GPUs for a model that matched or beat OpenAI’s state-of-the-art o1 model in some benchmarks.

For comparison, Meta trained its Llama 3.1 models on two clusters, using a total of 39.3 million GPU hours with 49,152 Nvidia H100 GPUs. Last week, Mark Zuckerberg said that Meta is planning on ending 2025 with over 1.3 million GPUs.

Released in 2023, the H800 is a GPU that’s similar to the H100 but is tailored for the Chinese market to comply with US export controls concerning national security parameters that the Biden administration rolled out in 2022. Reuters reported that the main thing Nvidia changed in the H800 was that it “reduced the chip-to-chip data transfer rate to about half the rate.”

But The Wall Street Journal reports that government officials found the H800 exploited technical loopholes that met the strict requirements of the ban, but still gave Chinese buyers very powerful AI chips. To close the loophole, in October 2023, the US government banned the export of H800s as well.

It appears that DeepSeek was able to acquire its H800s during that short window of availability.

DeepSeek’s claims are drawing suspicion from some observers in the AI industry, but most appear to be just speculation. Scale AI CEO Alexandr Wang told CNBC that he suspected DeepSeek has “about 50,000 H100s, which they can’t talk about obviously because it is against the export controls that the United States has put in place,” and in a tweet, Elon Musk replied, “Obviously.” Musk, meanwhile, has bragged about xAI’s “Colossus supercluster,” which is powered by 100,000 H100 GPUs, and that he plans to scale up to 1 million of the expensive Nvidia chips.

There have been reports of H100s being smuggled into China through a series of intermediaries on the black market, but no evidence that DeepSeek did so.

Adding to the confusion, DeepSeek cofounder Liang Wenfeng said that the company does own a cluster of 10,000 Nvidia A100 GPUs, a cheaper and less powerful AI chip.

The H100 has earned a status of being one of the most coveted pieces of computer hardware in the AI age. Even when other chips are used, the power is sometimes expressed as a number of “H100-equivalent” GPUs.

Nvidia is in the process of rolling out its next-gen H200 Blackwell GPUs, and last year CEO Jensen Huang hand-delivered the first DGX H200 server to OpenAI headquarters.

Rani Molla29m

Driverless Waymo struck a child near school in California

A Google Waymo struck a child near a Santa Monica elementary school during morning drop-off last week, as self-driving cars by Waymo, Tesla, and others continue their expansion across the country. In a blog post, Waymo said the fully driverless car detected the child as they emerged from behind a parked SUV, braked sharply, and reduced speed from approximately 17 mph to under 6 mph before striking the child. The child suffered minor injuries and walked away.

The company reported the incident to the National Highway Traffic Safety Administration, which is currently investigating, adding fresh scrutiny to how robotaxis perform in the wild.

Jon Keegan2h

Digging into Microsoft’s cloud backlog

Microsoft’s Azure cloud computing unit is seeing huge demand. In yesterday’s second-quarter earnings call, Microsoft CFO Amy Hood said the company’s commercial bookings increased 230% thanks to large commitments from OpenAI and Anthropic and healthy demand for its Azure cloud computing platform.

Hood said that the company’s “remaining performance obligations” (RPO) ballooned to a staggering $625 billion, up 110% from the same period last year. How long will it take for Microsoft to fulfill these booked services? Hood said the weighted average duration was “approximately two and a half years,” but a quarter of that will be recognized in revenue in the next 12 months.

Shares of Microsoft tanked today, down over 11%, despite the strong beat on revenue and earnings. The drop puts the stock on track to have its worst single-day drop since March of 2020.

Investors may be concerned that while huge, that extra demand was coming only from OpenAI, an issue that Oracle recently experienced.

But Hood said the non-OpenAI RPO still grew 28% year on year, which reflects “ongoing broad customer demand across the portfolio.”

Meta and Tesla are funding the future with their core businesses — but only one of them is still growing

The two tech giants, on back-to-back earnings calls, made it sound like they’re selling the same AI-powered future. But the picture of the underlying businesses, and how they’re using AI to furnish current sales, couldn’t be more different.

Rani Molla2h

Rani Molla

19h

Tesla to stop making Model S and Model X next quarter to pivot to Optimus production

Tesla is not bringing S3XY back.