Tech
DeepSeek And Nvidia Logos
(VCG/Getty Images)

The trillion-dollar mystery surrounding DeepSeek’s Nvidia GPUs

There’s a cloud of suspicion hanging over the type and number of Nvidia GPUs DeepSeek used to train its R1 models.

At the center of the story of DeepSeek’s breakthrough achievement with its R1 models lies the Nvidia hardware that powered the servers that trained those models.

In December 2024, DeepSeek researchers released a paper that outlined the development and capabilities of the new DeepSeek-V3 large language model. In the paper, the researchers said they were able to train their powerful, efficient model over 2.78 million GPU hours of computing time on a cluster of only 2,048 Nvidia H800 GPUs. That is a very small number of GPUs for a model that matched or beat OpenAI’s state-of-the-art o1 model in some benchmarks.

For comparison, Meta trained its Llama 3.1 models on two clusters, using a total of 39.3 million GPU hours with 49,152 Nvidia H100 GPUs. Last week, Mark Zuckerberg said that Meta is planning on ending 2025 with over 1.3 million GPUs.

Released in 2023, the H800 is a GPU thats similar to the H100 but is tailored for the Chinese market to comply with US export controls concerning national security parameters that the Biden administration rolled out in 2022. Reuters reported that the main thing Nvidia changed in the H800 was that it “reduced the chip-to-chip data transfer rate to about half the rate.”

But The Wall Street Journal reports that government officials found the H800 exploited technical loopholes that met the strict requirements of the ban, but still gave Chinese buyers very powerful AI chips. To close the loophole, in October 2023, the US government banned the export of H800s as well.

It appears that DeepSeek was able to acquire its H800s during that short window of availability.

DeepSeek’s claims are drawing suspicion from some observers in the AI industry, but most appear to be just speculation. Scale AI CEO Alexandr Wang told CNBC that he suspected DeepSeek has “about 50,000 H100s, which they can’t talk about obviously because it is against the export controls that the United States has put in place,” and in a tweet, Elon Musk replied, “Obviously.” Musk, meanwhile, has bragged about xAI’s “Colossus supercluster,” which is powered by 100,000 H100 GPUs, and that he plans to scale up to 1 million of the expensive Nvidia chips.

There have been reports of H100s being smuggled into China through a series of intermediaries on the black market, but no evidence that DeepSeek did so.

Adding to the confusion, DeepSeek cofounder Liang Wenfeng said that the company does own a cluster of 10,000 Nvidia A100 GPUs, a cheaper and less powerful AI chip.

The H100 has earned a status of being one of the most coveted pieces of computer hardware in the AI age. Even when other chips are used, the power is sometimes expressed as a number of “H100-equivalent” GPUs.

Nvidia is in the process of rolling out its next-gen H200 Blackwell GPUs, and last year CEO Jensen Huang hand-delivered the first DGX H200 server to OpenAI headquarters.

More Tech

See all Tech
ChatGPT Is Down

Is OpenAI on its way to becoming Lyft?

Once nearly synonymous with AI, it just got surpassed in valuation by Anthropic. Now it looks like it’s also going to get beaten to the IPO starting line.

tech

Palo Alto Networks surges after it beats revenue and earnings estimates

Cybersecurity firm Palo Alto Networks jumped more than 10% in postmarket trading after reporting fiscal third-quarter results that beat analyst revenue and earnings expectations.

The company posted adjusted earnings per share of $0.85, versus the FactSet analyst consensus estimate of $0.79 on $3 billion in revenue. (Wall Street had expected $2.94 billion.)

The company also boosted its guidance for the full fiscal year. The company now expects non-GAAP EPS in the range of $3.77 to $3.79, compared to its previous projection of $3.65 to $3.70 (and analysts’ expectations of $3.68). It also forecast revenue of $11.415 billion to $11.425 billion, representing year-over-year growth of 24%, compared to previous growth expectations of 22% to 23%.

Through Tuesday’s close, the stock had risen more than 60% in the past month.

tech

Microsoft releases 7 new models, next-gen quantum chip at Build conference

Microsoft is making it clear it can stand on its own as a competitor in the AI arena.

Today at its annual Microsoft Build developer conference, the company made a flurry of announcements that move it further away from the shadow of its complicated relationship with partner OpenAI.

Among the products announced:

  • New Nvidia-powered Windows PCs: the Surface Laptop Ultra and Surface RTX Spark Dev Box.

  • Seven new homegrown AI models: MAI Image-2.5, MAI Image-2.5-Flash, MAIN Transcribe-1.5, MAI Thinking-1, MAI Voice-2, MAIN Voice-2-Flash, and MAI Code-1-Flash.

  • Majorana 2, the company’s next-gen quantum chip.

  • Microsoft Scout, an integrated always-on agent built on OpenClaw.

  • Project Solara, an AI gadget operating system.

Investors were unimpressed, however, as shares were down over 4% after the announcements.

  • New Nvidia-powered Windows PCs: the Surface Laptop Ultra and Surface RTX Spark Dev Box.

  • Seven new homegrown AI models: MAI Image-2.5, MAI Image-2.5-Flash, MAIN Transcribe-1.5, MAI Thinking-1, MAI Voice-2, MAIN Voice-2-Flash, and MAI Code-1-Flash.

  • Majorana 2, the company’s next-gen quantum chip.

  • Microsoft Scout, an integrated always-on agent built on OpenClaw.

  • Project Solara, an AI gadget operating system.

Investors were unimpressed, however, as shares were down over 4% after the announcements.

tech

Amazon’s Prime Day is coming early this year

Amazon is moving its four-day Prime Day event up from July, where it’s been for the last five years, to June 23 through 26.

The retail giant cites scheduling clashes with the FIFA World Cup and the 250th anniversary of the signing of the Declaration of Independence as reasons for the move. Prime Day is one of Amazon’s biggest sales events of the year, helping drive $24.1 billion in US online spending last year, according to Adobe Analytics.

More concretely, the move means Amazon will pull a massive chunk of sales from one of its biggest events into Q2, which ends June 30, rather than Q3.

Beyond the top-line revenue shift, Amazon is also using the event to flex its newer strategic muscles, aggressively cross-promoting its same-day grocery delivery networks and its Amazon Haul discount storefront.

Latest Stories

Sherwood Media, LLC and Chartr Limited produce fresh and unique perspectives on topical financial news and are fully owned subsidiaries of Robinhood Markets, Inc., and any views expressed here do not necessarily reflect the views of any other Robinhood affiliate, including Robinhood Markets, Inc., Robinhood Financial LLC, Robinhood Securities, LLC, Robinhood Crypto, LLC, Robinhood Money, LLC, Robinhood U.K. Ltd, Robinhood Derivatives, LLC, Robinhood Gold, LLC, Robinhood Asset Management, LLC, Robinhood Credit, Inc., Robinhood Ventures DE, LLC and, where applicable, its managed investment vehicles.