The trillion-dollar mystery surrounding DeepSeek’s Nvidia GPUs

There’s a cloud of suspicion hanging over the type and number of Nvidia GPUs DeepSeek used to train its R1 models.

At the center of the story of DeepSeek’s breakthrough achievement with its R1 models lies the Nvidia hardware that powered the servers that trained those models.

In December 2024, DeepSeek researchers released a paper that outlined the development and capabilities of the new DeepSeek-V3 large language model. In the paper, the researchers said they were able to train their powerful, efficient model over 2.78 million GPU hours of computing time on a cluster of only 2,048 Nvidia H800 GPUs. That is a very small number of GPUs for a model that matched or beat OpenAI’s state-of-the-art o1 model in some benchmarks.

For comparison, Meta trained its Llama 3.1 models on two clusters, using a total of 39.3 million GPU hours with 49,152 Nvidia H100 GPUs. Last week, Mark Zuckerberg said that Meta is planning on ending 2025 with over 1.3 million GPUs.
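To put those figures side by side, here is a rough back-of-the-envelope sketch using only the numbers quoted above. The helper name `wall_clock_days` is illustrative, and the calculation assumes all 2,048 GPUs ran continuously for the whole training run, which the paper as described here does not confirm.

```python
# Back-of-the-envelope comparison using the figures quoted in the article.
# Assumption: every GPU in the DeepSeek cluster ran for the full duration.

def wall_clock_days(gpu_hours: float, gpu_count: int) -> float:
    """Days of continuous training if every GPU ran the whole time."""
    return gpu_hours / gpu_count / 24

deepseek_gpu_hours = 2.78e6   # DeepSeek-V3, as reported
deepseek_gpus = 2048          # Nvidia H800s
llama_gpu_hours = 39.3e6      # Llama 3.1 models, total
llama_gpus = 49152            # Nvidia H100s across two clusters

deepseek_days = wall_clock_days(deepseek_gpu_hours, deepseek_gpus)
gpu_hour_ratio = llama_gpu_hours / deepseek_gpu_hours
gpu_count_ratio = llama_gpus / deepseek_gpus

print(f"DeepSeek-V3 wall-clock estimate: {deepseek_days:.1f} days")
print(f"Llama 3.1 used {gpu_hour_ratio:.1f}x the GPU hours")
print(f"Llama 3.1 clusters had {gpu_count_ratio:.0f}x the GPUs")
```

Under that assumption, the reported run works out to a little under two months of wall-clock time, with Meta using roughly 14 times the GPU hours on 24 times as many GPUs.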

Released in 2023, the H800 is a GPU that’s similar to the H100 but tailored for the Chinese market to comply with the national security export controls that the Biden administration rolled out in 2022. Reuters reported that the main thing Nvidia changed in the H800 was that it “reduced the chip-to-chip data transfer rate to about half the rate.”

But The Wall Street Journal reports that government officials found the H800 exploited a technical loophole: it met the strict requirements of the ban but still gave Chinese buyers very powerful AI chips. To close the loophole, in October 2023, the US government banned the export of H800s as well.

It appears that DeepSeek was able to acquire its H800s during that short window of availability.

DeepSeek’s claims are drawing suspicion from some observers in the AI industry, but most of it appears to be just speculation. Scale AI CEO Alexandr Wang told CNBC that he suspected DeepSeek has “about 50,000 H100s, which they can’t talk about obviously because it is against the export controls that the United States has put in place,” and in a tweet, Elon Musk replied, “Obviously.” Musk, meanwhile, has bragged about xAI’s “Colossus supercluster,” which is powered by 100,000 H100 GPUs, and has said he plans to scale it up to 1 million of the expensive Nvidia chips.

There have been reports of H100s being smuggled into China through a series of intermediaries on the black market, but no evidence that DeepSeek did so.

Adding to the confusion, DeepSeek cofounder Liang Wenfeng said that the company does own a cluster of 10,000 Nvidia A100 GPUs, a cheaper and less powerful AI chip.

The H100 has earned a reputation as one of the most coveted pieces of computer hardware in the AI age. Even when other chips are used, their computing power is sometimes expressed as a number of “H100-equivalent” GPUs.

Nvidia is in the process of rolling out its next-gen Blackwell GPUs, and last year CEO Jensen Huang hand-delivered the first DGX H200 server to OpenAI headquarters.

More Tech

tech

Anthropic launches “Claude Design,” sending shares of Figma and Adobe down

Anthropic has been slowly and steadily gaining a leading share in the enterprise AI market by focusing on coding, spreadsheets, and other common productivity and workplace apps.

Now it’s going after design apps.

Today Anthropic launched Claude Design, a dedicated app powered by its latest model, Claude Opus 4.7, that lets users use text prompts to build website designs, user interface prototypes, presentations, and marketing materials.

Shares of Figma and Adobe sank on the news.

While Claude has previously had the ability to create designs and user interfaces, breaking it out into a dedicated app signals a major new piece of its enterprise strategy alongside its popular Claude Code product.

tech

Apple’s China iPhone shipments surged 20% in Q1 even as overall smartphone shipments fell

Apple’s iPhone shipments in China jumped 20% last quarter, even as the country’s overall smartphone market fell 4%, according to new data from Counterpoint Research. Rising memory costs have pushed prices higher across the industry, weighing on demand.

Apple appears poised to ride out the broader smartphone slump. Its strength at the less price-sensitive high end of the market and its unusual leverage over suppliers, which helps keep costs in check, give it an edge over rivals.

Greater China remains a critical region for Apple, making up about 18% of its total revenue in the fourth quarter. The company accounted for 19% of China’s smartphone market in the first quarter, up from 15% a year earlier, per Counterpoint.

tech
Rani Molla

Anthropic has surged past OpenAI in capturing business spending on generative-AI software

Last quarter, Anthropic attracted the lion’s share of trackable business spending on generative-AI software, according to new data from Ramp, a fintech company that provides corporate cards and expense management software for small firms and Fortune 500 companies alike.

The data showed that in the first quarter, Anthropic saw 37% of spending, its biggest share yet, versus 33% for OpenAI. Notably, the dataset doesn’t capture spending by Google or Microsoft.

OpenAI, which makes ChatGPT, still leads in overall adoption at 81% of AI buyers, but Anthropic is catching up, at nearly 63% in March. Overall, more than half of Ramp’s customers currently pay for AI, up from just 18% two years ago.

Anthropic’s enterprise tools, including Claude Code and Cowork, have been making waves among the business class, sending its revenue soaring.

Anthropic’s revenue share is even higher among companies spending on AI for the first time.

“Anthropic has definitely been on a tear,” Ara Kharazian, Ramp’s economist, told Sherwood News. “Its increase in adoption rates has been driven by its ability to sell to less technical users and smaller contracts than it typically has.”

It’s notable that midway through the first quarter, Anthropic had a falling-out with one of its biggest customers, the US government, which near the end of February decided to shun Anthropic’s products and lean into working with OpenAI.

tech
Jon Keegan

Report: Google ditches its objection to defense work, pitches Gemini to Pentagon

In 2018, Google employees protested against the company’s tech being used for the US military’s Project Maven — a drone targeting program — reminding the company of its “don’t be evil” motto.

After the controversy, the company declined to renew the contract with the Pentagon, drawing a bright line between Big Tech and the national security establishment.

What a difference a few years makes.

Google is now actively working to get its Gemini AI model to be used in classified national security settings, according to a new report from The Information. Seeking a similar deal to the one OpenAI hashed out with the Pentagon, Google reportedly wants a contract that allows use of Gemini in classified work, but with a prohibition on mass domestic surveillance and autonomous lethal weapons.

But Google is playing catch-up in a major way. Amazon and Microsoft both have been widely used for classified defense work, and contractors are already experienced in working with their cloud systems, while Google’s services have never been used in classified work.

Sherwood Media, LLC produces fresh and unique perspectives on topical financial news and is a fully owned subsidiary of Robinhood Markets, Inc., and any views expressed here do not necessarily reflect the views of any other Robinhood affiliate, including Robinhood Markets, Inc., Robinhood Financial LLC, Robinhood Securities, LLC, Robinhood Crypto, LLC, Robinhood Derivatives, LLC, or Robinhood Money, LLC. Futures and event contracts are offered through Robinhood Derivatives, LLC.