Alibaba researchers devise efficient GPU pooling system, reducing GPU use 82%

Drastically reducing the number of GPUs needed to run AI models could have big consequences for the scale of huge data centers, while benefiting smaller organizations. It could also reduce demand for pricey new GPUs from Nvidia.

Jon Keegan, Matt Phillips

10/20/25 3:14PM

Researchers at Peking University and Alibaba have announced a new system that can drastically reduce GPU demand by efficiently “pooling” computing across multiple models rather than assigning each model its own GPU.

Named “Aegaeon,” the system addresses a problem with assigning computing resources to the many AI models on the market: dedicating a set of GPUs to a specific model leaves precious processing cycles underutilized when the model is not receiving a lot of requests.

In the research paper, the authors noted that a small number of popular models, like Meta’s Llama, DeepSeek, and Qwen, dominate utilization, and 17.7% of GPUs serve only 1.35% of requests. That’s a lot of wasted GPU cycles.

The researchers use a system of “token-level auto-scaling,” which assigns computing at a granular level using tokens (the smallest unit of text an LLM processes, sometimes only a few letters) rather than at the “request” level, which might see one heavy computational task holding up the queue.
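To see why the scheduling granularity matters, here is a toy sketch in Python. It is not Aegaeon's algorithm: it assumes a single GPU, treats each job as a request that needs a fixed number of tokens, and assumes switching between jobs is free (a large simplification, since the real system has to manage the cost of swapping models). The function names and the job sizes are made up for illustration.

```python
from collections import deque


def request_level(token_counts):
    """Run each request to completion in arrival order; return finish times."""
    finish, clock = [], 0
    for n in token_counts:
        clock += n  # the GPU is held until this request's last token
        finish.append(clock)
    return finish


def token_level(token_counts):
    """Serve one token at a time, round-robin; return finish times.

    Assumption: switching between requests costs nothing.
    """
    remaining = list(token_counts)
    finish = [0] * len(token_counts)
    queue = deque(i for i, n in enumerate(remaining) if n > 0)
    clock = 0
    while queue:
        i = queue.popleft()
        clock += 1  # one token of work for request i
        remaining[i] -= 1
        if remaining[i] == 0:
            finish[i] = clock
        else:
            queue.append(i)  # go to the back of the line
    return finish


# Hypothetical workload: one heavy request followed by two short ones.
jobs = [100, 2, 3]
print(request_level(jobs))  # [100, 102, 105]
print(token_level(jobs))    # [105, 5, 8]
```

In this toy case the two short requests finish at steps 5 and 8 instead of 102 and 105, while the heavy request finishes only slightly later. Total work is unchanged; what changes is that one heavy task no longer holds up the queue.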

Using the Aegaeon system in Alibaba Cloud’s production tests, the company was able to reduce GPU demand by 82%. What would normally take 1,192 GPUs, the researchers were able to do with just 213 Nvidia H20 GPUs.
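The 82% figure follows directly from the two GPU counts reported above. A quick check in Python (the variable names are ours, not the paper's):

```python
gpus_before = 1192  # GPUs the workload would normally take
gpus_after = 213    # GPUs used with Aegaeon

reduction = (gpus_before - gpus_after) / gpus_before
consolidation = gpus_before / gpus_after

print(f"{reduction:.1%}")       # 82.1%
print(f"{consolidation:.1f}x")  # 5.6x
```

Put another way, each remaining GPU is doing the work that roughly five and a half GPUs did before.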

The consequences of this system could be significant. If AI companies can do more with less, maybe those massive data centers running AI models don’t need to be so huge, and maybe they don’t have to find as many complicated financing schemes to pay for all those GPUs.

But this also means that smaller players could be more competitive, especially in places like China, where export controls are making the most powerful processors hard to come by.

It could also be bad news for Nvidia, though Aegaeon is built on Nvidia software. And on Monday, some analysts on Wall Street pointed to the reports on Aegaeon as a reason for the day’s weakness in some previously high-flying data center stocks.

Oracle was down sharply for the second straight session. Hard disk drive makers Seagate Technology Holdings and Western Digital — big beneficiaries of the data center trade this year — also declined, as did AI energy plays Constellation Energy and Vistra.

Jon Keegan, 2h ago

Apple poaches Meta’s chief legal officer

Just a day after Meta announced that it had hired away Apple’s user interface design lead, Apple has announced that it’s poached Jennifer Newstead, Meta’s chief legal officer, to become Apple’s new general counsel. Kate Adams, Apple’s general counsel since 2017, will be retiring late next year.

Apple also announced the retirement of Lisa Jackson, vice president for Environment, Policy, and Social Initiatives, who will leave the company in late January 2026.

The flurry of high-level management changes at Apple comes amid fervent speculation that CEO Tim Cook may be retiring soon.

Apple announces executive transitions

Jon Keegan, 5h ago

EU calls for bids to build “AI gigafactories” in 2026

The European Union wants to shore up its domestic AI infrastructure and reduce its dependence on American tech companies.

To further this goal, the bloc is planning on accepting bids to build EU-based “AI gigafactories,” according to a report from The Wall Street Journal.

EU Executive Vice-President for Tech Sovereignty, Security and Democracy Henna Virkkunen announced that bids would begin in January or February, per the report.

As the AI arms race heats up, countries are racing to secure their own sovereign AI infrastructure, including building their own AI models that reflect their culture and language and offer control over cloud computing resources.

Europe is lagging behind the US and Asia in AI infrastructure. But it may be hard for the EU to fully break free of American tech: unlike the US and China, Europe has no homegrown alternative to the powerful GPUs needed to train and run AI models. It’s very likely that any AI gigafactories in the EU will be filled with GPUs from Nvidia.

EU to Open Bidding for AI Gigafactories in Early 2026

Jon Keegan, 7h ago

Google’s AI chip business could be a $900 billion boon for the company

Google may be sitting on a massive new business that it has yet to fully exploit.

Google’s custom tensor processing unit (TPU) AI chips have been getting a lot of attention recently, making the tech world wonder if there are other ways to power its AI dreams rather than just by using Nvidia’s GPUs.

Bloomberg spoke with analysts who estimate that, if it does decide to sell its chips to others, Google could capture 20% of the AI market, making it a $900 billion business. For comparison, Google Cloud pulled in $43.2 billion of revenue last year.

Even if Google just sticks with renting access to its TPUs, it will continue to drive down costs and increase margins as it ekes out performance improvements, such as the 30x improvement in power efficiency that the latest generation of TPUs has delivered for the company.

Alphabet’s AI Chips Are a Potential $900 Billion ‘Secret Sauce’

Rani Molla, 8h ago

OpenAI’s Sam Altman has explored bringing his feud with Tesla’s Elon Musk to space

Billionaires, they’re just like us: they want to bring their terrestrial beefs to outer space.

OpenAI CEO Sam Altman has explored buying or partnering with a rocket company to compete with Tesla CEO Elon Musk’s SpaceX, The Wall Street Journal reports. The two billionaires have had numerous public feuds over the years that have played out in the courts and on social media. They also both lead AI companies that have insatiable needs for data centers and have publicly discussed building data centers in space.

Altman seems to think this could be more than science fiction. He reportedly reached out to rocket maker Stoke Space about potentially making equity investments in the company to gain a controlling stake, though the talks are no longer active, the WSJ reports.

Or perhaps he just wanted a Sherwood bobblehead of himself.

Jon Keegan, 8h ago

Report: Meta to slash metaverse, VR spending by up to 30%

Four years after changing its name to reflect its focus on the loosely defined “metaverse,” Meta is planning deep cuts to the company’s money-losing virtual reality efforts, according to a report from Bloomberg.

Meta’s Reality Labs division, home to the teams working on metaverse products — which include Quest VR headsets, Horizon Worlds, and its Ray-Ban Meta glasses — has lost about $70 billion since the company started breaking out the unit in 2020.

The company has struggled to get consumers to buy into CEO Mark Zuckerberg’s vision of working and playing in virtual reality worlds, like the company’s Horizon Worlds platform.

Investors seem to love the news of the pivot, as shares shot up as much as 5% in early trading.

Meta’s recent spree of hiring AI superstars away from competitors for its Meta Superintelligence Labs shows that the company is now all in on AI.

Meta’s Zuckerberg Plans Deep Cuts for Metaverse Efforts