Tech
2024-04-17-ai-capabilities-site

AI is getting good at a lot of different tasks

The state of AI

As AI has gone mainstream, companies haven’t been shy in deploying the technology — applying it to manual and repetitive tasks and even citing it as a reason for mass layoffs.

But how does AI compare to humans on technical tasks? A new report, Stanford University’s 2024 AI Index, summarizes where the burgeoning technology is at.

The headline is that recent breakthroughs have heralded an unprecedented improvement in the performance of AI models on benchmark tests. For a long time, AI has been able to tell what’s in a picture, even as websites ask us to endlessly prove we’re not a robot by clicking on images of traffic lights or stop signs.

But now, AI is doing visual reasoning and math — seriously hard math.

Indeed, the 2024 AI Index reports that models have gone from scoring less than 10% of the relative performance of humans to more than 90% in just 2 years in competition-level math. In more simple tasks, the AI models evaluated already outperform the relevant human benchmarks.

The good news for anyone worried about losing their job is that AI researchers are increasingly concerned about running out of high-quality data to train their models, with some predicting that the available supply will be exhausted by 2026. This shortage might force developers to depend increasingly on AI-generated, or 'synthetic', data for training new models. Adobe’s solution? Pay people $3 a minute for videos of them touching things.

Related reading: How ChatGPT broke the Turing test.

More Tech

See all Tech
tech

Report: SpaceX posted $18.5 billion in revenue and a $5 billion loss last year

All eyes on are SpaceX as it prepares for a blockbuster IPO as soon as this summer, and everyone is eager to get a look at the company’s official numbers for the first time.

The Information is reporting that last year, SpaceX posted $18.5 billion in revenue with a $5 billion loss.

According to the report, the numbers reflect the combined finances of SpaceX and xAI, which it acquired in February.

After acquiring xAI, SpaceX’s successful space launch and satellite business may have been dragged down by xAI’s massive data center spending. Earlier this year, Bloomberg reported that xAI had burned through $8 billion in the first nine months of 2025.

According to the report, the numbers reflect the combined finances of SpaceX and xAI, which it acquired in February.

After acquiring xAI, SpaceX’s successful space launch and satellite business may have been dragged down by xAI’s massive data center spending. Earlier this year, Bloomberg reported that xAI had burned through $8 billion in the first nine months of 2025.

tech

Report: Amazon hopes its Project Houdini modular data center plan is the trick to speed up construction

Amazon is looking for a magic trick that can help it get past data center construction bottlenecks so it can work through the $244 billion worth of cloud computing backlogs it wants to deliver.

It may have just pulled a rabbit out of its hat. (I know, groan.)

Business Insider is reporting that Amazon’s Project Houdini seeks to slash labor costs and installation time by building modular “data halls” — the rows of racks of servers that make up the heart of data centers — in factories, and then shipping them fully assembled on trailers to data center sites.

According to the report, the modular plan would save weeks of construction time and tens of thousands of hours of labor costs.

This week in Amazon’s letter to shareholders, CEO Andy Jassy wrote that the company is planning $200 billion in capital expenditure this year, and that it is embracing its tradition of taking big bets on experiments like Project Houdini:

“You need to invent and experiment like crazy. Many of these experiments will fail, and it might feel like you’re getting nowhere. But, your culture must possess the tenacity to keep at it.”

Business Insider is reporting that Amazon’s Project Houdini seeks to slash labor costs and installation time by building modular “data halls” — the rows of racks of servers that make up the heart of data centers — in factories, and then shipping them fully assembled on trailers to data center sites.

According to the report, the modular plan would save weeks of construction time and tens of thousands of hours of labor costs.

This week in Amazon’s letter to shareholders, CEO Andy Jassy wrote that the company is planning $200 billion in capital expenditure this year, and that it is embracing its tradition of taking big bets on experiments like Project Houdini:

“You need to invent and experiment like crazy. Many of these experiments will fail, and it might feel like you’re getting nowhere. But, your culture must possess the tenacity to keep at it.”

tech

Creator of popular, mysterious “HappyHorse” text-to-video model is Alibaba

AI benchmark leaderboards are often where mysterious new models make their debut, stoking speculation about the unnamed companies behind them.

That was the case with an impressive new text-to-video model named HappyHorse-1.0 that shot to the top of public leaderboards. CNBC reports that Chinese tech giant Alibaba has confirmed that it is the owner of the new model.

HappyHorse beat out the popular Seedance model from rival ByteDance in blind human evaluations to claim the top spot on the Artificial Analysis text-to-video leaderboard.

While OpenAI has announced it is shuttering its text-to-video Sora app, the category continues to see intense competition as a flurry of video models improve with more realistic physics and cinematic effects.

HappyHorse beat out the popular Seedance model from rival ByteDance in blind human evaluations to claim the top spot on the Artificial Analysis text-to-video leaderboard.

While OpenAI has announced it is shuttering its text-to-video Sora app, the category continues to see intense competition as a flurry of video models improve with more realistic physics and cinematic effects.

tech

OpenAI: Our new AI tool is too dangerous to release, too!

This week, Anthropic warned that it had developed a new model that was too dangerous to cybersecurity to be released to the public.

According to a new report, OpenAI is saying similar things about a new cybersecurity tool it is working on (separate from its rumored forthcoming Spud model).

Axios wrote that OpenAI is allowing a small group of partners to test its new AI tool, which has “advanced cybersecurity capabilities.”

The realization that we have arrived at an era of powerful new AI models that could overwhelm current cybersecurity defenses is spooking investors, with cybersecurity stocks like Cloudflare, Zscaler, CrowdStrike, and Palo Alto Networks all down sharply this morning.

Axios wrote that OpenAI is allowing a small group of partners to test its new AI tool, which has “advanced cybersecurity capabilities.”

The realization that we have arrived at an era of powerful new AI models that could overwhelm current cybersecurity defenses is spooking investors, with cybersecurity stocks like Cloudflare, Zscaler, CrowdStrike, and Palo Alto Networks all down sharply this morning.

tech

OpenAI’s Stargate shrinks further as UK data center “paused”

OpenAI’s ambitious Stargate global data center project just got smaller.

First announced at the White House alongside President Trump at the start of his second term, the OpenAI partnership with Oracle and SoftBank sought to build massive data centers around the world, including sites in the UAE, the UK, and Norway.

Bloomberg reports that the company is “pausing” the Stargate UK project, citing high energy costs and regulatory obstacles.

Last month, the company and its partner Oracle scrapped its planned expansion of the Stargate I data center site in Abilene, Texas.

In a statement to Bloomberg, the company said:

“AI compute is foundational to that goal — we continue to explore Stargate UK and will move forward when the right conditions such as regulation and the cost of energy enable long-term infrastructure investment.”

Stargate UK was announced in September, including a partnership with Nvidia and Nscale that would scale up to 31,000 GPUs.

Bloomberg reports that the company is “pausing” the Stargate UK project, citing high energy costs and regulatory obstacles.

Last month, the company and its partner Oracle scrapped its planned expansion of the Stargate I data center site in Abilene, Texas.

In a statement to Bloomberg, the company said:

“AI compute is foundational to that goal — we continue to explore Stargate UK and will move forward when the right conditions such as regulation and the cost of energy enable long-term infrastructure investment.”

Stargate UK was announced in September, including a partnership with Nvidia and Nscale that would scale up to 31,000 GPUs.

Latest Stories

Sherwood Media, LLC produces fresh and unique perspectives on topical financial news and is a fully owned subsidiary of Robinhood Markets, Inc., and any views expressed here do not necessarily reflect the views of any other Robinhood affiliate, including Robinhood Markets, Inc., Robinhood Financial LLC, Robinhood Securities, LLC, Robinhood Crypto, LLC, Robinhood Derivatives, LLC, or Robinhood Money, LLC. Futures and event contracts are offered through Robinhood Derivatives, LLC.