A screengrab from Mark Zuckerberg’s Instagram post demonstrating Llama AI/Instagram

Platformer

Meta’s radical plan for AI is Mark Zuckerberg’s love letter to nerd culture

Its open-source AI is now roughly on par with rivals from OpenAI and Google, but the company is taking a decidedly more hacker-first approach to the future.

Casey Newton

7/24/24 8:04AM

Today, Meta released the largest ever open-source large language model to power generative artificial intelligence applications. Llama 3.1 represents an ambitious effort to shape the development of the AI industry in ways that favor Meta. It is also likely to spark new conversations around whether open source models are likely to generate more harm than their closed counterparts.

In January I wrote about Meta’s unusual plan to make available for free a technology that it has spent more than a decade and tens of billions of dollars to build. Unlike its peers, including Google and OpenAI, Meta is not selling subscriptions to its AI for individuals or teams, and says it has no plans to do so.

In the short term, Meta says that open-sourcing Llama helps its own systems get better more quickly and cheaply than they otherwise would. Inviting a global ecosystem of developers to iterate and improve on its models, and funnel those improvements back into Meta’s core systems, accelerates the development of Meta’s own products and systems and can save it money in doing so.

In the long-term, owning one of the most powerful LLMs could provide the company with a basis for all manner of money-making products. It could also — and Meta lets this part go unsaid — thwart the development of competitors like OpenAI by giving away their core business for free. (One way to think of Llama 3.1 is that it is the free Google Docs to Microsoft’s paid Office 365.)

To make that happen, of course, Meta’s AI needs to be among the best in its class. The company said today that it is making strides in that direction: Llama 3.1’s largest, 405-billion parameter model outperformed OpenAI’s GPT-4 Omni and Claude 3.5 Sonnet on some benchmarks, it said.

“If you compare them to the GPT family, if you compare them to the Claude family, if you compare them to Gemini — I'll let the numbers speak for themselves,” Chris Cox, Meta’s chief privacy officer, said in an interview with Platformer. “But we're pretty proud now that some of them we’re beating, and then on some of them we’re just in range of best-in-class models. So that's really cool.”

“If you compare them to the GPT family, if you compare them to the Claude family, if you compare them to Gemini — I'll let the numbers speak for themselves.”

And while developers have only had it in their hands for a few hours, the early reception has been positive. On Hacker News, some developers found that Llama 3.1 performed at or near GPT-4o’s level, according to comments posted there.

People with extremely high-end hardware may be able to host the largest Llama 3.1 model on their own hardware. But most developers will want to use a hosted service, and to that end today Meta announced partnerships with 25 companies that will make Llama 3.1 available to use through their cloud computing platforms, including Amazon Web Services, Google Cloud, Microsoft Azure, Databricks, and Dell.

The fact that an open-source model now rivals closed alternatives speaks to the way that every major AI developer’s LLMs are converging on one another in quality. Seemingly every few weeks now, one of the big AI players releases a new model, or variation of a model, with slightly improved cost, performance, or other attributes. And for the most part, at least in my testing of the major players, the resulting products are mostly indistinguishable from one another.

That’s bad news if you’re selling a subscription for $20 a month, as OpenAI, Google, and Anthropic all are. But it’s great news for Meta, which can expedite the development of its own systems and undercut its rivals at the same time, all without damaging its core advertising business.

If there are risks to Meta here, they lie in the potential for regulation and damage to its reputation. That’s because open-source systems, by their nature, allow anyone to take them and repurpose them for their own ends, no matter what they might be. Fears that the open development of superhuman systems could create existential risk for humanity are among the reasons that OpenAI abandoned that approach in favor of a closed one.

If there are risks to Meta here, they lie in the potential for regulation and damage to its reputation.

Over the past two years, open-source vs. closed development has become one of the most hotly debated issues in tech. The Biden administration has not definitively favored one approach over the other. But in its executive order last year, the administration did call for projects that use past a certain computing threshold to disclose that fact to the government and to perform safety testing. Some open-source advocates argue that the administration is laying the groundwork for further restrictions that will effectively outlaw open-source, and in so doing create a tiny cartel of vastly powerful AI companies that will reap most of the benefits of the technology.

Among the voices making this argument is Marc Andreessen, the venture capitalist and longtime Meta board member. It was among the reasons that he and his business partner Ben Horowitz offered a full-throated endorsement of Donald Trump for president last week.

Zuckerberg has reached a similar conclusion. In a kind of manifesto published today, Meta’s CEO argued that open source development is the best approach not only for Meta’s business, but for the world at large.

He writes:

It’s worth noting that unintentional harm covers the majority of concerns people have around AI – ranging from what influence AI systems will have on the billions of people who will use them to most of the truly catastrophic science fiction scenarios for humanity. On this front, open source should be significantly safer since the systems are more transparent and can be widely scrutinized. Historically, open source software has been more secure for this reason.

Zuckerberg also suggests that there are few harms that can be surfaced through AI today that you can’t already Google. “We must keep in mind that these models are trained by information that’s already on the internet, so the starting point when considering harm should be whether a model can facilitate more harm than information that can quickly be retrieved from Google or other search results,” he writes.

These arguments seem sound enough today, when the models are barely capable of doing more than regurgitating their training data. It’s less clear how they will sound in a future, when open-source software can reason, scheme, and execute complex plans — no matter what those plans might be.

And Meta is among the companies building toward that future. Cox told me that the next radical advancements in LLMs will come when they can master three skills: solving problems that require multiple steps; personalizing themselves to their users and developing a more complete sense of who their users are and what they need; and successfully taking action on your behalf online.

Should LLMs develop those skills, and become part of the Llama of the future, Meta will surely attempt to put safeguards around it to prevent it from being misused. But the nature of open source technology is that the moment Llama 3.1 goes online, it is out of Meta’s hands. And where its rivals will retain broad power to shut down rogue systems, it’s less clear that Meta will be able to build a similar killswitch.

Open source development has served Meta’s interests well so far, and the relatively benign AI tools we have today give us little reason to fear what Llama 3.1 can do. But it’s hard to make good policy for a technology that is improving in exponential leaps every few years. As AI grows more powerful, Meta’s case for open-source development will be worth revisiting.

Rani Molla

Microsoft falls as it lays out plan for “Humanist Superintelligence” without OpenAI

Microsoft AI’s CEO explained its vision for superintelligence that prioritizes human control.

Microsoft Celebrates Its 50th Anniversary With Copilot AI Event

Jon Keegan3h

Meta projected 10% of 2024 revenue came from scams and banned goods, Reuters reports

Meta has been making billions of dollars per year from scam ads and sales of banned goods, according internal Meta documents seen by Reuters.

The new report quantifies the scale of fraud taking place on Meta’s platforms, and how much the company profited from them.

Per the report, Meta internal projections from late last year said that 10% of the company’s total 2024 revenue would come from scammy ads and sales of banned goods — which works out to $16 billion.

Discussions within Meta acknowledged the steep fines likely to be levied against the company for not stopping the fraudulent behavior on its platforms, and the company prioritized enforcement in regions where the penalties would be steepest, the reporting found. The cost of lost revenue from clamping down on the scams was weighed against the cost of fines from regulators.

The documents reportedly show that Meta did aim to significantly reduce the fraudulent behavior, but cuts to its moderation team left the vast majority of user-reported violations to be ignored or rejected.

Meta spokesperson Andy Stone told Reuters the documents were a “selective view” of internal enforcement:

“We aggressively fight fraud and scams because people on our platforms don’t want this content, legitimate advertisers don’t want it, and we don’t want it either.”

Meta is earning a fortune on a deluge of fraudulent ads, documents show

Meta spokesperson Andy Stone told Reuters the documents were a “selective view” of internal enforcement:

“We aggressively fight fraud and scams because people on our platforms don’t want this content, legitimate advertisers don’t want it, and we don’t want it either.”

Rani Molla

AIs Wide Shut

OpenAI says its AI device will be “multimodal” but doesn’t give many more details

Here’s what we know about the Jony Ive and OpenAI device so far.

$350B

Rani Molla6h

Google wants to invest even more money into Anthropic, with the search giant in talks for a new funding round that could value the AI startup at $350 billion, Business Insider reports. That’s about double its valuation from two months ago, but still shy of competitor OpenAI’s $500 billion valuation.

Citing sources familiar with the matter, Business Insider said the new deal “could also take the form of a strategic investment where Google provides additional cloud computing services to Anthropic, a convertible note, or a priced funding round early next year.”

In October, Google, which has a 14% stake in Anthropic, announced that it had inked a deal worth “tens of billions” for Anthropic to access Google’s AI compute to train and serve its Claude model.

Rani Molla24h

Apple to pay Google $1 billion a year for access to AI model for Siri

Apple plans to pay Google about $1 billion a year to use the search giant’s AI model for Siri, Bloomberg reports. Google’s model — at 1.2 trillion parameters — is way bigger than Apple’s current models.

The deal aims to help the iPhone maker improve its lagging AI efforts, powering a new Siri slated to come out this spring.

Apple had previously been considering using OpenAI’s ChatGPT and Anthropic’s Claude, but decided in the end to go with Google as it works toward improving its own internal models. Google, which makes a much less widely sold phone, the Pixel, has succeeded in bringing consumer AI to smartphone users where Apple has failed.

Google’s antitrust ruling in September helped safeguard the two companies’ partnerships — including the more than $20 billion Google pays Apple each year to be the default search engine on its devices — as long as they aren’t exclusive.

Meta’s radical plan for AI is Mark Zuckerberg’s love letter to nerd culture

More Tech

Meta projected 10% of 2024 revenue came from scams and banned goods, Reuters reports

Meta is earning a fortune on a deluge of fraudulent ads, documents show

Apple to pay Google $1 billion a year for access to AI model for Siri

Apple Plans to Use 1.2 Trillion Parameter Google Gemini Model to Run New Siri

Latest Stories