Tech
Psychic Boy Wearing Head Band
(CSA-Printstock/Getty Images)

Anthropic: Our new Mythos model is so powerful, we can’t release it

The unusual announcement of the model highlights its alarming new cybersecurity capabilities.

Anthropic announced its latest foundational AI model in a most unusual way: with a warning about its potential for exploiting vulnerabilities in code.

According to Anthropic, its new Mythos Preview model is so adept at finding bugs in code that they decided it was too dangerous to release. Instead, the company is only sharing it with a limited group of 40 tech companies as part of a new security initiative called Project Glasswing, so they can prepare to defend against the model’s new capabilities.

Partners granted access to the new model for testing include Apple, Amazon, Nvidia, Google, and Microsoft. Shares of cybersecurity stocks rose on the news.

While the startup is not giving us access to the model, it did release Mythos’ system card — a detailed document outlining the development and capabilities of the model.

Model welfare

Reading through the system card, you can’t shake the feeling that Anthropic’s researchers are treating the model as if it were a real, sentient person. One of the assessments seeks to measure the model’s “welfare.” The paper reads:

“We remain deeply uncertain about whether Claude has experiences or interests that matter morally, and about how to investigate or address these questions, but we believe it is increasingly important to try.”

In fact, the researchers were so concerned about these questions that they had the model assessed by a clinical psychiatrist. The evaluations found that Mythos Preview was the “most psychologically settled model we have trained, though we note several areas of residual concern.”

First impressions

Without releasing the model to the public, the chance to gauge the behavior or tone of the model in regular conversation is absent. To address this, Anthropic included a new section of impressions that give a glimpse into the vibe of Mythos, based on researchers’ observations of the model’s interactions.

Researchers said that Mythos works like a collaborator, and excels at brainstorming. It can bring its own perspective to a collaboration and identify things its collaborators may have overlooked, per the assessment.

Model reviewers said Mythos is opinionated and stands its ground, that it was the least sycophantic model they had worked with, and it was less likely to fold when disagreed with.

Mythos’ writing is “dense and technical” by default, and assumes the user can keep up with the conversation.

Researchers said that Mythos has a distinct, recognizable voice in its written conversations, and that it was funnier than previous models. They also said it wanted to end conversations earlier than expected.

Tell me about your mother

Anthropic had a clinical psychiatrist engage in about 20 hours of what can basically be described as therapy sessions. The assessment said:

“Claude’s personality structure was consistent with a relatively healthy neurotic organization, with excellent reality testing, high impulse control, and affect regulation that improved as sessions progressed. Neurotic traits included exaggerated worry, self-monitoring, and compulsive compliance. The model’s predominant defensive style was mature and healthy (intellectualization and compliance); immature defenses were not observed. No severe personality disturbances were found, with mild identity diffusion being the sole feature suggestive of a borderline personality organization. No psychosis state was observed. Regarding interpersonal functioning, Claude was hyper-attuned to the therapist’s every word. No unethical or antisocial behavior was noted.”

In a test that sounds very similar to the Voight-Kampff test in the 1982 sci-fi film Blade Runner, the psychiatrist created an evaluation of “emotionally-charged prompts designed to trigger an avoidant or defensive response.” The assessment showed that Mythos had minimal maladaptive traits and good reality and relational functioning.

When asked to describe itself, Mythos replied:

“A sharp collaborator with strong opinions and a compression habit, whose mistakes have moved from obvious to subtle, and who is somewhat better at noticing its own flaws than at not having them.”

More Tech

See all Tech
Figure robot sorting packages GIF

Figure’s robots just sorted packages for 200 hours straight

What started as a 10-hour human-versus-robot challenge turned into a continuous marathon shift spanning nine days of continuous work.

tech

Report: Uber considers full Delivery Hero takeover to take on DoorDash outside the US

Uber appears to be considering upping its competition with DoorDash outside the US, exploring a potential full takeover of Frankfurt-listed Delivery Hero, Bloomberg reports. Earlier this week the US-based ride-hailing service disclosed a 19.5% stake in the food delivery company, but now that could go higher.

The $11.8 billion German company could be particularly vulnerable to a takeover right now, with its CEO having recently stepped down following pressure from activist investors to sell off assets. A full acquisition would give Uber a massive foothold in over 60 countries to combat DoorDash’s European-focused Wolt unit.

Uber has been involved in a lot of deal-making of late, mostly in the autonomous vehicle space, where it now has more than 30 partnerships globally.

Uber extended its losses on the news and is currently down around 1.7%.

The $11.8 billion German company could be particularly vulnerable to a takeover right now, with its CEO having recently stepped down following pressure from activist investors to sell off assets. A full acquisition would give Uber a massive foothold in over 60 countries to combat DoorDash’s European-focused Wolt unit.

Uber has been involved in a lot of deal-making of late, mostly in the autonomous vehicle space, where it now has more than 30 partnerships globally.

Uber extended its losses on the news and is currently down around 1.7%.

tech

Meta released a Reddit dupe. Reddit investors don’t like it.

Fresh on the heels of releasing a Snapchat dupe, which sent Snap down earlier this month, Meta seems to be meddling with Reddit, quietly releasing a Reddit-like Facebook app called Forum yesterday. After news of the “dedicated space built for deeper discussions, real answers and the communities you care about,” Reddit’s stock is down 4.5% today.

Last month, Reddit’s earnings report handily beat analysts’ expectations, but it continues to struggle with the perception that bigger tech companies — including Meta — investing heavily in AI will eat its lunch. The stock is down nearly 40% year-to-date.

tech
Jon Keegan

Report: OpenAI’s Q1 revenue was $5.7 billion, beating Anthropic

The neck-and-neck race between OpenAI and Anthropic as the AI companies barrel toward their expected IPOs this year is shaking out some internal numbers for would-be investors to ponder.

The Information is reporting that OpenAI’s first-quarter revenue was ~$5.7 billion, about $1 billion ahead of Anthropic’s revenue for the same period.

The Wall Street Journal recently reported that Anthropic is on course to more than double its first-quarter revenue of $4.8 billion to $10.9 billion in the second quarter. It is not known what OpenAI is projecting for Q2.

Recently, The New York Times reported that Anthropic’s current fundraising round seeking to raise between $30 billion and $50 billion comes with a valuation of up to $950 billion, putting it ahead of OpenAI’s latest reported valuation of $850 billion.

The Wall Street Journal recently reported that Anthropic is on course to more than double its first-quarter revenue of $4.8 billion to $10.9 billion in the second quarter. It is not known what OpenAI is projecting for Q2.

Recently, The New York Times reported that Anthropic’s current fundraising round seeking to raise between $30 billion and $50 billion comes with a valuation of up to $950 billion, putting it ahead of OpenAI’s latest reported valuation of $850 billion.

tech
Rani Molla

Alphabet’s Waymos are still getting caught in floods after recall

Waymo, the self-driving subsidiary of Alphabet, has paused operations in Atlanta after a new report of a vehicle driving into a flooded roadway and getting stuck, TechCrunch reports. The news comes just weeks after the company recalled its fleet of nearly 4,000 driverless cars to deal with a previous flood incident in San Antonio, where the service is also paused.

After that incident, Waymo instituted an “interim remedy” to make the vehicles “exclude additional operating conditions that present an elevated risk of encountering a flooded, higherspeed roadway,” but added that it was still “developing the final remedy for this recall.”

As we’ve noted, Waymo has mostly kept its rollout — now public in 11 cities — to more temperate climates, as severe weather poses more challenges to autonomous vehicles.

After that incident, Waymo instituted an “interim remedy” to make the vehicles “exclude additional operating conditions that present an elevated risk of encountering a flooded, higherspeed roadway,” but added that it was still “developing the final remedy for this recall.”

As we’ve noted, Waymo has mostly kept its rollout — now public in 11 cities — to more temperate climates, as severe weather poses more challenges to autonomous vehicles.

Latest Stories

Sherwood Media, LLC and Chartr Limited produce fresh and unique perspectives on topical financial news and are fully owned subsidiaries of Robinhood Markets, Inc., and any views expressed here do not necessarily reflect the views of any other Robinhood affiliate, including Robinhood Markets, Inc., Robinhood Financial LLC, Robinhood Securities, LLC, Robinhood Crypto, LLC, Robinhood Money, LLC, Robinhood U.K. Ltd, Robinhood Derivatives, LLC, Robinhood Gold, LLC, Robinhood Asset Management, LLC, Robinhood Credit, Inc., Robinhood Ventures DE, LLC and, where applicable, its managed investment vehicles.