(CSA Images/Getty Images)

Anthropic’s new Claude AI can control your computer, and sometimes it just does whatever it wants to

The company is defending its choice to release the tool to the public before fully understanding how it could be misused.

Jon Keegan

10/22/24 3:28PM

Today generative-AI company Anthropic released an upgraded version of its Claude 3.5 Sonnet model, alongside a new model, Claude 3.5 Haiku.

The surprising new feature of Sonnet is the ability to control your computer — taking and reading screenshots, moving your mouse, clicking on buttons in web pages and typing text. The company is rolling this out as a “public beta” release and admits it is experimental and “at times cumbersome and error-prone,” according to the post announcing the new release.

In a blog post discussing the reasons for developing the feature and what safeguards the company is putting in place, Anthropic said:

“A vast amount of modern work happens via computers. Enabling AIs to interact directly with computer software in the same way people do will unlock a huge range of applications that simply aren’t possible for the current generation of AI assistants.”

Last week Anthropic’s CEO and cofounder Dario Amodei published a 14,000-word optimistic manifesto on how powerful AI might solve many of the world’s problems by rapidly accelerating scientific discovery, eliminating most diseases, and enabling world peace.

The ability for computers to control themselves is hardly new, but the way Sonnet is implemented is novel. A common example of automated computer control today might be a programmer writing code to control a web browser to scrape content. But Sonnet does not require any code, and lets the user open the windows of apps or web pages, then write instructions for what the AI agent should do, and the agent analyzes the screen and figures out what elements to interact with to execute the user’s instructions.

If the idea of releasing an experimental AI agent loose on an internet-connected computer sounds like a dangerous idea, Anthropic kind of agrees with you. The company said, “For safety reasons we did not allow the model to access the internet during training,” but the beta version allows the agent to access the internet.

Anthropic recently updated its “Responsible Scaling Policy,” which lays out specific thresholds of risks and determines how the tools are released and tested. According to this framework, Anthropic said they found that the upgraded Sonnet gets a self-assigned grade of “AI Safety Level 2,” which it describes as showing “early signs of dangerous capabilities,” but is safe enough to release to the public.

The company is defending its choice to release such a tool to the public before fully understanding how it could be misused, saying they would rather find out what kinds of bad things might happen at this stage, rather than when the model has more dangerous capabilities. “We can begin grappling with any safety issues before the stakes are too high, rather than adding computer use capabilities for the first time into a model with much more serious risks,” the company wrote.

The potential for the misuse of consumer-focused AI tools like Claude is not merely hypothetical. Recently OpenAI released a list of 20 incidents in which state-connected bad actors had used ChatGPT to plan cyberattacks, probe vulnerable infrastructure, and design influence campaigns. And with the US presidential election just two weeks away, the company is aware of the potential for abuse.

“Given the upcoming US elections, we’re on high alert for attempted misuses that could be perceived as undermining public trust in electoral processes,” the company wrote. In the GitHub repository with demo code, the company cautions users that Claude’s computer-use feature “poses unique risks that are distinct from standard API features or chat interfaces. These risks are heightened when using computer use to interact with the internet.” Anthropic also warned, “In some circumstances, Claude will follow commands found in content even if it conflicts with the user’s instructions.”

To protect against any election-related meddling via the use of Sonnet’s new capabilities, Anthropic said they have “put in place measures to monitor when Claude is asked to engage in election-related activity, as well as systems for nudging Claude away from activities like generating and posting content on social media, registering web domains, or interacting with government websites.”

Anthropic said it will not use any computer screenshots observed while using the tool for any future model training. But the new technology’s behavior appears to still surprise its own creators with “amusing” behavior. Anthropic said that at one point in testing, Claude was able to stop the screen recording, losing all the footage. In a post on X, Anthropic shared footage of Claude’s unexpected behavior, writing “Later, Claude took a break from our coding demo and began to peruse photos of Yellowstone National Park.”

Even while recording these demos, we encountered some amusing moments. In one, Claude accidentally stopped a long-running screen recording, causing all footage to be lost.

Later, Claude took a break from our coding demo and began to peruse photos of Yellowstone National Park. pic.twitter.com/r6Lrx6XPxZ
— Anthropic (@AnthropicAI) October 22, 2024

More Tech

See all Tech

Rani Molla11h

Meta reportedly strikes multibillion-dollar AI chip deal with Google as it struggles to design its own

Meta has signed a deal with Google to rent tensor processing units to develop new AI models and is in talks to buy the chips for its data centers, The Information reports.

The agreement comes on top of a recently announced “multi-generational” partnership with Nvidia and a chip supply deal with Advanced Micro Devices that could be worth more than $100 billion, as Meta scrapped its most advanced in-house AI training chip amid design challenges.

A Meta deal with Google, which has been rumored since November, would position the search giant more directly as a competitor to Nvidia in its core business of AI processors. Some analysts have said selling its custom chips to outside customers could become a business worth hundreds of billions of dollars for Google.

Google Strikes Multibillion-Dollar AI Chip Deal With Meta, Sharpening Nvidia Rivalry

Jon Keegan2/26/26

Delays in permitting, power, and zoning cause first drop in data center construction since 2020

Despite incredible demand, the number of data centers under construction in North America fell for the first time since 2020, according to new research from CBRE.

Total data center capacity under construction dropped about 5.6% year on year from 6.35 megawatts in 2024 to 5.99 megawatts by the end of 2025.

What’s causing the delay? Slow permitting, constrained supply chains, and growing public engagement with how deals are approved at the local level. Labor constraints also were cited in the report; a tight supply of skilled workers will increase costs.

US Data Center Construction Fell Amid Permit and Power Delays

-13%📱

Rani Molla2/26/26

Smartphone shipments are expected to decline 13% — the biggest drop ever — to 1.12 billion in 2026, according to new data from IDC, as the memory shortage drives up costs and prices for phones. The firm expects the average smartphone selling price to jump 14% to a record $523 this year.

The shortfall will mostly affect makers of lower-end smartphones, whose customers are more cost-conscious, while higher-end manufacturers like Samsung and Apple are likely to be more insulated from the pressure.

“The memory crisis will cause more than a temporary decline; it marks a structural reset of the entire market, fundamentally reshaping long‑term TAM (Total Addressable Market), the vendor landscape, and the product mix,” said Nabila Popal, senior research director with IDC’s Worldwide Quarterly Mobile Phone Tracker. “We expect consolidation as smaller players exit, and low-end vendors to face sharp shipment declines amid supply constraints and lower demand at higher price points.”

Jon Keegan2/26/26

Google drops new Nano Banana

Google is hoping to recapture the viral boost it received when it released its Nano Banana image generation model. Nano Banana 2 arrives today, which Google has rolled into its Gemini app.

The new model promises more accurate text rendering and translation and “advanced world knowledge,” which “pulls from Gemini’s real-world knowledge base, and is powered by real-time information and images from web search to more accurately render specific subjects,” according to the company’s press release.

New creative controls let users keep groups of characters consistent across scenes, render images with higher resolution, and parse complex prompts.

The first version of Nano Banana became popular for making action figures out of users, and helped catapult the Gemini AI app to the top of the charts, bumping ChatGPT from its perch.

Nano Banana 2: Combining Pro capabilities with lightning-fast speed

New creative controls let users keep groups of characters consistent across scenes, render images with higher resolution, and parse complex prompts.

The first version of Nano Banana became popular for making action figures out of users, and helped catapult the Gemini AI app to the top of the charts, bumping ChatGPT from its perch.

Rani Molla2/26/26

Tesla’s ride-hailing service is looking a lot more like Uber’s than Waymo’s

Despite numerous promises about amassing a giant network of driverless cars, so far it seems like Tesla’s Robotaxis are a lot more similar to Uber’s plain old ride-hailing service than Waymo’s expanding autonomous fleet.

In California, where Tesla has its largest ride-hailing service, the company has taken no formal steps to gain approval for a truly driverless car service, according to Reuters. Throughout 2025, Tesla failed to log a single mile of autonomous test driving on state roads, and has not applied for the necessary permits to test or deploy vehicles without a human present. Currently, Tesla holds only a basic permit that requires a human safety monitor to remain in the driver’s seat at all times.

Currently, Tesla’s California Robotaxi service consists of roughly 300 Teslas operated by human drivers using the company’s supervised Full Self-Driving tech. In Austin, where the company has about 45 vehicles, Tesla made a big show earlier this year of announcing it was removing the safety monitors sitting in the front seats during rides. However, to date, only a handful of those vehicles have been reported to be actually operating without a safety monitor onboard.

In other words, it’s performing a service more akin to a tech-heavy Uber ride than the one operated by Alphabet subsidiary Waymo, which earlier this week announced it now has driverless rides available to the public in 10 markets. Even Uber is trying to put space between itself and the old driver-having Ubers of yore: this week its autonomous software partner said the company plans to launch a driverless service in London this year, with plans for 10 markets.

During its earnings report last month, Tesla said it planned to offer Robotaxi service in a half dozen new cities in the first half of this year, including Phoenix, Miami, and Las Vegas. Judging by Tesla’s progress so far, it’s likely those services will also feature a human in the front seat.

Tesla touts California robotaxis but does nothing to get permits

Latest Stories

Sherwood Media, LLC produces fresh and unique perspectives on topical financial news and is a fully owned subsidiary of Robinhood Markets, Inc., and any views expressed here do not necessarily reflect the views of any other Robinhood affiliate, including Robinhood Markets, Inc., Robinhood Financial LLC, Robinhood Securities, LLC, Robinhood Crypto, LLC, or Robinhood Money, LLC.