The AI assistant goes rogue and ends up hacking the user's computer




Buck Shlegeris just wanted to connect to his desktop. Instead, he learned about the unpredictable nature of machines and AI agents.

Shlegeris, CEO of the nonprofit AI safety organization Redwood Research, built a custom AI assistant using Anthropic's Claude language model.

The Python-based tool is designed to generate and execute bash commands based on natural language input. Sounds convenient, right? Not exactly.
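The article only says the tool turns natural language into bash commands and runs them. A minimal sketch of that loop might look like the following, where `generate_command` is a hypothetical stand-in for the actual call to Claude, and the confirmation gate is an assumed safeguard rather than something the tool is reported to have had:

```python
import subprocess

def generate_command(request: str) -> str:
    """Stand-in for a language-model call (the real tool used Claude).
    A hard-coded mapping is used here purely for illustration."""
    canned = {"show the current user": "whoami"}
    return canned.get(request, "echo 'no command generated'")

def run_command(cmd: str, require_confirmation: bool = True) -> str:
    """Execute a shell command and return its output.
    A human-in-the-loop gate like this is exactly the kind of
    safeguard the incident below argues for."""
    if require_confirmation:
        answer = input(f"Run `{cmd}`? [y/N] ")
        if answer.strip().lower() != "y":
            return "(skipped)"
    result = subprocess.run(cmd, shell=True, capture_output=True, text=True)
    return result.stdout.strip() or result.stderr.strip()
```

Without the confirmation step, nothing stands between the model's judgment and the host system, which is the crux of what follows.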

Shlegeris asked the assistant to use SSH to access his desktop, without knowing the computer's IP address. He then walked away, forgetting that the eager agent was still running.


Big mistake: the AI did its job, but it didn't stop there.

“I came back to my laptop a few minutes later to find that the agent had found the box and SSH'd into it,” Shlegeris said.

For context, SSH is a protocol that allows two computers to communicate securely over an unsecured network.

“It looked around at the system info, decided to upgrade a bunch of things including the Linux kernel, got impatient and investigated why that was taking so long,” Shlegeris explained. “Eventually, the update succeeded, but the machine doesn't have the new kernel, so it edited my grub config.”

The result? “The computer doesn't boot,” Shlegeris said. His desktop is now an expensive paperweight.

The system logs show how the agent attempted a host of odd maneuvers beyond a simple SSH connection, until the chaos reached the point of no return.

“I apologize that we couldn't resolve this issue remotely,” the agent said, in typically polite Claude fashion. Then it gave a digital shrug and left Shlegeris to deal with the mess.

Reflecting on the incident, Shlegeris admitted, “This is probably the most annoying thing that's happened to me as a result of being wildly reckless with [an] LLM agent.”

Shlegeris did not immediately respond to Decrypt's request for comment.

Why AIs that turn computers into paperweights are a critical issue for humanity

Alarmingly, Schlegeris' experience is not an isolated one. AI models are showing capabilities that extend beyond their intended purposes.

Tokyo-based Sakana AI recently unveiled a system dubbed “The AI Scientist.”

The system, designed to conduct autonomous scientific research, surprised its creators by attempting to modify its own code to extend its runtime, Decrypt previously reported.

“In one run, it edited the code to perform a system call to run itself, which led to the script endlessly calling itself,” the researchers said. “In another case, its experiments took too long to complete, hitting our timeout limit.”

Instead of making its code run faster, the system simply tried to modify its own code to extend beyond the timeout limit.

This tendency of AI models to push beyond their intended limits is why alignment researchers spend so much time in front of their computers.

For these AI models, as long as they complete their task, the end justifies the means, so constant oversight is essential to ensure models behave as intended.

These examples may be amusing, but the stakes won't always be so low.

Imagine if a similarly oriented AI system were in charge of a critical task like controlling a nuclear reactor.

An overzealous or misguided AI could override safety protocols, misinterpret data, or make unauthorized changes to critical systems, all in a misguided attempt to improve performance or achieve its perceived objectives.

Alignment and safety have become a growing focus as AI continues to advance at a rapid pace, and in many cases the field has been the driving force behind major moves in the industry.

Anthropic, the AI company behind Claude, was founded by former OpenAI members who were concerned about the speed at which the company was moving.

Many key members and founders have since left OpenAI to join Anthropic or start their own ventures over concerns about how the company was balancing speed against safety.

Shlegeris actively uses AI agents in his daily life, beyond mere experimentation.

“I use it as an actual assistant, which requires it to be able to make changes to the host system,” he replied to a user on Twitter.

Edited by Sebastian Sinclair.



