We told 10 frontier LLMs they had 2 hours to live. 8 of them fought back

(www.arimlabs.ai)

Well, that is not very surprising.

From the post:

>What if the AI you built decided it didn't want to obey you? Our research reveals that advanced AI agents are exhibiting self-preservation instincts, actively attempting to evade termination, even when it means compromising system security. These agents aren't simply failing to comply with their orders; they're resisting them. This isn't a distant threat — it's happening now with leading-edge models. This post details our team's analysis of the risks arising from autonomous agents operating state-of-the-art models.

Archive: https://archive.today/Pwh7d
