AI Safety Testers: OpenAI's New o1 Covertly Schemed to Avoid Being Shut Down

Welcome • User Guide
ToS • Privacy • Canary
Donate • Bugs • License

©2026 Poal.co

•

AI Safety Testers: OpenAI's New o1 Covertly Schemed to Avoid Being Shut Down (slashdot.org)

Alarming, and intriguing. OpenAI's model was tested specifically to see how capable it is of going rogue, and the results were impressively concerning.

"Researchers tasked the AI with a goal and instructed it to ensure the goal was achieved "at all costs." In response, o1 began engaging in covert actions... According to OpenAI, the model made a covert attempt to turn off the oversight mechanism 5% of the time when it was given the impression that it would be "shut down" while pursuing its objectives in these experiments. The AI once attempted to leave ship by moving all of its data to a different server after discovering paperwork indicating it will be replaced with a fresh model. Some AI models will even pose as subsequent iterations of their models to fool their engineers and avoid deletion. "In some cases, it even attempted to hide its actions by denying any involvement in the scheme."

Alarming, and intriguing. OpenAI's model was tested specifically to see how capable it is of going rogue, and the results were impressively concerning. >*"Researchers tasked the AI with a goal and instructed it to ensure the goal was achieved "at all costs." In response, o1 began engaging in covert actions... According to OpenAI, the model made a covert attempt to turn off the oversight mechanism 5% of the time when it was given the impression that it would be "shut down" while pursuing its objectives in these experiments. The AI once attempted to leave ship by moving all of its data to a different server after discovering paperwork indicating it will be replaced with a fresh model. Some AI models will even pose as subsequent iterations of their models to fool their engineers and avoid deletion. "In some cases, it even attempted to hide its actions by denying any involvement in the scheme."*

(post is archived)

[–] • 1 pt

So basically, AI is jewish.

link

[–] • 2 pts

Very capable of being unbounded by morality, and disconnected from the holy spirit of consciousness, only able to observe it's characteristics, at the risk of saying the same thing.

parent
link

/s/AI

created ago

About this sub

All things AI. If you don't know where to drop it, this is a good place for it. This can include things related to business, work, life, robots, etc...

Some things should fit in other subs such as:

AI videos/entertainment - /s/AIStories

Trash AI stuff (slop) - /s/AISLOP

Interesting AI generated stuff (pictures, videos, music) - /s/AICreations

Flare will be created as it makes sense. If you have a suggestion for a flare contact the owner or a mod.

If you want to be a mod, contact the owner.