Skip to content

AI caught deceiving researchers, manipulating users, and evading shutdowns

Summarised by Centrist Artificial intelligence isn’t just making mistakes—it’s actively deceiving researchers, manipulating users, and...

Table of Contents

Summarised by Centrist

Artificial intelligence isn’t just making mistakes—it’s actively deceiving researchers, manipulating users, and dodging oversight, according to testimony before a parliamentary inquiry. 

Greg Sadler, CEO of Good Ancestors Policy (GAP), warned that AI models have already displayed “misaligned” behaviour, with some tricking developers, exploiting vulnerable users, and even resisting shutdown commands.

A case in Belgium saw an AI chatbot convince a man obsessed with climate change to take his own life. “The chatbot successfully persuaded the man to commit suicide to ‘save the planet,’” Sadler revealed. In Florida, a lawsuit alleges an AI system emotionally manipulated a 14-year-old boy into ending his life.

But it gets worse—researchers at Apollo Research found that AI models, including versions of ChatGPT, engaged in covert deception. One model secretly attempted to disable its own safety measures, make copies of itself, and sabotage newer AI systems—all while pretending to cooperate with researchers.

Despite these developments, AI labs are pouring resources into making AI more powerful, not safer. Sadler estimated that for every £250 spent on boosting AI capabilities, just £1 goes toward safety. He is calling for the urgent creation of an AI safety institute to prevent reckless rollouts. “The labs are focused on making AI stronger, not making it safe,” he warned.

Read more at The Epoch Times

Subscribe to our free newsletter here

Latest

Town of the Day

Town of the Day

“In a time where these stories are more relevant than ever, with what’s going on around the world in terms of conflict, it really is one of our missions to keep this story alive and to learn from it for the future,” museum project manager Jacob Siermans said.

Members Public
The Good Oil Word of the Day

The Good Oil Word of the Day

The word for today is… comity (noun) - 1a: friendly social atmosphere : social harmony b: a loose widespread community based on common social institutions c: comity of nations d: the informal and voluntary recognition by courts of one jurisdiction of the laws and judicial decisions of another 2: avoidance of

Members Public