Anthropic says most AI models, not just Claude, will resort to blackmail

Several weeks after Anthropic released research claiming that its Claude Opus 4 AI model resorted to blackmailing engineers who tried to turn the model off in controlled test scenarios, the company is out with new research suggesting the problem is more widespread among leading AI models.

Jun 21, 2025 - 08:07
Anthropic says most AI models, not just Claude, will resort to blackmail
Several weeks after Anthropic released research claiming that its Claude Opus 4 AI model resorted to blackmailing engineers who tried to turn the model off in controlled test scenarios, the company is out with new research suggesting the problem is more widespread among leading AI models.

What's Your Reaction?

like

dislike

love

funny

angry

sad

wow

News Moderator - Tomas Kauer https://www.tomaskauer.com/