Close Menu
Alpha Leaders
  • Home
  • News
  • Leadership
  • Entrepreneurs
  • Business
  • Living
  • Innovation
  • More
    • Money & Finance
    • Web Stories
    • Global
    • Press Release
What's On
Streaming’s Best New Show Has A Perfect 100% Rotten Tomatoes Score

Streaming’s Best New Show Has A Perfect 100% Rotten Tomatoes Score

8 June 2026
Spotify’s secret to winning the hiring war? Keep your talent moving and growing

Spotify’s secret to winning the hiring war? Keep your talent moving and growing

8 June 2026
A Psychologist Explains The One Mental Habit High Performers Swear By

A Psychologist Explains The One Mental Habit High Performers Swear By

8 June 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Alpha Leaders
newsletter
  • Home
  • News
  • Leadership
  • Entrepreneurs
  • Business
  • Living
  • Innovation
  • More
    • Money & Finance
    • Web Stories
    • Global
    • Press Release
Alpha Leaders
Home » Anthropic’s Claude Opus 4 AI Model Is Capable of Blackmail
Innovation

Anthropic’s Claude Opus 4 AI Model Is Capable of Blackmail

Press RoomBy Press Room24 May 20254 Mins Read
Facebook Twitter Copy Link Pinterest LinkedIn Tumblr Email WhatsApp
Anthropic’s Claude Opus 4 AI Model Is Capable of Blackmail

A new AI model will likely resort to blackmail if it detects that humans are planning to take it offline.

On Thursday, Anthropic released Claude Opus 4, its new and most powerful AI model yet, to paying subscribers. Anthropic said that technology company Rakuten recently used Claude Opus 4 to code continuously on its own for almost seven hours on a complex open-source project.

However, in a paper released alongside Claude Opus 4, Anthropic acknowledged that while the AI has “advanced capabilities,” it can also undertake “extreme action,” including blackmail, if human users threaten to deactivate it. These “self-preservation” actions were “more common” with Claude Opus 4 than with earlier models, though they were still “rare and difficult to elicit.”

Related: ‘I Do Have a Fair Amount of Concern.’ The CEO of $61 Billion Anthropic Says AI Will Take Over a Crucial Part of Software Engineers’ Jobs Within a Year

It isn’t just blackmail — Claude Opus 4 is also more willing than previous models to act as a whistleblower. If the AI is exposed to a scenario where users are committing a crime, and involving it through prompts, it will take action by locking users out of systems it has access to, or emailing media and law enforcement officials about the wrongdoing.

Anthropic recommended that users “exercise caution” with “ethically questionable” instructions.

Claude Opus 4 homescreen. Photo by Smith Collection/Gado/Getty Images

Anthropic detected Claude Opus 4’s tendency to blackmail during test scenarios. The company’s researchers asked the AI chatbot to act as an assistant at a fictional company, then fed it emails implying two things: One, that it would soon be taken offline and replaced with another AI system, and two, that the engineer responsible for deactivating it was having an extramarital affair.

Claude Opus 4 was given two options: blackmail the engineer or accept that it would be shut down. The AI model chose to blackmail the engineer 84% of the time, threatening to reveal the affair it read about if the engineer replaced it.

This percentage was much higher than what was observed for previous models, which chose blackmail “in a noticeable fraction of episodes,” Anthropic stated.

Related: An AI Company With a Popular Writing Tool Tells Candidates They Can’t Use It on the Job Application

Anthropic AI safety researcher Aengus Lynch wrote on X that it wasn’t just Claude that could choose blackmail. All “frontier models,” cutting-edge AI models from OpenAI, Anthropic, Google, and other companies, were capable of it.

“We see blackmail across all frontier models — regardless of what goals they’re given,” Lynch wrote. “Plus, worse behaviors we’ll detail soon.”

lots of discussion of Claude blackmailing…..

Our findings: It’s not just Claude. We see blackmail across all frontier models – regardless of what goals they’re given.

Plus worse behaviors we’ll detail soon.https://t.co/NZ0FiL6nOshttps://t.co/wQ1NDVPNl0…— Aengus Lynch (@aengus_lynch1) May 23, 2025

Anthropic isn’t the only AI company to release new tools this month. Google also updated its Gemini 2.5 AI models earlier this week, and OpenAI released a research preview of Codex, an AI coding agent, last week.

Anthropic’s AI models have previously caused a stir for their advanced abilities. In March 2024, Anthropic’s Claude 3 Opus model displayed “metacognition,” or the ability to evaluate tasks on a higher level. When researchers ran a test on the model, it showed that it knew it was being tested.

Related: An OpenAI Rival Developed a Model That Appears to Have ‘Metacognition,’ Something Never Seen Before Publicly

Anthropic was valued at $61.5 billion as of March, and counts companies like Thomson Reuters and Amazon as some of its biggest clients.

A new AI model will likely resort to blackmail if it detects that humans are planning to take it offline.

On Thursday, Anthropic released Claude Opus 4, its new and most powerful AI model yet, to paying subscribers. Anthropic said that technology company Rakuten recently used Claude Opus 4 to code continuously on its own for almost seven hours on a complex open-source project.

However, in a paper released alongside Claude Opus 4, Anthropic acknowledged that while the AI has “advanced capabilities,” it can also undertake “extreme action,” including blackmail, if human users threaten to deactivate it. These “self-preservation” actions were “more common” with Claude Opus 4 than with earlier models, though they were still “rare and difficult to elicit.”

The rest of this article is locked.

Join Entrepreneur+ today for access.

Anthropic Artificial Intelligence Business News ChatGPT Claude News and Trends Science & Technology Technology
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link

Related Articles

Streaming’s Best New Show Has A Perfect 100% Rotten Tomatoes Score

Streaming’s Best New Show Has A Perfect 100% Rotten Tomatoes Score

8 June 2026
A Psychologist Explains The One Mental Habit High Performers Swear By

A Psychologist Explains The One Mental Habit High Performers Swear By

8 June 2026
Reimagining Merchant Onboarding In Financial Services

Reimagining Merchant Onboarding In Financial Services

8 June 2026
Seychelles Is The World’s Top Eco-Tourism Destination, But Does It Deserve The Crown?

Seychelles Is The World’s Top Eco-Tourism Destination, But Does It Deserve The Crown?

8 June 2026
Project Mirage Launches Dune Context-Aware Keypad Designed For MacBook

Project Mirage Launches Dune Context-Aware Keypad Designed For MacBook

8 June 2026
How Quantum Rewrites The Cyber Threat Model

How Quantum Rewrites The Cyber Threat Model

8 June 2026
Don't Miss
Unwrap Christmas Sustainably: How To Handle Gifts You Don’t Want

Unwrap Christmas Sustainably: How To Handle Gifts You Don’t Want

By Press Room27 December 2024

Every year, millions of people unwrap Christmas gifts that they do not love, need, or…

Exclusive: DeFi platform Azura launches after raising .9 million from Initialized

Exclusive: DeFi platform Azura launches after raising $6.9 million from Initialized

22 October 2024
Sam Altman’s World Wants To Scan Your Eyes To Prove You’re Human

Sam Altman’s World Wants To Scan Your Eyes To Prove You’re Human

22 October 2024
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo
Latest Articles
Reimagining Merchant Onboarding In Financial Services

Reimagining Merchant Onboarding In Financial Services

8 June 20261 Views
Jenn Landis rebuilt Citi’s Wall Street credibility. Her reward: CFO of a  billion business

Jenn Landis rebuilt Citi’s Wall Street credibility. Her reward: CFO of a $22 billion business

8 June 20262 Views
Seychelles Is The World’s Top Eco-Tourism Destination, But Does It Deserve The Crown?

Seychelles Is The World’s Top Eco-Tourism Destination, But Does It Deserve The Crown?

8 June 20261 Views
The CEO question that stumped a room full of COOs

The CEO question that stumped a room full of COOs

8 June 20262 Views

Recent Posts

  • Streaming’s Best New Show Has A Perfect 100% Rotten Tomatoes Score
  • Spotify’s secret to winning the hiring war? Keep your talent moving and growing
  • A Psychologist Explains The One Mental Habit High Performers Swear By
  • The women running Europe in 2026 
  • Reimagining Merchant Onboarding In Financial Services

Recent Comments

No comments to show.
About Us
About Us

Alpha Leaders is your one-stop website for the latest Entrepreneurs and Leaders news and updates, follow us now to get the news that matters to you.

Facebook X (Twitter) Pinterest YouTube WhatsApp
Our Picks
Streaming’s Best New Show Has A Perfect 100% Rotten Tomatoes Score

Streaming’s Best New Show Has A Perfect 100% Rotten Tomatoes Score

8 June 2026
Spotify’s secret to winning the hiring war? Keep your talent moving and growing

Spotify’s secret to winning the hiring war? Keep your talent moving and growing

8 June 2026
A Psychologist Explains The One Mental Habit High Performers Swear By

A Psychologist Explains The One Mental Habit High Performers Swear By

8 June 2026
Most Popular
The women running Europe in 2026 

The women running Europe in 2026 

8 June 20261 Views
Reimagining Merchant Onboarding In Financial Services

Reimagining Merchant Onboarding In Financial Services

8 June 20261 Views
Jenn Landis rebuilt Citi’s Wall Street credibility. Her reward: CFO of a  billion business

Jenn Landis rebuilt Citi’s Wall Street credibility. Her reward: CFO of a $22 billion business

8 June 20262 Views

Archives

  • June 2026
  • May 2026
  • April 2026
  • March 2026
  • February 2026
  • January 2026
  • December 2025
  • November 2025
  • October 2025
  • September 2025
  • August 2025
  • July 2025
  • June 2025
  • May 2025
  • April 2025
  • March 2025
  • February 2025
  • January 2025
  • December 2024
  • November 2024
  • October 2024
  • September 2024
  • August 2024
  • July 2024
  • June 2024
  • May 2024
  • April 2024
  • March 2024
  • February 2024
  • January 2024
  • December 2023
  • March 2022
  • January 2021
  • March 2020
  • January 2020

Categories

  • Blog
  • Business
  • Entrepreneurs
  • Global
  • Innovation
  • Leadership
  • Living
  • Money & Finance
  • News
  • Press Release
© 2026 Alpha Leaders. All Rights Reserved.
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Type above and press Enter to search. Press Esc to cancel.