Close Menu
Alpha Leaders
  • Home
  • News
  • Leadership
  • Entrepreneurs
  • Business
  • Living
  • Innovation
  • More
    • Money & Finance
    • Web Stories
    • Global
    • Press Release
What's On
Could This 5M Investment Be The Pinocchio Moment For Quantum Computing?

Could This $375M Investment Be The Pinocchio Moment For Quantum Computing?

11 June 2026
The space economy’s next frontier is in ground infrastructure, Northwood Space CEO says

The space economy’s next frontier is in ground infrastructure, Northwood Space CEO says

11 June 2026
The World Cup’s Real Viral Threats Aren’t Ebola Or Hantavirus

The World Cup’s Real Viral Threats Aren’t Ebola Or Hantavirus

11 June 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Alpha Leaders
newsletter
  • Home
  • News
  • Leadership
  • Entrepreneurs
  • Business
  • Living
  • Innovation
  • More
    • Money & Finance
    • Web Stories
    • Global
    • Press Release
Alpha Leaders
Home » Anthropic accused of ‘secret sabotage’ as Claude Fable 5 silently limits AI research capabilities
News

Anthropic accused of ‘secret sabotage’ as Claude Fable 5 silently limits AI research capabilities

Press RoomBy Press Room11 June 20265 Mins Read
Facebook Twitter Copy Link Pinterest LinkedIn Tumblr Email WhatsApp
Anthropic accused of ‘secret sabotage’ as Claude Fable 5 silently limits AI research capabilities

When Anthropic made its first Mythos-tier model available to the general public yesterday, called Claude Fable 5, Fortune reported it was a “considerable step” for the lab, coming just over a week after the company confidentially filed for IPO paperwork. It had initially deemed Mythos-class models too dangerous to release, citing their significantly enhanced ability to identify software vulnerabilities, but said it was now confident new guardrails in Claude Fable 5 are enough to ensure these dangerous skills don’t fall into the wrong hands.

Just hours after the model’s release, however, major backlash from AI researchers, developers, and policy experts began brewing on social media. The pushback centered around a paragraph buried in Claude Fable 5’s 319-page system card—a document that offers detailed safety disclosures—which revealed that Fable would quietly downgrade its own responses when it detected requests related to cutting-edge AI development work, such as building the infrastructure used to train large AI models.

In practice, that means a user could ask Fable for help, receive a deliberately weakened answer, but not know the model was holding anything back. Critics made it clear they felt this undermined a basic expectation that a tool would either do what it was asked or tell the user it wouldn’t.

Unlike Fable’s other restrictions, such as around cybersecurity and biology, which openly redirect users to a less powerful model with a visible notification, the system card emphasized that this is “not visible to the user.” The model still responds, but uses “interventions to limit Claude’s effectiveness” without telling the user it’s doing so.

Anthropic estimated the restrictions would affect roughly 0.03% of traffic. But it also defended its effort by saying “enforcing this restriction through our safeguards avoids accelerating the actors most willing to violate these terms.” 

Pushback from AI community

A wide swath of the AI community pushed back sharply—including open-source researchers critical of Anthropic’s closed policies, as well as AI safety experts who typically align with Anthropic.

“To have my access to the cutting edge models for my work rug pulled in an under the table fashion is appalling,” wrote Nathan Lambert, an open-model researcher who most recently led work at AI2. “To me this paints Anthropic clearly as anti-science, and therefore anti-progress and anti-safety.” 

Dean Ball, a senior fellow at the Foundation for American Innovation who previously served as senior policy advisor at the White House Office of Science and Technology Policy, wrote that Anthropic’s “secret sabotage” safety policy “massively and profoundly raises the status of the argument that AI safety has been hype to justify monopolistic behavior by labs.” 

And Jeremy Howard, head of nonprofit research group Fast AI, wrote that “Anthropic has chosen the opposite of the safe path: they are allowing themselves, the current top lab, to use their top model for frontier AI research. They’ve said they’ll sabotage others who try. This means the AI frontier advances, & power imbalance increases.” 

Even former Anthropic employees joined in. Behnam Neyshabur, who previously co-led Anthropic’s effort to develop an AI scientist, posted on X saying: “Working on AI for cancer? Sorry, I can’t help you. Working on AI for Alzheimer’s Disease? Sorry, I’m becoming a bit dumb when it comes to the AI part of it.” In another post, he added: “I’ve argued for the last eight months that this was the direction things were heading. In my view, concentrating these capabilities fundamentally slows scientific and technological progress and is net negative for humanity.”

Not all prominent AI voices weighed in with criticism, however. Ethan Mollick, an associate professor at Wharton studying AI, innovation, and entrepreneurship, did not focus on the restrictions, writing in a blog post that Claude Fable 5 “outperformed basically every other public model I have used by a considerable margin.” 

Former OpenAI cofounder and Tesla AI director Andrej Karpathy, who announced he had joined Anthropic last month, called Claude Fable 5 a “super exciting release” on X and said it is a “major-version-bump-deserving step change forward.” He did, however, point out that the model “still has quirks that people will run into and the safeguards are configured to be a little too trigger-happy for launch, which can hopefully be tuned over time.”  

Anthropic says it wants to make models accessible and safe

Before the release, Anthropic seemed to gird itself for backlash, though it did not specifically address potential blowback regarding the research restrictions. In an interview with Fortune yesterday, Dianne Na Penn, Anthropic’s head of product management, research, and labs, said that the new model was able to produce frontier performance that was 10 to 20 points more than its previous model, Opus 4.8 or other frontier models.

“I think generally being able to do that, at the same time having the right guardrails in place to make it accessible, and generally in a safe manner, I think that’s probably the main thing that I want folks to take away,” she said. “We’re raising the bar on the intelligence of the models, and at the same time, we are pushing the frontier in a safe manner.” 

She added that Anthropic recognized that some benign requests would initially be blocked. “We’re working actively on making those safeguard improvements post-launch, but we wanted to make the model accessible generally in a safe manner as soon as we could.”

Anthropic did not respond to Fortune’s request for comment.

Anthropic
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link

Related Articles

The space economy’s next frontier is in ground infrastructure, Northwood Space CEO says

The space economy’s next frontier is in ground infrastructure, Northwood Space CEO says

11 June 2026
Meta is tackling the blue-collar worker shortage by investing 5 million in data center trade jobs

Meta is tackling the blue-collar worker shortage by investing $115 million in data center trade jobs

11 June 2026
Gates testifies on Epstein: Fortune reported payments to his ex-girlfriend, M Microsoft deal

Gates testifies on Epstein: Fortune reported payments to his ex-girlfriend, $1M Microsoft deal

11 June 2026
The curse of Trump watching sports in person: the home team seems to always lose

The curse of Trump watching sports in person: the home team seems to always lose

10 June 2026
Digital sovereignty isn’t the same thing as digital isolation. Asia’s governments should be careful

Digital sovereignty isn’t the same thing as digital isolation. Asia’s governments should be careful

10 June 2026
How the World Cup is a high-stakes stage for Big Tech’s AI push

How the World Cup is a high-stakes stage for Big Tech’s AI push

10 June 2026
Don't Miss
Unwrap Christmas Sustainably: How To Handle Gifts You Don’t Want

Unwrap Christmas Sustainably: How To Handle Gifts You Don’t Want

By Press Room27 December 2024

Every year, millions of people unwrap Christmas gifts that they do not love, need, or…

Exclusive: DeFi platform Azura launches after raising .9 million from Initialized

Exclusive: DeFi platform Azura launches after raising $6.9 million from Initialized

22 October 2024
Sam Altman’s World Wants To Scan Your Eyes To Prove You’re Human

Sam Altman’s World Wants To Scan Your Eyes To Prove You’re Human

22 October 2024
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo
Latest Articles
Humana To Divest End-Of-Life Care Business For 0 Million

Humana To Divest End-Of-Life Care Business For $900 Million

11 June 20262 Views
Anthropic accused of ‘secret sabotage’ as Claude Fable 5 silently limits AI research capabilities

Anthropic accused of ‘secret sabotage’ as Claude Fable 5 silently limits AI research capabilities

11 June 20264 Views
NYT ‘Pips’ Hints, Answers And Walkthrough For Thursday, June 11

NYT ‘Pips’ Hints, Answers And Walkthrough For Thursday, June 11

11 June 20262 Views
Gates testifies on Epstein: Fortune reported payments to his ex-girlfriend, M Microsoft deal

Gates testifies on Epstein: Fortune reported payments to his ex-girlfriend, $1M Microsoft deal

11 June 20262 Views

Recent Posts

  • Could This $375M Investment Be The Pinocchio Moment For Quantum Computing?
  • The space economy’s next frontier is in ground infrastructure, Northwood Space CEO says
  • The World Cup’s Real Viral Threats Aren’t Ebola Or Hantavirus
  • Meta is tackling the blue-collar worker shortage by investing $115 million in data center trade jobs
  • Humana To Divest End-Of-Life Care Business For $900 Million

Recent Comments

No comments to show.
About Us
About Us

Alpha Leaders is your one-stop website for the latest Entrepreneurs and Leaders news and updates, follow us now to get the news that matters to you.

Facebook X (Twitter) Pinterest YouTube WhatsApp
Our Picks
Could This 5M Investment Be The Pinocchio Moment For Quantum Computing?

Could This $375M Investment Be The Pinocchio Moment For Quantum Computing?

11 June 2026
The space economy’s next frontier is in ground infrastructure, Northwood Space CEO says

The space economy’s next frontier is in ground infrastructure, Northwood Space CEO says

11 June 2026
The World Cup’s Real Viral Threats Aren’t Ebola Or Hantavirus

The World Cup’s Real Viral Threats Aren’t Ebola Or Hantavirus

11 June 2026
Most Popular
Meta is tackling the blue-collar worker shortage by investing 5 million in data center trade jobs

Meta is tackling the blue-collar worker shortage by investing $115 million in data center trade jobs

11 June 20263 Views
Humana To Divest End-Of-Life Care Business For 0 Million

Humana To Divest End-Of-Life Care Business For $900 Million

11 June 20262 Views
Anthropic accused of ‘secret sabotage’ as Claude Fable 5 silently limits AI research capabilities

Anthropic accused of ‘secret sabotage’ as Claude Fable 5 silently limits AI research capabilities

11 June 20264 Views

Archives

  • June 2026
  • May 2026
  • April 2026
  • March 2026
  • February 2026
  • January 2026
  • December 2025
  • November 2025
  • October 2025
  • September 2025
  • August 2025
  • July 2025
  • June 2025
  • May 2025
  • April 2025
  • March 2025
  • February 2025
  • January 2025
  • December 2024
  • November 2024
  • October 2024
  • September 2024
  • August 2024
  • July 2024
  • June 2024
  • May 2024
  • April 2024
  • March 2024
  • February 2024
  • January 2024
  • December 2023
  • March 2022
  • January 2021
  • March 2020
  • January 2020

Categories

  • Blog
  • Business
  • Entrepreneurs
  • Global
  • Innovation
  • Leadership
  • Living
  • Money & Finance
  • News
  • Press Release
© 2026 Alpha Leaders. All Rights Reserved.
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Type above and press Enter to search. Press Esc to cancel.