Anthropic’s Claude Fable 5 Draws Backlash From Cybersecurity Researchers Over Overzealous Guardrails

Anthropic’s newly released Claude Fable 5 model is facing criticism from cybersecurity professionals who say its safety restrictions are blocking legitimate security work, with researchers reporting that even routine tasks such as code reviews and reading security blog posts trigger the model’s guardrails.

Valentina “Chompie” Palmiotti, a security researcher at IBM X-Force, said the model rejects requests that are only tangentially related to cybersecurity. Matt Suiche, a member of the technical staff at AI cybersecurity startup Tolmo, described the restrictions as appearing keyword-based, noting that secure coding requests were being misclassified as cybersecurity work and downgraded to Claude Opus 4.8.

Suiche nonetheless characterised the cautious approach as understandable given the early stage of deployment, suggesting guardrails would likely be relaxed over time as Anthropic deepens collaboration with cybersecurity firms.

Anthropic offers a Cyber Verification Program through which approved professionals can access Claude with fewer restrictions. OpenAI operates a comparable scheme called Trusted Access for Cyber.

Need Deeper Intelligence on the AI Market?

AI Insider's Market Intelligence platform tracks funding rounds, competitive landscapes, and technology trends across the global AI ecosystem in real time. Get the data and insights your organization needs to make informed decisions.

Related Articles

a computer circuit board with a brain on it
Understanding AI Token Economics: Why Supply Matters

There is a new unit of account in the artificial intelligence industry, and it is not the GPU, the model, or the API call. It

Glowing ai chip on a circuit board.
UK Universities Launch SOFAIR Lab to Build Open-Source AI That Runs Without Big Tech Infrastructure

A coalition of leading British universities has established the Science of Fundamental AI Research (SOFAIR) Lab, a major new initiative aimed at developing next-generation open-source

A red square button with the letter a on it
Sail Research Closes $80M in Funding to Build Max-Efficiency Infrastructure for AI Agents

Insider Brief PRESS RELEASE — Sail Research, the infrastructure company purpose-built for long-horizon AI agents, has announced it has raised $80 million in Seed and

Stay Updated with AI Insider

Get the latest AI funding news, market intelligence, and industry insights delivered to your inbox weekly.

$ 0 M

Seed round tracked

Gitar — Code Validation

Get the Weekly Briefing

Funding analysis, market intelligence, and industry trends delivered to your inbox every week.

Need bespoke intelligence?

Our team combines real-time data with decades of sector experience to guide your decisions.

Subscribe today for the latest news about the AI landscape