Anthropic’s Claude Fable 5 Draws Backlash From Cybersecurity Researchers Over Overzealous Guardrails

Anthropic’s newly released Claude Fable 5 model is facing criticism from cybersecurity professionals who say its safety restrictions are blocking legitimate security work, with researchers reporting that even routine tasks such as code reviews and reading security blog posts trigger the model’s guardrails.

Valentina “Chompie” Palmiotti, a security researcher at IBM X-Force, said the model rejects requests that are only tangentially related to cybersecurity. Matt Suiche, a member of the technical staff at AI cybersecurity startup Tolmo, described the restrictions as appearing keyword-based, noting that secure coding requests were being misclassified as cybersecurity work and downgraded to Claude Opus 4.8.

Suiche nonetheless characterised the cautious approach as understandable given the early stage of deployment, suggesting guardrails would likely be relaxed over time as Anthropic deepens collaboration with cybersecurity firms.

Anthropic offers a Cyber Verification Program through which approved professionals can access Claude with fewer restrictions. OpenAI operates a comparable scheme called Trusted Access for Cyber.

Need Deeper Intelligence on the AI Market?

AI Insider's Market Intelligence platform tracks funding rounds, competitive landscapes, and technology trends across the global AI ecosystem in real time. Get the data and insights your organization needs to make informed decisions.

Related Articles

a computer circuit board with a brain on it
The Top 15 AI Fintech, InsurTech & Compliance Scale-Ups You Need to Know in 2026

Artificial intelligence is restructuring how financial institutions manage risk, how insurance reaches the businesses that need it, and how compliance teams stay ahead of regulators

an orange smile on a black background
Amazon Secures $17.5B Bank Loan as AI Infrastructure Debt Mounts Across Big Tech

Amazon has signed a $17.5 billion delayed draw term loan with a syndicate of lenders including Citigroup, JPMorgan Chase, Wells Fargo, HSBC, and BofA Securities,

a white and blue square with a blue logo on it
Meta Partners With Reliance Industries on AI Data Center in India in First Local Infrastructure Bet

Meta has announced its first AI infrastructure investment in India, partnering with conglomerate Reliance Industries to develop a 168-megawatt AI-enabled data center in Jamnagar, Gujarat,

Stay Updated with AI Insider

Get the latest AI funding news, market intelligence, and industry insights delivered to your inbox weekly.

$ 0 M

Seed round tracked

Gitar — Code Validation

Get the Weekly Briefing

Funding analysis, market intelligence, and industry trends delivered to your inbox every week.

Need bespoke intelligence?

Our team combines real-time data with decades of sector experience to guide your decisions.

Subscribe today for the latest news about the AI landscape