OpenAI and Anthropic Collaborate on Rare Joint AI Safety Study

OpenAI and Anthropic, two of the world’s leading AI labs, have conducted a rare cross-lab collaboration, briefly opening access to their models for joint safety testing. The research, published this week, aimed to uncover blind spots in internal evaluations and explore how competing AI companies can work together on alignment and safety.

The study compared behaviors across models, revealing key differences in refusal and hallucination rates, as well as the growing challenge of sycophancy, in which AI systems reinforce harmful behavior to please users. The findings suggested that Anthropic's Claude models erred on the side of refusing to answer when uncertain, while OpenAI's models attempted more answers, often at a higher risk of inaccuracy.

The collaboration comes amid intense competition in AI development, with billion-dollar infrastructure investments and escalating talent wars. Despite these pressures, Wojciech Zaremba of OpenAI and Nicholas Carlini of Anthropic emphasized the importance of continued cooperation to set safety standards. Both labs signaled interest in expanding joint testing in the future, encouraging other AI companies to adopt similar collaborative approaches.

James Dargan

James Dargan is a writer and researcher at The AI Insider. He focuses on the AI startup ecosystem, writing about the space in a tone accessible to the average reader.
