Harvard Study Finds OpenAI’s o1 Model Outperforms Physicians in ER Triage Diagnoses

study published in Science by researchers at Harvard Medical School and Beth Israel Deaconess Medical Center found that OpenAI’s o1 model outperformed internal medicine physicians in emergency room diagnostic accuracy, correctly identifying the exact or near diagnosis in 67% of triage cases compared to 55% and 50% for the two attending physicians assessed.

Lead author Arjun Manrai said the AI surpassed both prior models and physician benchmarks across nearly every test. The researchers stressed the models received identical information to what physicians saw in electronic records, with no preprocessing.

However, the study stopped well short of advocating clinical deployment, calling instead for formal prospective trials. Critics, including emergency physician Kristen Panthagani, cautioned that comparing AI to non-specialist physicians and equating diagnostic guessing with genuine emergency care represented a significant methodological limitation.

Need Deeper Intelligence on the AI Market?

AI Insider's Market Intelligence platform tracks funding rounds, competitive landscapes, and technology trends across the global AI ecosystem in real time. Get the data and insights your organization needs to make informed decisions.

Related Articles

Researchers Look at Impact of AI Support Tool in Real-world Primary Care Trial

A Gates Foundation-funded clinical trial found that a generative AI tool used by frontline clinicians in Kenya was safe and improved clinical decision-making, but did

a computer circuit board with a brain on it
Top 20 AI Vertical Workflow App CEOs You Need to Know in 2026

Artificial intelligence is rapidly moving beyond general-purpose chatbots and into industry-specific software that automates entire business processes. Known as vertical workflow applications, these platforms are designed

Claude’s Paying Consumer Base Grows 75% in 2026 as Anthropic Closes Gap With ChatGPT

Anthropic’s Claude is gaining significant ground with paying consumers, according to credit card transaction data from Indagari, which analyses anonymised spending patterns across approximately 28

Stay Updated with AI Insider

Get the latest AI funding news, market intelligence, and industry insights delivered to your inbox weekly.

$ 0 M

Seed round tracked

Gitar — Code Validation

Get the Weekly Briefing

Funding analysis, market intelligence, and industry trends delivered to your inbox every week.

Need bespoke intelligence?

Our team combines real-time data with decades of sector experience to guide your decisions.

Subscribe today for the latest news about the AI landscape