Aging

Harvard Study in Science: OpenAI's o1 Outperforms ER Physicians on Diagnostic Accuracy

Read in other languages: 한국어日本語
Sciences May 3, 2026 By Insights AI 1 min read 1 views Source

Study Overview

A peer-reviewed study from Harvard Medical School and Beth Israel Deaconess Medical Center, published in Science, found that OpenAI's o1 model outperformed two attending physicians in diagnosing real emergency room cases.

Key Numbers

  • 76 real ER triage cases evaluated
  • OpenAI o1 exact or near-exact diagnoses: 67%
  • Two internal medicine physicians: 55% and 50%
  • On 5 detailed clinical case studies: o1 scored 89% vs. 46 doctors using conventional search tools at 34%

Methodology

Both the model and physicians received identical, unprocessed EHR data as text. No additional images or lab data were provided, mirroring actual clinical information availability.

Significance and Caveats

Researchers emphasized augmentation over replacement — AI as a second-opinion tool for time-pressured ER clinicians. The 76-case sample size is too small for regulatory approval, and further studies covering rare diseases and complex comorbidities are needed before clinical deployment.

Source: TechCrunch

Share: Long

Related Articles

Sciences Apr 14, 2026 2 min read

OpenAI says ChatGPT is already being used at research scale across science and mathematics. In its January 2026 report, the company says advanced science and math usage reached nearly 8.4 million weekly messages from roughly 1.3 million weekly users, with early evidence that GPT-5.2 is contributing to serious mathematical work.

Comments (0)

No comments yet. Be the first to comment!

Leave a Comment