Google DeepMind introduced the AI co-clinician, a multimodal AI agent research initiative to support healthcare workers. In blind evaluations, the system achieved zero critical errors in 97 of 98 primary care queries and matched physician-level performance on medication knowledge benchmarks.
AI vs. Emergency Room Doctors: The 2026 Diagnostic Studies
Current state
A collection of 2026 studies showing AI matching or outperforming ER physicians — from DeepMind's AI co-clinician to the Harvard-Science paper on OpenAI o1 diagnosing 67% of ER patients correctly.
What changed recently
- Google DeepMind's AI Co-Clinician: Zero Critical Errors in 97 of 98 Primary Care Queries
- Harvard Study: OpenAI's o1 Diagnoses 67% of ER Patients Correctly, Outperforming Doctors
- Harvard Study in Science: OpenAI's o1 Outperforms ER Physicians on Diagnostic Accuracy
Key tensions
Signals to watch
- Momentum and new coverage around “healthcare”
- Momentum and new coverage around “google-deepmind”
- Momentum and new coverage around “openai”
Timeline
A Harvard Medical School trial found OpenAI's o1 reasoning model correctly diagnosed 67% of ER patients, beating human physicians at 50-55%. With more clinical data, o1 reached 82% accuracy and dominated long-term treatment planning at 89% versus doctors' 34%.
A peer-reviewed study published in Science tested OpenAI's o1 on 76 real ER triage cases and found it achieved exact or near-exact diagnoses 67% of the time, versus 55% and 50% for two attending physicians who received identical patient data.
A new study published in Science found that a state-of-the-art LLM matched or exceeded human emergency physicians in diagnostic choices, emergency triage, and next-step management decisions using real ER data and hundreds of physician comparisons. Researchers say the results call for collaborative care models, not AI replacement of doctors.
The important medical AI story here is not replacement but reliability. Google DeepMind says its AI co-clinician produced zero critical errors in 97 of 98 realistic primary-care queries, while physicians still beat it overall in multimodal telemedicine simulations.