Google reports early real-world feasibility for conversational diagnostic AI AMIE in primary care
Original: Exploring the feasibility of conversational diagnostic AI in a real-world clinical study View original →
What Google studied
On March 11, 2026, Google Research and Google DeepMind published results from what they describe as a first-of-its-kind prospective real-world study of AMIE, their conversational diagnostic AI, in partnership with Beth Israel Deaconess Medical Center. The system was used before ambulatory primary care visits to gather a patient history through a secure text chat and then generate a transcript and summary for the clinician. The work moves AMIE beyond simulated patient or benchmark settings into an actual care workflow, which is the threshold many medical AI projects have struggled to cross.
How the deployment worked
The study enrolled 100 adult patients with new, non-emergency complaints, and 98 of them later attended their scheduled appointment. Each AMIE interaction was observed live by a physician supervisor who could stop the session if one of four safety criteria was triggered, including signs of self-harm risk, significant distress, or potential clinical harm. Google says no safety stops were required. Before the patient saw the doctor, the system produced a pre-visit summary so the clinician could review the history in advance rather than starting from a blank chart.
What the results suggest
According to Google, blinded clinical evaluators rated AMIE and primary care providers similarly on overall differential diagnosis quality and on the appropriateness and safety of management plans, while clinicians still scored better on practicality and cost effectiveness. Google also reports that AMIE included the final diagnosis in 90% of cases and reached 75% top-3 accuracy, with strong performance maintained in the subset of cases later confirmed by diagnostic testing. Patient attitudes toward AI became more positive after using AMIE, and participating clinicians said the summaries improved visit readiness.
Why it matters
The announcement is significant because it adds prospective human-subject evidence to a field that usually relies on retrospective chart studies or controlled simulations. At the same time, Google emphasizes that this was a single-center feasibility study with a text-only interface, live physician oversight, and no controlled comparison against an alternative workflow. That means the paper is better read as evidence that supervised deployment can be safe and workable in a limited setting, not as proof that conversational diagnostic AI is ready to operate independently in routine care.
Related Articles
Google Research와 Google DeepMind가 Beth Israel Deaconess Medical Center와 함께 외래 primary care 환경에서 conversational diagnostic AI인 AMIE의 실제 feasibility study를 공개했다. overall management plan과 differential diagnosis에서는 의사와 비슷한 수준을 보였지만, practicality와 cost-effectiveness에서는 여전히 physician이 우세했다.
Google와 Taiwan의 NHIA는 20년 넘게 축적한 건강 데이터, AI-on-DM model, Gemini 기반 assistant를 결합해 국가 단위 당뇨 위험 선별을 확대하고 있다고 밝혔다. Google에 따르면 1건 평가 시간을 20분에서 25초로 줄였고, 정부 앱 사용자 10 million명에게 개인화된 guidance도 제공할 계획이다.
Google이 Imperial College London, 영국 NHS와 진행한 연구에서 AI가 기존 screening이 놓친 interval cancer의 25%를 찾아냈다고 밝혔다. 두 편의 Nature Cancer 연구는 workload 절감 가능성과 함께, 실제 임상 도입에는 신뢰와 calibration이 필요하다는 점도 보여준다.