AI-clinician collaboration via disagreement prediction: A decision pipeline and retrospective analysis of real-world radiologist-AI interactions

Sanchez Morgan, Alford Kyle, Krishna Viswesh, Huynh Thanh M., Nguyen Chanh D.T., Lungren Matthew P., Truong Steven Q.H., Rajpurkar Pranav

Abstract

Clinical decision support tools can improve diagnostic performance or reduce variability, but they are also subject to post-deployment underperformance. Although using AI in an assistive setting offsets many concerns with autonomous AI in medicine, systems that present all predictions equivalently fail to protect against key AI safety concerns. We design a decision pipeline that supports the diagnostic model with an ecosystem of models, integrating disagreement prediction, clinical significance categorization, and prediction quality modeling to guide prediction presentation. We characterize disagreement using data from a deployed chest X-ray interpretation aid and compare clinician burden in this proposed pipeline to the diagnostic model in isolation. The average disagreement rate is 6.5%, and the expected burden reduction is 4.8%, even if 5% of disagreements on urgent findings receive a second read. We conclude that, in our production setting, we can adequately balance risk mitigation with clinician burden if disagreement false positives are reduced.

Publisher: Cell Reports Medicine

Article number: 101207

ISSN (Electronic): 2666-3791

Keywords

  • AI safety
  • artificial intelligence
  • clinical decision support
  • clinician workload estimation
  • computer-aided diagnosis
  • disagreement prediction
  • human-AI collaboration
  • machine learning
  • radiology

ASJC Scopus subject areas

  • Biochemistry, Genetics and Molecular Biology (all)

Publication year

2023
