Exploring the Future of AI in Clinical Collaboration: A Study on Tumor Board Case Preparation

Picture of Amanda K Hall
Amanda K Hall
Picture of Ruican Zhong
Ruican Zhong
Picture of Selin S. Everett
Selin S. Everett
Picture of Alyssa Unell
Alyssa Unell
Picture of Hanwen Xu
Hanwen Xu
Picture of Matthias Blondeel
Matthias Blondeel
Picture of Jonathan Carlson
Jonathan Carlson
Picture of Katie Claveau
Katie Claveau
Picture of Thulasee Jose
Thulasee Jose
Picture of Tristan Naumann
Tristan Naumann
Picture of David C. Rhew
David C. Rhew
Picture of Naiteek Sangani
Naiteek Sangani
Picture of Frank Tuan
Frank Tuan
Picture of James Weinstein
James Weinstein
Picture of Elizabeth D Mynatt
Elizabeth D Mynatt
Picture of Scott Saponas
Scott Saponas
Picture of Hao Qiu
Hao Qiu
Picture of Leonardo Schettini
Leonardo Schettini
Picture of Joseph Samuel Preston
Joseph Samuel Preston
Picture of Aiden Gu
Aiden Gu
Picture of Naoto Usuyama
Naoto Usuyama
Picture of Zelalem Gero
Zelalem Gero
Picture of Cliff Wong
Cliff Wong
Picture of Noel Christopher Codella
Noel Christopher Codella
Picture of Hoifung Poon
Hoifung Poon
Picture of Shrey Jain
Shrey Jain
Picture of Matthew Lungren
Matthew Lungren
Picture of Eric Horvitz
Eric Horvitz
Published at Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems 2026
  • Best Paper Honorable Mention

Abstract

Multidisciplinary tumor boards (MTBs) bring specialists together to identify therapies for complex cancer cases, but preparing for them is time-intensive. Clinicians must extract key details from extensive records and evaluate treatment options. While large language models (LLMs) show promise in medicine for basic tasks like summarizing notes, little is known about their role in high-stakes tasks like MTB preparation. We conducted a mixed-methods study with 16 oncologists using two AI systems to prepare patient cases for MTB: an off-the-shelf assistant (Copilot) and a task-specific multi-agent system (Healthcare Agent Orchestrator, HAO). We analyzed oncologist prompts, AI responses, and oncologists' perception of AI. Participants showed greater willingness to adopt HAO but were often overconfident in AI summaries and skeptical of AI-recommended therapies. Trust calibration strategies, such as source links and agent-trajectories, failed to align trust with system capabilities. We conclude with how AI systems should be built to support clinicians in high-stakes tasks.

Materials