2509.12194
2026-05-26
cs.AI
cs.CV
Teaching large language models to reason like expert diagnosticians
教会大型语言模型像专家诊断医生一样推理
Thomas A. Buckley, Riccardo Conci, Peter G. Brodeur, Jason Gusdorf, Sourik Beltrán, Bita Behrouzi, Byron Crowe, Jacob Dockterman, Muzzammil Muhammad, Sarah Ohnigian, Andrew Sanchez, James A. Diao, Aashna P. Shah, Daniel Restrepo, Eric S. Rosenberg, Andrew S. Lea, Emily Glanton, Kimberly LeBlanc, Undiagnosed Diseases Network, Marinka Zitnik, Scott H. Podolsky, Zahir Kanjee, Raja-Elie E. Abdulnour, Jacob M. Koshy, Adam Rodman, Arjun K. Manrai
发表机构
*
Department of Biomedical Informatics, Harvard Medical School(哈佛医学院生物医学信息学系)
;
Department of Medicine, Beth Israel Deaconess Medical Center(贝塞斯达医院内科部)
;
The Mongan Institute, Massachusetts General Hospital(麻省总医院蒙根研究所)
;
Division of Gastroenterology, Brigham and Women’s Hospital(布里洛妇女医院胃肠病科)
;
Department of Medicine, Brigham and Women’s Hospital(布里洛妇女医院内科部)
;
Department of Medicine, Massachusetts General Hospital(麻省总医院内科部)
;
Department of Pathology, Massachusetts General Hospital(麻省总医院病理学部)
;
Department of Health Humanities and Bioethics, University of Rochester School of Medicine and Dentistry(罗切斯特大学医学院和牙科学院健康人文与生物伦理学部)
;
Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University(哈佛大学凯普纳人工智能研究所)
;
Center for the History of Medicine, Countway Library of Medicine, Harvard Medical School(哈佛医学院医学史中心,考特维图书馆)
;
Department of Global Health and Social Medicine, Harvard Medical School(哈佛医学院全球健康与社会医学部)
;
Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital(布里洛妇女医院呼吸科和重症医学科)
AI总结
提出 Dr. CaBot 代理 AI 系统,通过生成基于初始病例描述的幻灯片演示来模拟专家诊断推理,并在 NEJM CPC 和 NIH 未诊断疾病网络病例上取得优于前沿模型的表现,同时发布 CPC-Bench 基准以促进临床 AI 发展。