2605.28338
2026-05-28
cs.AI
SafeMed-R1: Clinician-Audited Safety and Ethics Alignment for Medical Large Language Models
SafeMed-R1: 临床医生审计的安全与伦理对齐用于医疗大语言模型
Chao Ding, Mouxiao Bian, Tianbin Li, Minjia Yuan, Yidong Jiang, Yankai Jiang, Jinru Ding, Jiayuan Chen, Zhuangzhi Gao, Pengcheng Chen, Zhao He, Rongzhao Zhang, Meiling Liu, Luyi Jiang, Jie Xu
发表机构
*
Shanghai Artificial Intelligence Laboratory(上海人工智能实验室)
;
Joint Laboratory of Biomedical Artificial Intelligence(生物医学人工智能联合实验室)
;
Shanghai Institute of Infectious Disease and Biosecurity(上海传染病与生物安全研究院)
;
Shanghai Health Development Research Center (Shanghai Medical Information Center)(上海健康发展战略研究中心(上海医疗信息中心))
;
University of Washington(华盛顿大学)
;
Department of Eye and Vision Sciences, University of Liverpool(利物浦大学眼科与视觉科学系)
;
Liverpool Centre for Cardiovascular Science, University of Liverpool(利物浦大学心血管科学中心)
;
School of Computer Science and Technology, Tongji University(同济大学计算机科学与技术学院)
AI总结
提出SafeMed-R1模型,通过可追溯的临床信任信号管道和红队压力测试实现安全与伦理对齐,在临床基准上达到79.6%的宏平均准确率,并将不安全输出减少约3-5%。