ClinicBot: A Guideline-Grounded Clinical Chatbot with Prioritized Evidence RAG and Verifiable Citations
ClinicBot:基于指南的临床聊天机器人,具有优先证据RAG和可验证引用
Navapat Nananukul, Mayank Kejriwal
AI总结 提出ClinicBot系统,通过结构化提取指南、按临床重要性排序证据和多智能体协作,生成准确、可验证的临床回答。
详情
临床诊断需要准确、可验证且明确基于官方指南的答案。虽然大型语言模型在自然语言处理方面表现出色,但它们产生幻觉的倾向削弱了其在需要精确性的高风险医疗环境中的实用性。现有的检索增强生成(RAG)系统对所有证据一视同仁,产生嘈杂的上下文和与临床实践不符的通用答案。我们提出ClinicBot,一个通过三项关键进展将指南建议转化为可信临床支持的人工智能系统:(1)将临床指南结构化提取为语义单元(建议、表格、定义、叙述)并带有明确的出处;(2)证据优先级排序,根据临床重要性和指南结构而非文本相似性对内容进行排序;(3)一个基于Web的界面,呈现简洁、可操作的答案及可验证的证据。我们将使用真实患者的糖尿病问题以及一个忠实于美国糖尿病协会(ADA)《糖尿病护理标准(2025)》的额外糖尿病风险评估工具来演示ClinicBot。演示将说明语义知识提取和分层证据排名如何在多智能体设置中可靠运行,以大规模处理复杂的临床指南。
Clinical diagnosis requires answers that are accurate, verifiable, and explicitly grounded in official guidelines. While large language models excel at natural language processing, their tendency to hallucinate undermines their utility in high-stakes medical contexts where precision is essential. Existing retrieval-augmented generation (RAG) systems treat all evidence equally, producing noisy context and generic answers misaligned with clinical practice. We present ClinicBot, an AI system that translates guideline recommendations into trustworthy clinical support through three key advances: (1) structured extraction of clinical guidelines into semantic units (recommendations, tables, definitions, narrative) with explicit provenance, (2) evidence prioritization that ranks content by clinical significance and guideline structure rather than textual similarity, and (3) a web-based interface that presents concise, actionable answers with verifiable evidence. We will demonstrate ClinicBot using diabetes questions from real patients and an additional diabetes risk assessment tool that is faithful to the American Diabetes Association (ADA) Standards of Care in Diabetes (2025). The demonstration will illustrate how semantic knowledge extraction and hierarchical evidence ranking can reliably operate in a multi-agent setting to process complex clinical guidelines at scale.