arXivDaily arXiv每日学术速递 周一至周五更新

AI 大模型

RAG / 检索增强生成

检索增强生成、向量检索、知识库问答和面向大模型的搜索系统。

今日/当前日期收录 1 信号源:cs.IR, cs.CL, cs.AI, cs.DB
2606.18381 2026-06-18 cs.CL cs.IR 新提交 92%

SproutRAG: Attention-Guided Tree Search with Progressive Embeddings for Long-Document RAG

SproutRAG: 基于注意力引导的树搜索与渐进嵌入的长文档RAG

Amirhossein Abaskohi, Issam H. Laradji, Peter West, Giuseppe Carenini

发表机构 * University of British Columbia(不列颠哥伦比亚大学) ServiceNow Research(ServiceNow研究院)

专题命中 长文档RAG :提出注意力引导的层次化RAG框架,解决长文档检索粒度与上下文连贯性权衡。

AI总结 提出SproutRAG,通过注意力引导构建句子级分块树,实现多粒度检索,无需额外LLM调用,平均信息效率提升6.1%。

详情
AI中文摘要

检索增强生成(RAG)系统必须平衡检索粒度与上下文连贯性,现有方法通过LLM引导的分块、单级上下文扩展或层次摘要来解决这一挑战。这些方法在索引或检索过程中依赖昂贵的LLM调用,将上下文聚合限制在单一粒度级别,或通过摘要引入信息损失。我们提出SproutRAG,一种注意力引导的层次化RAG框架,通过将句子级块组织成逐渐增大但语义连贯的单元,利用学习到的句子间注意力构建二分块树,从而解决这一权衡。与依赖外部LLM、固定上下文扩展或有损摘要的先前方法不同,SproutRAG学习哪些注意力头和层最能捕捉语义文档结构,实现无需额外LLM调用或压缩摘要的多粒度检索。在检索时,SproutRAG使用层次化束搜索检索多个粒度的候选,捕获超越平面检索的多句子相关性。该框架通过联合目标进行端到端训练,同时改进嵌入和树结构。在涵盖科学、法律和开放域设置的四个基准上的实验表明,SproutRAG在最强基线上平均信息效率(IE)提升6.1%。代码可在该https URL获取。

英文摘要

Retrieval-augmented generation (RAG) systems must balance retrieval granularity with contextual coherence, a challenge that existing methods address through LLM-guided chunking, single-level context expansion, or hierarchical summarization. These approaches variously depend on costly LLM calls during indexing or retrieval, limit context aggregation to a single granularity level, or introduce information loss through summarization. We present SproutRAG, an attention-guided hierarchical RAG framework that addresses this trade-off by organizing sentence-level chunks into progressively larger but semantically coherent units, using learned inter-sentence attention to construct a binary chunking tree. Unlike prior approaches that rely on external LLMs, fixed context expansion, or lossy summarization, SproutRAG learns which attention heads and layers best capture semantic document structure, enabling multi-granularity retrieval without additional LLM calls or compressed summaries. At retrieval time, SproutRAG uses hierarchical beam search to retrieve candidates at multiple granularities, capturing multi-sentence relevance beyond flat retrieval. The framework is trained end-to-end with a joint objective that improves both embeddings and tree structure. Experiments across four benchmarks spanning scientific, legal, and open-domain settings demonstrate that SproutRAG improves information efficiency (IE) by 6.1% on average over the strongest baseline. Code is available on https://github.com/AmirAbaskohi/SproutRAG.