Decentralized Multi-Agent Systems with Shared Context
具有共享上下文的去中心化多智能体系统
Yuzhen Mao, Azalia Mirhoseini
AI总结 提出DeLM框架,通过并行智能体、共享上下文和任务队列去中心化协调,解决集中式MAS的瓶颈,在软件工程和长上下文推理中提升性能并降低成本。
详情
多智能体系统(MAS)通过将复杂问题分解为并行子任务,可以在测试时扩展大型语言模型的推理能力。然而,大多数现有的MAS依赖于集中式编排,其中主智能体分配工作、收集输出并合并结果。随着子任务数量的增长,该控制器成为通信和集成瓶颈。我们提出了去中心化语言模型(DeLM),这是一种通过并行智能体、共享验证上下文和任务队列来去中心化协调的MAS框架。智能体异步认领子任务,读取累积进度,执行局部推理,并写回紧凑的验证更新。共享上下文充当公共通信基础,使智能体能够基于彼此的验证进度进行构建,而无需通过中央控制器路由每次更新。实验上,DeLM在软件工程测试时扩展和长上下文推理方面均有所改进。在SWE-bench Verified上,DeLM在Avg.@1、Pass@2和Pass@4上均取得了最佳性能,比最强基线高出多达10.5个百分点,同时每个任务的成本降低约50%。在LongBench-v2多文档问答上,DeLM在四个前沿模型系列中取得了最高平均准确率,比最强基线高出多达5.7个百分点。代码可在我们的项目网站(此 https URL)上获取。
Multi-agent systems (MAS) can scale large language model reasoning at test time by decomposing complex problems into parallel subtasks. However, most existing MAS rely on centralized orchestration, where a main agent assigns work, collects outputs, and merges results. As the number of subtasks grows, this controller becomes a communication and integration bottleneck. We propose Decentralized Language Models (DeLM), a MAS framework that decentralizes coordination through parallel agents, a shared verified context, and a task queue. Agents asynchronously claim subtasks, read accumulated progress, perform local reasoning, and write back compact verified updates. The shared context acts as a common communication substrate, enabling agents to build on one another's verified progress without routing every update through a central controller. Empirically, DeLM improves both software-engineering test-time scaling and long-context reasoning. On SWE-bench Verified, DeLM achieves the best performance across Avg.@1, Pass@2, and Pass@4, with gains of up to 10.5 percentage points over the strongest baseline, while reducing cost per task by roughly 50%. On LongBench-v2 Multi-Doc QA, DeLM achieves the highest average accuracy across four frontier model families, improving over the strongest baseline by up to 5.7 percentage points. The code is available on our project website at https://yuzhenmao.github.io/DeLM/.