arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2088
2605.27332 2026-05-27 cs.SE cs.AI cs.CV

EdgeFlow: Edge-Map Augmented VLM-Based Flowchart Processing for Industrial Requirements Engineering

EdgeFlow: 基于边缘图增强的VLM流程图处理用于工业需求工程

Zhifei Dou, Shabnam Hassani, Ou Wei

AI总结 提出EdgeFlow方法,通过向视觉语言模型(VLM)输入添加Canny边缘图作为结构先验,无需训练数据或微调即可提升流程图到Mermaid代码的转换精度,在工业数据集上节点F1提升17.39%,边F1提升16.94%。

Comments 10 pages

详情
AI中文摘要

流程图广泛应用于工业需求中,但通常以静态图像形式嵌入。视觉语言模型(VLM)在将这些流程图转换为机器可读模型以支持需求工程活动方面显示出潜力,然而,当直接应用于流程图转换时,它们常常在拓扑关键视觉细节上失败。为了解决这个问题,我们提出了EdgeFlow,它通过向VLM的原始输入添加确定性提取的Canny边缘图——作为结构先验——来改进流程图到Mermaid的转换,无需标注训练数据或领域特定的模型微调。我们在IndusReqFlow(一个来自真实世界需求的数据集)上评估了EdgeFlow。与现成的VLM相比,EdgeFlow将节点级F1提高了17.39个百分点,边级F1提高了16.94个百分点。在路径级别,EdgeFlow将路径F1提高了11.06个百分点,从而更好地支持基于模型的测试。这些结果表明,EdgeFlow提供了一种实用的、无需训练的方法,用于改进工业需求工程中保持拓扑结构的流程图到Mermaid转换。在公共合成基准上的跨数据集评估结果显示没有显著改进;这凸显了需要包含工业数据的多样化基准,以全面评估未来基于VLM的需求工程工具。

英文摘要

Flowcharts are widely used in industrial requirements, but usually remain embedded as static images. Vision Language Models (VLMs) show promise in the conversion of these flowcharts into machine-readable models for RE activities, yet, when directly applied to flowchart conversion, they often fail on topology-critical visual details. To address this, we propose EdgeFlow that augments a VLM's original input with a deterministically extracted Canny edge map-acting as a structural prior-to improve flowchart-to-Mermaid conversion, without requiring annotated training data or domain-specific model fine-tuning. We evaluate EdgeFlow on IndusReqFlow, a dataset sourced from real-world requirements. Compared with off-the-shelf VLMs, EdgeFlow improves node-level F1 by 17.39 percentage points and edge-level F1 by 16.94 percentage points. At the path level, EdgeFlow improves path F1 by 11.06 percentage points, enabling better support for model-based testing. These results demonstrate that EdgeFlow provides a practical, training-free means to improve topology-preserving flowchart-to-Mermaid conversion for industrial RE. Cross-dataset evaluation results on a public synthetic benchmark show no significant improvement; this highlights the need for diverse benchmarks incorporating industrial data for the comprehensive evaluation of future VLM-based RE tools.

2605.27328 2026-05-27 cs.SE cs.AI cs.MA

Governed Evolution of Agent Runtimes through Executable Operational Cognition

通过可执行操作认知实现代理运行时的受控演化

Mariano Garralda-Barrio

AI总结 本文提出一个框架,通过可执行操作认知实现多智能体系统中代理生成工件的受控运行时演化,引入HarnessMutation机制在验证、可追溯、评估和回滚约束下进行生命周期感知的运行时适应。

Comments 14 pages, 4 figures, 1 table. Reference implementation and associated source code available at: https://github.com/mgarralda/governed-runtime

详情
AI中文摘要

近期智能体系统的进展越来越将代码视为可执行的操作基底,而非可丢弃的输出工件。先前的工作如\emph{Code as Agent Harness}将经过验证的智能体生成工件视为运行时实体,可以在长时间运行的认知循环中创建、执行、修订、持久化和重用。然而,这些工件的治理、生命周期管理和操作演化仍未被充分定义。 本文提出了一个通过可执行操作认知实现多智能体系统中受控运行时演化的框架。我们将智能体生成工件形式化为持久的运行时能力,这些能力逐渐成为操作基底的一部分,而非瞬时的中间输出。基于这一视角,我们引入了\emph{HarnessMutation}作为一种受控机制,用于在明确的验证、可追溯性、评估和回滚约束下进行生命周期感知的运行时适应。 该框架不将运行时适应视为无限制的自我修改,而是将演化建模为在持久操作记忆上的有界且可观察的过程。它进一步展示了这些思想如何在现代智能体运行时和面向治理的编排系统上实现,为适应性基础设施提供了概念基础,使其演化保持明确、可审计且受约束。

英文摘要

Recent advances in agentic systems increasingly treat code as an executable operational substrate rather than as a disposable output artifact. Prior work such as \emph{Code as Agent Harness} frames validated agent-generated artifacts as runtime entities that can be created, executed, revised, persisted, and reused within long-running cognitive loops. However, the governance, lifecycle management, and operational evolution of such artifacts remain under-specified. This paper proposes a framework for governed runtime evolution in multi-agent systems through executable operational cognition. We formalize agent-generated artifacts as persistent runtime capabilities that progressively become part of the operational substrate rather than transient intermediate outputs. Building on this perspective, we introduce \emph{HarnessMutation} as a governed mechanism for lifecycle-aware runtime adaptation operating under explicit validation, traceability, evaluation, and rollback constraints. Rather than treating runtime adaptation as unrestricted self-modification, the proposed framework models evolution as a bounded and observable process over persistent operational memory. It further shows how these ideas can be operationalized over modern agent runtimes and governance-oriented orchestration systems, providing a conceptual foundation for adaptive infrastructures whose evolution remains explicit, auditable, and constrained.

2605.27299 2026-05-27 cs.CR cs.AI cs.HC cs.LG cs.SY eess.SY

Risk Averse Alert Prioritization for IDS Using Subnormal Gaussian Fuzzy Models

使用次正态高斯模糊模型的IDS风险规避警报优先级排序

Murat Moran

AI总结 提出基于次正态高斯模糊数的警报优先级排序框架,通过建模威胁严重性、检测置信度和组织风险态度三种不确定性,利用排序指数实现可调安全姿态,实验证明在检测器退化下比基线方法更鲁棒。

详情
AI中文摘要

现代入侵检测系统每天生成数千条警报,但由于误报或低影响事件过多,警报疲劳严重限制了安全运营的有效性。我们通过提出一个基于次正态高斯模糊数的原则性警报优先级排序框架来解决这个问题,该框架明确建模了三种不确定性来源:威胁严重性、检测置信度和组织风险态度。每个警报被表示为一个模糊数,其核心表示严重性,展度表示不确定性,高度反映检测可靠性。我们应用排序指数对警报进行优先级排序,允许组织通过风险态度参数调整安全姿态。在CIC-IDS2017和NSL-KDD上的实验验证表明,在检测器退化下,该方法比基线方法具有更强的鲁棒性(NDCGrel@100为0.9963对比0.8215),在中等置信度警报中具有明显区分度,在稳健检测器下与基线方法接近。该框架具有理论基础、计算效率高、提供可解释推理,并且在检测器系列和校准错误场景下保持鲁棒性。

英文摘要

Modern intrusion detection systems generate thousands of alerts daily, but alert fatigue severely limits security operations effectiveness due to too many false positives or low-impact events. We address this by proposing a principled framework for alert prioritization based on subnormal Gaussian fuzzy numbers, explicitly modeling three sources of uncertainty: threat severity, detection confidence, and organizational risk attitude. Each alert is represented as a fuzzy number with the core indicating severity, spread indicating uncertainty, and height reflecting detection reliability. We apply ranking indices to prioritize alerts, allowing organizations to tune security posture through a risk-attitude parameter. Experimental validation on CIC-IDS2017 and NSL-KDD demonstrates greater robustness than baselines under detector degradation (0.9963 vs 0.8215 NDCGrel@100), with distinct differentiation in mid-confidence alerts and near-parity with baselines under robust detectors. The framework is theoretically grounded, computationally efficient, provides interpretable reasoning, and remains robust across detector families and miscalibration scenarios.

2605.27246 2026-05-27 cs.LO cs.AI math.LO

Many Logics, One Methodology: A Plea for Logical Pluralism in Formalised Reasoning (preprint)

多种逻辑,一种方法论:在形式化推理中倡导逻辑多元主义(预印本)

Christoph Benzmüller, Daniel Kirchner, Luca Pasetto

AI总结 本文基于LogiKEy逻辑多元知识表示与推理方法论,主张在统一元逻辑框架内支持对象逻辑层面的逻辑多元主义,并警告逻辑帝国主义对跨学科复用的阻碍。

Comments 21 pages, 6 figures; to appear (preprint)

详情
AI中文摘要

这份立场声明回顾了二十年来在经典高阶逻辑(HOL)中浅嵌入非经典逻辑的工作,该研究扩展为HOL中的一系列逻辑嵌入,并启发了LogiKEy逻辑多元知识表示与推理方法论。本文在LogiKEy等统一元逻辑框架内,以计算形而上学为基础,论证了对象逻辑层面的逻辑多元主义。更广泛地说,它倡导现代证明助手对逻辑多元主义的原则性支持,并警告逻辑帝国主义——即在大规模理论发展中僵化采用单一基础逻辑——这阻碍了LogiKEy旨在实现的跨学科复用。

英文摘要

This position statement looks back on two decades of work on shallow embeddings of non-classical logics in classical higher-order logic (HOL), a line of research that expanded into a range of logic embeddings in HOL and inspired the LogiKEy logic-pluralistic knowledge representation and reasoning methodology. This paper advances the case for logical pluralism at object-logic level within a unifying meta-logical framework such as LogiKEy, grounding the argument in computational metaphysics. More broadly, it advocates principled support for logical pluralism in modern proof assistants, and cautions against logical imperialism -- the rigid adoption of a single foundational logic for large-scale theory developments -- which impedes the interdisciplinary reuse that LogiKEy is designed to enable.

2605.27210 2026-05-27 quant-ph cs.AI

Qiskit QuantumKatas: Adapting Microsoft's Quantum Computing exercises for LLM evaluation

Qiskit QuantumKatas: 为LLM评估改编微软的量子计算练习

Juan Cruz-Benito, Ismael Faro

AI总结 本文将微软的QuantumKatas量子计算课程从Q#移植到Qiskit,并构建评估框架,用于系统评估大型语言模型在量子计算任务上的能力。

详情
AI中文摘要

我们将微软的QuantumKatas——一个成熟的量子计算课程——从Q#改编到最广泛采用的量子计算框架Qiskit,并打包一个用于系统LLM评估的评估框架。由此产生的基准测试包含26个类别中的350个任务,涵盖从基本门到高级算法(Grover、Simon、Deutsch-Jozsa)、纠错、密钥分发和量子游戏。每个任务包括自然语言提示、规范解和通过经典电路模拟的确定性测试验证。通过基于QuantumKatas经过验证的教学设计而不是从头创建任务,我们继承了有原则的难度递进和全面的概念覆盖,同时贡献了框架改编、评估基础设施和实证分析。我们评估了7种提示配置下的16个LLM——总共39,200次模型运行——以证明基准测试的实用性。三个关键发现出现:(1)基准测试有效区分模型能力,最佳配置通过率从32.3%到83.1%不等,前沿模型与开源模型之间平均差距为26.1个百分点;(2)模型在实现已知算法方面表现良好(SimonsAlgorithm 82.1%,BasicGates 81.6%),但在问题编码方面表现不佳(SolveSATWithGrover 34.4%,DistinguishUnitaries 40.0%);(3)思维链提示显示出适度双峰效应——它是三个模型的最佳策略(其中两个根据供应商文档明确进行了推理调优),但降低了其余模型的性能,使其总体上处于中游(平均56.3%),落后于少样本-5(57.8%)。我们发布基准测试、评估框架和基线结果,以支持量子计算中LLM能力的研究。

英文摘要

We adapt Microsoft's QuantumKatas -- a well-established quantum computing curriculum -- from Q# to Qiskit, the most widely-adopted quantum computing framework, and package it with an evaluation framework for systematic LLM assessment. The resulting benchmark comprises 350 tasks across 26 categories, spanning fundamental gates through advanced algorithms (Grover's, Simon's, Deutsch-Jozsa), error correction, key distribution, and quantum games. Each task includes a natural language prompt, canonical solution, and deterministic test verification via classical circuit simulation. By building on the QuantumKatas' proven pedagogical design rather than creating tasks from scratch, we inherit a principled difficulty progression and comprehensive concept coverage while contributing the framework adaptation, evaluation infrastructure, and empirical analysis. We evaluate 16 LLMs across 7 prompting configurations -- a total of 39,200 model runs -- to demonstrate the benchmark's utility. Three key findings emerge: (1) the benchmark effectively differentiates model capabilities, with best-configuration pass rates ranging from 32.3% to 83.1% and a 26.1 pp average gap between frontier and open-source models; (2) models perform well at implementing known algorithms (SimonsAlgorithm 82.1%, BasicGates 81.6%) but struggle with problem encoding (SolveSATWithGrover 34.4%, DistinguishUnitaries 40.0%); and (3) chain-of-thought prompting shows a modestly bimodal effect -- it is the best strategy for three models (two of them explicitly reasoning-tuned per vendor documentation) but degrades performance for the rest, leaving it mid-pack in aggregate (56.3% mean) behind few-shot-5 (57.8%). We release the benchmark, evaluation framework, and baseline results to support research on LLM capabilities in quantum computing.

2605.27205 2026-05-27 eess.IV cs.AI

TWIST: Closed-Loop token Synchronization for Application-Aware Wireless Digital Twins

TWIST:面向应用感知无线数字孪生的闭环令牌同步

Sige Liu, Kezhi Wang

AI总结 提出TWIST框架,通过闭环令牌同步和模式条件不等错误保护,在有限通信资源下实现应用感知的无线数字孪生状态同步,提升交通状态推断性能并降低同步成本。

详情
AI中文摘要

无线数字孪生需要在有限且时变的通信资源下,对随时间演变的物理场景及其数字副本进行重复同步。对于以感知为中心的数字孪生,像素域传输或均匀保护的比特流可能与孪生侧应用消耗的语义状态不匹配。本文提出TWIST,一种面向应用感知无线数字孪生的闭环令牌同步框架。TWIST将每个物理观测表示为一个令牌,并通过无线链路同步该状态,而非优化视觉重建。令牌位置按任务相关性分组,并通过低、中、高同步模式下的模式条件不等错误保护进行保护。在孪生侧,解码置信度将不可靠的硬令牌决策转换为擦除,在更新语义孪生状态之前由补全模型恢复。恢复后的状态支持交通状态推断,并生成紧凑的反馈统计信息,包括信道质量、接收器不确定性、语义漂移和应用优先级,用于后续模式自适应。在动态道路场景数字孪生场景上的实验表明,与固定模式和仅信道自适应策略相比,TWIST改善了交通状态推断和语义孪生状态同步,同时相对于始终高传输降低了平均同步成本。

英文摘要

Wireless digital twins require repeated synchronization between a time-evolving physical scene and its digital counterpart under limited and time-varying communication resources. For perception-centric twins, pixel-domain transmission or uniformly protected bitstreams can be mismatched to the semantic state consumed by twin-side applications. This paper proposes TWIST, a closed-loop token synchronization framework for application-aware wireless digital twins. TWIST represents each physical observation as a token and synchronizes this state over a wireless link, rather than optimizing visual reconstruction. Token positions are grouped by task relevance and protected through mode-conditioned unequal error protection under low-, medium-, and high-synchronization modes. At the twin side, decoding confidence converts unreliable hard token decisions into erasures, which are restored by a completion model before updating the semantic twin state. The recovered state supports traffic-state inference and generates compact feedback statistics, including channel quality, receiver uncertainty, semantic drift, and application priority, for subsequent mode adaptation. Experiments on a dynamic road-scene digital-twin scenario show that TWIST improves traffic-state inference and semantic twin-state synchronization compared with fixed-mode and channel-only adaptation strategies, while reducing the average synchronization cost relative to always-high transmission.

2605.27139 2026-05-27 eess.IV cs.CV physics.ins-det

Unsupervised Deep Image Prior for Sparse-View and Limited-Angle Electron Tomography

无监督深度图像先验用于稀疏视角和有限角度电子断层扫描

Serge Brosset, Daniel del Pozo Bueno, Thomas David, Laure Guetaz, Philippe Ciuciu, Zineb Saghi

AI总结 提出无监督深度图像先验方法,在稀疏视角和有限角度条件下实现与监督方法相当的电子断层重建性能,并应用于实验数据验证其可靠性。

Comments 22 pages, 12 figures

详情
AI中文摘要

电子断层扫描(ET)在纳米材料的三维(3D)表征中发挥着重要作用。然而,在有限角度和稀疏视角条件下,传统算法会产生退化的重建结果,影响所得3D数据的质量和可解释性。本文提出深度图像先验(DIP),一种无监督的深度学习(DL)方法,用于高度退化的断层扫描采集,并通过模拟数据证明,即使在倾斜范围仅为60°、倾斜步长为10°的情况下,其性能也与需要训练数据集的监督方法相当。然后,我们将其应用于实验数据,并表明它在稀疏视角和有限角度条件下都能实现可靠的3D量化,突显了其在广泛材料和采集模式中的潜力。

英文摘要

Electron tomography (ET) plays an important role in the three-dimensional (3D) characterization of nanomaterials. However, under limited-angle and sparse-view conditions, conventional algorithms produce degraded reconstructions, which compromise the quality and interpretability of resulting 3D data. In this paper, we present deep image prior (DIP), an unsupervised deep learning (DL) approach, for highly degraded tomography acquisitions and demonstrate, using simulated data, that its performance is comparable to that of supervised approaches requiring training datasets, even for tilt ranges as limited as 60° and tilt increments of 10°. We then apply it to experimental data and show that it enables reliable 3D quantification under both sparse-view and limited-angle conditions, highlighting its potential for a wide range of materials and acquisition modalities.

2605.27135 2026-05-27 cs.CR cs.CV

Do Modern Post-Hoc Watermarking Methods Beat Broken-Arrows?

现代事后水印方法能否击败断箭?

Enoal Gesny, Eva Giboulot

AI总结 本文通过公平比较现代与经典事后水印方法在多种攻击下的鲁棒性和安全性,发现经典方法在现实场景中更优。

详情
AI中文摘要

随着扩散模型等生成模型的快速普及,数字水印已成为识别AI生成图像的关键解决方案。现代事后水印方案利用神经网络实现极低的误报率,同时对常见图像变换保持鲁棒性。然而,这些现代方法与经典方法之间缺乏比较,特别是在鲁棒性和安全性优先于极低误报概率的现实场景中。本文提出了现代与经典事后水印在多种经典增强和近期复杂攻击下的鲁棒性和安全性的公平比较。实验表明,在现实场景中,经典水印在保持鲁棒性的同时,在安全性方面优于现代技术。

英文摘要

With the rapid proliferation of generative models, such as diffusion models, digital watermarking has emerged as a crucial solution for identifying AI-generated images. Modern post-hoc watermarking schemes use neural networks to achieve an extremely low false-alarm rate while remaining robust to common image transformations. However, there is a lack of comparison between these modern methods and classic ones, particularly in real-world scenarios where robustness and security take precedence over achieving an extremely low false-alarm probability. In this paper, we propose a fair comparison of robustness and security between modern and classic post-hoc watermarking across various types of classic augmentations and recent sophisticated attacks. Our experiments show that, in a realistic scenario, classic watermarking outperforms modern techniques in terms of security while maintaining robustness.

2605.27131 2026-05-27 cs.ET cs.AI cs.DB

Beyond the Data Mesh Illusion: Designing Modern AI-augmented Lakehouses to Bridge the Gap Between Theory and Practice

超越数据网格幻象:设计现代AI增强型湖仓以弥合理论与实践差距

Oliver Angélil, Jan Migon

AI总结 针对企业数据平台中领域自服务与整体治理之间的张力,提出一种基于现代湖仓架构的AI增强型中心辐射模型,通过中心卓越中心提供共享服务与AI治理,领域团队逐步承担更多责任,以平衡灵活性与控制,并通过数据产品采纳率、查找时间和洞察时间三个指标评估架构效果。

Comments 11 pages, 5 figures

详情
AI中文摘要

企业数据平台面临着领域自服务与整体治理之间的持久张力。数据网格范式提出了去中心化的领域所有权作为解决方案,但纯粹的实现往往效果不佳:团队在没有足够的平台成熟度、工具或协调机制的情况下继承了新的责任。本文认为,通过在现代湖仓架构上叠加AI增强的中心辐射模型,可以缓解灵活性与控制之间的权衡。中心枢纽(卓越中心)提供共享平台服务、策略自动化和AI驱动的治理,自动标准化数据产品、生成质量规则、起草数据合约并审查变更以检测回归。领域辐条拥有业务语义、产品积压和本地迭代节奏,随着成熟度提高逐步承担更多责任。执行治理任务的同一LLM也降低了领域从业者发展跨业务和数据工程的真正跨职能专业知识的门槛,使辐条团队能够承担更大的端到端所有权,而无需按比例增加对中心的依赖。自然语言对话界面进一步为业务用户民主化访问,释放了历史上未充分利用的企业数据。在组织方面,我们提出了一个分阶段框架,将所有权从中心转移到辐条,避免了集中式瓶颈和不协调的去中心化。我们通过三个结果指标评估架构:数据产品采纳率、查找时间和洞察时间,这些指标将平台成功与可衡量的业务价值而非内部活动联系起来。

英文摘要

Enterprise data platforms face an enduring tension between domain self-service and holistic governance. The data mesh paradigm proposed decentralized domain ownership as a remedy, but pure implementations frequently underdeliver: teams inherit new responsibilities without the platform maturity, tooling, or coordination mechanisms needed to exercise them effectively. This paper argues that the flexibility-versus-control trade-off can be relaxed through an AI-augmented hub-and-spoke model layered on a modern lakehouse architecture. A central hub (Center of Excellence) provides shared platform services, policy automation, and AI-enabled governance, automatically standardizing data products, generating quality rules, drafting data contracts, and reviewing changes for regressions. Domain spokes own business semantics, product backlogs, and local iteration cadence, progressively assuming greater responsibility as they mature. The same LLMs that automate governance tasks also lower the barrier for domain practitioners to develop genuine cross-functional expertise spanning business and data engineering, enabling spoke teams to take on greater end-to-end ownership without proportionally increasing their dependence on the hub. Natural-language conversational interfaces further democratize access for business users, exposing historically underutilized enterprise data. On the organizational side, we propose a staged framework that shifts ownership from hub to spokes, avoiding both centralized bottlenecks and uncoordinated decentralization. We evaluate the architecture through three outcome metrics: data product adoption, time-to-find, and time-to-insight, that tie platform success to measurable business value rather than internal activity.

2605.27110 2026-05-27 cs.CR cs.CL

BAIT: Boundary-Guided Disclosure Escalation via Self-Conditioned Reasoning

BAIT: 基于边界引导的自我条件推理披露升级

Xuan Luo, Yue Wang, Geng Tu, Jing Li, Ruifeng Xu

AI总结 提出BAIT三步越狱框架,通过识别保护边界、细化边界和请求详细示例,利用模型自身推理和一致性倾向实现恶意目标披露,实验表明在多个基准上攻击成功率显著优于传统方法。

详情
AI中文摘要

在这项工作中,我们提出了BAIT(边界感知迭代陷阱),一个三步越狱框架,通过内部披露接近恶意目标。BAIT首先要求模型识别保护边界,然后要求其细化该边界,最后请求一个详细示例。通过在每个步骤中扩展模型之前的响应,BAIT将模型自身的推理和一致性倾向转变为披露路径。在AdvBench、JailbreakBench、AIR-Bench和SORRY-Bench上的实验表明,BAIT在顶级大语言模型上持续实现高攻击成功率,显著超越了传统的越狱基线。进一步分析揭示:1)预防导向的框架显著优于直接知识请求;2)细化步骤在披露升级中起关键作用;3)前两步有一定概率引发有害内容,同时触发很少的过滤。

英文摘要

In this work, we propose BAIT (Boundary-Aware Iterative Trap), a three-step jailbreak framework that approaches malicious goals through internal disclosure. BAIT first asks the model to identify the protection boundary, then requires it to refine that boundary, and finally requests a detailed example. By expanding each step upon the model's previous responses, BAIT turns the model's own reasoning and consistency tendency into a disclosure pathway. Experiments on AdvBench, JailbreakBench, AIR-Bench, and SORRY-Bench demonstrate that BAIT consistently achieves strong attack success rates across top-tier large language models, significantly advancing conventional jailbreak baselines. Further analysis reveals that: 1) prevention-oriented framing significantly outperforms direct knowledge request; 2) the refinement step plays a critical role in disclosure escalation; and 3) the first two steps have a certain chance of eliciting harmful content while triggering little filtering.

2605.27093 2026-05-27 stat.ML cs.LG

Gaussian Process-based learning with new MCMC-based implementation of Wishart prior on correlation matrix

基于高斯过程的学习:相关矩阵上Wishart先验的新MCMC实现

Kane Warrior, Dalia Chakrabarty

AI总结 提出一种自组装Wishart先验用于协方差矩阵,结合MCMC对核超参数进行贝叶斯推断,通过回溯窗口引入自适应性,有效诊断弱信息输入。

详情
AI中文摘要

在输入-输出关系的概率监督学习中(作为高斯过程(GP)的样本函数),通常为核的超参数指定先验,这些超参数参数化GP的协方差函数,其中(所得多元正态)似然的诱导协方差矩阵控制学习和预测。当所寻求的函数高度多元时,必须同时学习多个长度尺度参数,使得推断困难。我们为协方差矩阵开发了一种“自组装”Wishart先验,同时使用MCMC对核超参数进行贝叶斯推断。该构造使用最近MCMC迭代的回溯窗口来定义依赖于时间步长的尺度矩阵,从而为链引入自适应性。结果表明,在基于GP的学习范式中,对协方差矩阵的直接先验指定可用于诊断弱信息输入。我们通过两个不同的实证示例支持我们的先验开发——一个基于合成数据,另一个基于真实世界数据集。

英文摘要

In probabilstic supervised learning of an input-output relationship - as a sample function of a Gaussian Process (GP) - priors are typically specified for the hyperparameters of the kernel that parametrises the covariance function of the GP, where the induced covariance matrix of the (resulting multivariate Normal) likelihood, governs the learning and prediction. When the sought function is highly multivariate, multiple lengthscale parameters must be learnt simultaneously, making inference difficult. We develop a ``self-assembled'' Wishart prior for the covariance matrix, while undertaking Bayesian inference on the kernel hyperparameters using MCMC. The construction uses a look-back window over recent MCMC iterations to define a time-step dependent scale matrix, thereby introducing adaptiveness to the chain. Results suggest that direct prior specification on the covariance matrix can be useful for diagnosing weakly informative inputs within the GP-based learning paradigm. We support our prior development with two distinct empirical illustrations - one on synthetic data, and another on a real-world dataset.

2605.27076 2026-05-27 cs.MA cs.LG

Cost of Structural Learning Under Censored Feedback: A Threshold-Bandit Approach

审查反馈下结构学习的代价:一种阈值-老虎机方法

Michael Ledford, William Regli

AI总结 针对任务仅当联盟达到未知规模阈值时才产生奖励的审查反馈问题,提出阈值激活合作多臂老虎机模型,并通过集中式算法C-TAC实现O(log T)累积遗憾,以及去中心化事件触发协议D-TAC在保持可行性对齐的同时减少23倍通信。

详情
AI中文摘要

在许多多智能体应用中,任务仅当由满足未知规模阈值的联盟执行时才产生奖励;否则,反馈完全被审查。这种审查造成了可识别性问题:智能体无法区分随机失败与协调不足。我们将此设置形式化为阈值激活合作多臂老虎机(TAC-MAB),并在集中式和去中心化协调下进行分析。我们证明集中式算法(C-TAC)实现了累积遗憾O(log T),该遗憾分解为结构搜索项(捕获在审查反馈下解决可行性的代价)和统计监控项(用于价值估计)。然后我们引入D-TAC,一种去中心化事件触发协议,其中智能体仅在其结构信念改变时进行同步。实验表明,在保守信念融合下,D-TAC相对于集中式基线实现了23倍的通信减少,同时保持了可行性对齐。这些结果刻画了在审查反馈下学习的协调代价,并表明无需持续同步即可实现接近集中式的通信效率。

英文摘要

In many multi-agent applications, tasks yield rewards only when executed by a coalition meeting an unknown size threshold; otherwise, feedback is fully censored. This censorship creates an identifiability problem: agents cannot distinguish stochastic failure from insufficient coordination. We formalize this setting as the Threshold-Activated Cooperative Multi-Armed Bandit (TAC-MAB) and analyze it under both centralized and decentralized coordination. We show that a centralized algorithm (C-TAC) achieves cumulative regret O(log T), decomposed into a structural-search term that captures the cost of resolving feasibility under censored feedback and a statistical-monitoring term for value estimation. We then introduce D-TAC, a decentralized event-triggered protocol in which agents synchronize only when their structural beliefs change. Empirically, D-TAC achieves a 23x reduction in communication relative to the centralized baseline while preserving feasibility alignment under conservative belief fusion. These results characterize the coordination cost of learning under censored feedback and show that near-centralized communication efficiency is achievable without continuous synchronization.

2605.27051 2026-05-27 cs.SE cs.AI

ConVer: Using Contracts and Loop Invariant Synthesis for Scalable Formal Software Verification

ConVer:使用合约和循环不变式合成实现可扩展的形式化软件验证

Muhammad A. A. Pirzada, Weiqi Wang, Yiannis Charalambous, Konstantin Korovin, Lucas C. Cordeiro

AI总结 提出一种自上而下的组合验证工具ConVer,利用大语言模型合成函数合约,并通过CEGAR-CEGIS循环迭代精炼合约,以解决大规模C程序形式化验证中的状态空间爆炸问题。

Comments 12 pages; 6 figures

详情
AI中文摘要

大型C程序的形式化验证受到状态空间爆炸的阻碍:有界模型检验(BMC)工具必须通过展开所有嵌套结构来编码整个状态空间直至预定边界。我们提出了ConVer,一种自上而下的组合验证工具。给定一个带有顶层断言的C程序,ConVer自上而下地分解验证:它使用大语言模型(LLM)从系统属性中合成函数合约,然后在CEGAR-CEGIS循环中交替进行系统级和函数级检查,每当检查失败时通过SMART ICE学习精炼合约。我们在四个难度递增的基准测试套件上评估了ConVer,并与其他最先进(SOTA)工具进行了比较。在包含45个简单C程序的Frama-C基准测试中,ConVer在三个LLM后端上实现了82-96%的验证成功率,其中93-95%的收敛程序仅需一次CEGAR-CEGIS迭代。在X.509解析器基准测试(6个程序)和LF2C-Simple套件(17个程序)上,ConVer分别实现了33-50%和82-88%的成功率。在包含11个递归和循环密集型程序的VerifyThis套件上,预抽象策略实现了55-64%的成功率。此外,我们提出了ESBMC-LF,一个预处理工具,它将LF模型转换为C语言,同时保留LF文件的属性,使ConVer能够验证它们。我们使用ESBMC-LF将LF验证器基准测试转换为C语言;我们将这些称为LF-Hard。我们表明,ConVer总体上成功验证了67%的LF-Hard基准测试。

英文摘要

Formal verification of large C programs is impeded by state-space explosion: Bounded Model Checking (BMC) tools must encode the entire state space up to the predetermined bound by unrolling all nested constructs. We present ConVer, a top-down compositional verification tool. Given a C program with a top-level assertion, ConVer decomposes verification top-down: it uses a large language model (LLM) to synthesise function contracts from the system property, then alternates system-level and function-level checks in a CEGAR-CEGIS loop, refining contracts whenever a check fails via SMART ICE learning. We evaluate ConVer on four benchmark suites of increasing difficulty and against other state-of-the-art (SOTA) tools. On the Frama-C benchmark of 45 simple C programs, ConVer achieves 82-96% verification success across three LLM backends, with 93-95% of converged programs requiring only a single CEGAR-CEGIS iteration. On the X.509 parser benchmark (6~programs) and LF2C-Simple suite (17 programs), ConVer achieves 33-50% and 82-88% success respectively. On the VerifyThis suite of 11 recursive and loop-intensive programs, the Pre-Abstraction strategy achieves 55-64% success. In addition, we present ESBMC-LF a preprocessor tool that converts LF models to C while preserving the properties of the LF files, enabling ConVer to verify them. We transpile the LF Verifier Benchmarks using ESBMC-LF to C; we denote those LF-Hard. We show that ConVer successfully verifies 67% of LF-Hard benchmarks overall.

2605.27043 2026-05-27 stat.ML cs.LG stat.ME

Causal Representation Learning for Generalisable Recommendation

因果表示学习用于可泛化推荐

Yorgos Felekis, Michael O'Riordan, Oriol Corcoll, Ciarán M. Gilligan-Lee

AI总结 针对推荐系统中训练分布与部署分布不一致导致的泛化问题,提出基于因果表示学习的信息论解缠标准及其可计算变分下界,仅利用混淆日志即可提升模型在分布偏移下的泛化能力,在Spotify A/B测试、KuaiRand数据集和合成基准上验证了有效性。

详情
AI中文摘要

基于观测数据训练的预测模型在部署时往往无法泛化到所遇到的分布,尤其是当训练数据是被优化系统的产物时。推荐系统是一个典型例子:它们是在被部署策略、过去用户行为和平台过滤混淆的交互日志上训练的。因此,训练分布与在服务时评分的候选分布存在显著差异,这种差距使得离线指标无法可靠预测在线性能。我们通过一种受因果表示学习(CRL)启发的方法来解决分布偏移问题。我们提出了一种信息论解缠标准,并证明其最优值仅取决于输入的因果成分。然后,我们推导出一个可处理的变分下界,使得该标准仅从有限观测数据中即可优化。我们的方法范围比大多数CRL文献更窄,因为我们目标是改善分布偏移下的泛化能力,而非完全识别所有潜在因果因素。这个更窄的目标使得该方法实用,仅需要现有的混淆日志,适用于任何标准监督模型,且不增加推理时间成本。我们的主要评估是在Spotify上对数百万用户进行的A/B测试,应用于个性化播放列表生成的排序器。一个容量匹配的CRL变体在离线性能上相当,但在在线听众参与度上带来了显著提升。在公开的KuaiRand推荐数据集和具有已知因果结构的合成基准上的补充证据显示了相同模式:与基线离线持平,在分布偏移下获得收益。在所有三种设置中,加入我们的因果解缠目标都带来了更有意义的分布外泛化。

英文摘要

Predictive models trained on observational data often fail to generalise to the distributions they encounter when deployed, especially when the training data is a product of the system being optimised. Recommender systems are a canonical example: they are trained on interaction logs confounded by the deployed policy, past user behaviour, and platform filtering. As a result, the training distribution differs substantially from the candidate distribution scored at serving time, a gap that makes offline metrics unreliable predictors of online performance. We address the distribution shift problem with a method motivated by causal representation learning (CRL). We propose an information-theoretic disentanglement criterion and prove that its optimum depends only on the causal components of the input. We then derive a tractable variational lower bound that makes the criterion optimisable from finite observational data alone. The scope of our method is narrower than that of much of the CRL literature, in that we target better generalisation under distribution shift, not full identification of all latent causal factors. This narrower target is what makes the method practical, requiring only the existing confounded logs, applying to any standard supervised model, and adding no inference-time cost. Our headline evaluation is an A/B test with millions of users on Spotify, applied to a production ranker for personalised playlist generation. A capacity-matched CRL variant performed on par offline but delivered substantial online gains in listener engagement. Complementary evidence on the public KuaiRand recommendation dataset and a synthetic benchmark with known causal structure shows the same pattern: offline parity with baseline, gains under distribution shift. Across all three settings, adding our causal disentanglement objective yields meaningfully better out-of-distribution generalisation.

2605.27042 2026-05-27 cs.CR cs.AI

Lessons from Penetration Tests on Large-Scale Agent Systems

大规模智能体系统渗透测试的经验教训

Kevin Eykholt, Dhilung Kirat, Xiaokui Shu, Jiyong Jang, Frederico Araujo, Ian Molloy

AI总结 本文通过对2025年专有智能体产品的两次渗透测试,评估了AI智能体的安全态势是否有所改善,并指出许多安全漏洞并非全新,而是反映了先前计算系统中长期存在的重复性弱点类别。

Comments Accepted at SAGAI 2026

详情
AI中文摘要

随着AI系统获得越来越多的自主性和执行能力,发现的安全漏洞数量持续上升。然而,许多这些漏洞并非根本上的新颖,而是反映了先前计算系统中长期观察到的重复性弱点类别。具有执行能力的AI智能体实际上是无限的自修改程序,与计算栈的多个层进行广泛交互。这种广泛的交互表面给开发者带来了显著的安全负担,他们必须推理并保护复杂的跨层行为。先前的研究主要集中在开源智能体和智能体框架中的漏洞。相比之下,专有智能体系统——在更严格的编码标准和正式审查流程下开发——是否表现出类似的安全弱点仍不清楚。在本文中,我们展示了2025年对专有智能体产品进行的两次渗透测试的结果,并评估了自这些评估以来AI智能体的安全态势是否有所改善。

英文摘要

As AI systems gain increasing autonomy and execution capability, the number of discovered security vulnerabilities continues to rise. However, many of these vulnerabilities are not fundamentally novel, but instead reflect recurring classes of weaknesses long observed in prior computing systems. Execution-capable AI agents are effectively unbounded, self-modifying programs that interact extensively with multiple layers of the computing stack. This broad interaction surface imposes a significant security burden on developers, who must reason about and secure complex cross-layer behaviors. Prior research has primarily focused on vulnerabilities in open-source agents and agent frameworks. In contrast, it remains unclear whether proprietary agent systems -- developed under stricter coding standards and formal review processes -- exhibit similar security weaknesses. In this paper, we present findings from two penetration tests conducted in 2025 against proprietary agent products and evaluate whether the security posture of AI agents has improved since these assessments.

2605.27039 2026-05-27 eess.AS cs.SD

Why Can't They Remember? Uncovering Representation and Retrieval Bottlenecks in Multi-Turn Acoustic Memory

为什么它们记不住?揭示多轮声学记忆中的表征和检索瓶颈

Yang Xiao, Siyi Wang, Han Yin, Hong Jia, Vidhyasaharan Sethu, Eun-Jung Holden, Ting Dang

AI总结 本文通过引入EnvMem基准,发现大型音频语言模型在多轮交互中非语音信息记忆失败的主要原因是表征轨迹漂移,而非注意力分配不足。

详情
AI中文摘要

大型音频语言模型(LALMs)处理语音和环境声学线索,但在多轮交互中难以保留非语音信息。语义(语音)和声学(非语音)理解之间的性能差距仍未被充分理解,其表征和检索的底层机制尚不清楚。本文引入EnvMem,一个受控的多轮基准,用于研究这一差距并识别表征(即潜在嵌入)和检索层面(即注意力分配)失败的根源。我们进一步进行事后干预以探究表征结构和注意力动态。我们的结果揭示表征轨迹漂移是关键失败模式,同时表明注意力分配在解释观察到的退化中作用有限。总体而言,我们提供了一个系统框架,用于分析和改进长上下文LALMs中的非语言记忆,为未来鲁棒声学记忆建模的数据和训练设计提供启示。

英文摘要

Large audio language models (LALMs) process both speech and environmental acoustic cues, yet struggle to retain non-speech information across multi-turn interactions. The performance gap between semantic (speech) and acoustic (non-speech) understanding remains poorly understood, and the underlying mechanisms of representation and retrieval are still unclear. This work introduces EnvMem, a controlled multi-turn benchmark designed to study this gap and identify the root causes of failures at the representation (i.e., latent embeddings) and retrieval levels (i.e., attention allocation). We further conduct post-hoc interventions to probe representational structure and attention dynamics. Our results reveal representational trajectory drift as the key failure mode, while showing that attention allocation plays a limited role in explaining the observed degradation. Overall, we provide a systematic framework for analyzing and improving non-linguistic memory in long-context LALMs, shedding light on future data and training design for robust acoustic memory modeling.

2605.27014 2026-05-27 cs.LO cs.AI

ReasonOps: A Unified Operational Paradigm for Trustworthy Verified LLM Reasoning

ReasonOps: 可信验证的LLM推理的统一操作范式

Adnan Rashid

AI总结 本文提出ReasonOps,一种将推理视为持续监控、可验证、可靠性感知的操作过程的统一范式,整合语义解释、自动形式化、符号推理、定理证明、运行时保证、概率可靠性估计和自适应修正,以解决当前LLM推理中的逻辑不一致、幻觉符号转换等问题。

Comments 5 Pages

详情
AI中文摘要

大型语言模型(LLM)已将人工智能从主要生成系统转变为日益强大的推理代理。最近在定理证明、自动形式化、符号推理和工具增强语言模型方面的进展表明,在机器辅助形式推理方面取得了实质性进展。然而,当前的推理系统仍然存在隐藏的逻辑不一致、幻觉符号转换、无支持的定理应用以及有限可靠性保证。现有方法在形式验证、运行时保证、神经符号推理和可信人工智能(AI)研究社区之间仍然分散。本文介绍了ReasonOps,一种用于可信验证推理系统的统一操作范式。受DevOps和MLOps等操作生态系统的启发,ReasonOps将推理视为一个持续监控、可验证、可靠性感知的操作过程,而不是一个孤立的推理任务。所提出的范式将语义解释、自动形式化、符号推理、定理证明、运行时保证、概率可靠性估计和自适应修正整合到一个统一的推理生命周期中。本文进一步介绍了ReasonOps架构,使用自主制动系统分析示例演示了其工作流程,并讨论了其在未来安全关键自主AI系统中的潜在作用。我们认为,像ReasonOps这样的操作推理范式可能成为下一代可信AI生态系统的基础设施。

英文摘要

Large Language Models (LLMs) have transformed artificial intelligence from primarily generative systems into increasingly capable reasoning agents. Recent advances in theorem proving, autoformalization, symbolic reasoning, and tool-augmented language models demonstrate substantial progress toward machine-assisted formal reasoning. However, current reasoning systems still suffer from hidden logical inconsistencies, hallucinated symbolic transitions, unsupported theorem applications, and limited reliability guarantees. Existing approaches remain fragmented across formal verification, runtime assurance, neuro-symbolic reasoning and trustworthy Artificial Intelligence (AI) research communities. This paper introduces ReasonOps, a unified operational paradigm for trustworthy verified reasoning systems. Inspired by operational ecosystems such as DevOps and MLOps, ReasonOps treats reasoning as a continuously monitored, verifiable, reliability-aware operational process rather than an isolated inference task. The proposed paradigm integrates semantic interpretation, autoformalization, symbolic reasoning, theorem proving, runtime assurance, probabilistic reliability estimation, and adaptive correction into a unified reasoning lifecycle. The paper further presents the ReasonOps architecture, demonstrates its workflow using an autonomous braking system analysis example, and discusses its potential role in future safety-critical autonomous AI systems. We argue that operational reasoning paradigms such as ReasonOps may become foundational infrastructure for next-generation trustworthy AI ecosystems.

2605.26990 2026-05-27 stat.ML cs.LG

Constrained Bayesian Experimental Design via Online Planning

通过在线规划的约束贝叶斯实验设计

Yujia Guo, Daolang Huang, Xinyu Zhang, Sammie Katt, Samuel Kaski, Ayush Bharti

AI总结 提出一种结合离线预训练摊销策略和后验网络与在线多步前瞻规划(场景树)的方法,以在动态约束下优化贝叶斯实验设计,相比现有方法获得更优信息序列且计算开销适中。

Comments 24 pages, 9 figures. Accepted at the Forty-Third International Conference on Machine Learning (ICML 2026)

详情
AI中文摘要

贝叶斯实验设计(BED)是一个用于数据高效顺序实验设计的理论框架。然而,现有的BED方法无法适应实际任务中由于预算限制、成本变化或物理约束(限制设计随时间演化)而产生的动态约束。在本文中,我们介绍了一种新的BED方法,通过将离线预训练的摊销策略和后验网络与使用场景树的在线多步前瞻规划相结合,实现了实验设计的约束优化。我们通过实验证明,在多种约束BED任务中,我们的方法相比现有方法产生了更信息丰富的设计序列,同时仅增加了适度的额外计算开销。

英文摘要

Bayesian experimental design (BED) is a principled framework for data-efficient design of sequential experiments. However, existing BED methods are unable to adapt to dynamic constraints inherent in real-world tasks due to budget limitations, varying costs, or physical constraints that restrict how designs evolve over time. In this paper, we introduce a novel approach to BED that enables constrained optimization of experimental designs by combining offline pre-training of an amortized policy and a posterior network with online multi-step lookahead planning using scenario trees. We empirically demonstrate that our method yields substantially more informative design sequences than existing methods across a range of constrained BED tasks, while incurring only a modest additional computational overhead.

2605.26973 2026-05-27 stat.ML cond-mat.dis-nn cs.LG cs.NE q-bio.NC

Signal-to-Noise Ratio and Sample Size Govern Representational Alignment in Neural Networks

信噪比与样本量控制神经网络中的表征对齐

Ali Hussaini Umar, Alessandro Laio

AI总结 通过理论和实验证明,信噪比和训练样本量以单调和非单调方式分别影响神经网络表征对齐,且对齐程度在插值阈值附近最小,与泛化误差解耦。

详情
AI中文摘要

已知神经网络会发展出潜在表征,这些表征是$对齐$的,即在不同架构、训练协议或训练数据集训练的网络之间结构相似。我们在一个受控环境中研究这一现象,使用被噪声过程的独立实现扰动的训练集,训练一组网络执行回归和分类任务。我们表明,信噪比(SNR)和训练样本量以定性相似的方式影响对齐,无论是在真实世界数据集上训练的网络,还是在极其简单的具有单个隐藏层的$线性$网络中(其对齐可以解析估计)。在线性和非线性网络、回归和分类任务以及合成和真实数据中,我们一致观察到,对齐随SNR单调变化,但随训练样本量非单调变化。特别地,对齐在插值阈值附近最小,且更强的对齐不一定对应更好的泛化误差。这些发现揭示了数据质量和数量对对齐的非平凡依赖关系,且与泛化性能解耦。

英文摘要

Neural networks are known to develop latent representations that are $aligned$, namely structurally similar across networks trained with different architectures, training protocols, or training datasets. We study this phenomenon in a controlled setting, where we train an ensemble of networks on regression and classification tasks using training sets perturbed by independent realizations of a noise process. We show that the signal-to-noise ratio (SNR) and the training sample size influence the alignment in qualitatively similar ways in networks trained on real-world datasets and in an extremely simple $linear$ network with a single hidden layer, for which the alignment can be estimated analytically. Across linear and nonlinear networks, regression and classification tasks, and both synthetic and real-world data, we consistently observe that alignment varies monotonically with SNR but non-monotonically with training sample size. In particular, the alignment is minimized near the interpolation threshold, and a stronger alignment does not necessarily correspond to better generalization error. These findings reveal a non-trivial dependence of alignment on data quality and quantity, decoupled from generalization performance.

2605.26925 2026-05-27 quant-ph cs.LG

Adaptive Reinforcement Learning for Robust Open Quantum System Control: A Multi-Task Framework with Temporal Optimization

自适应强化学习用于鲁棒开放量子系统控制:一种带有时间优化的多任务框架

Haftu W. Fentaw, Steve Campbell, Simon Caton

AI总结 提出一种多任务软演员-评论家(SAC)强化学习框架,用于开放量子系统控制,同时学习最优脉冲序列并发现特定问题的演化时间T和控制脉冲段数N,在51种哈密顿量变化下实现高保真度状态转移,并展现出优于GRAPE的鲁棒性。

详情
AI中文摘要

我们提出了一种多任务软演员-评论家(SAC)强化学习框架,用于跨不同哈密顿量的开放系统量子控制,该框架学习最优脉冲序列,同时发现特定问题的演化时间T和控制脉冲段数N。在51种哈密顿量变化上的实验结果表明,多任务SAC模型能够生成控制脉冲,在环境噪声下将系统从初始状态驱动到目标状态,并具有高保真度,为适用于实际噪声量子器件的通用量子控制奠定了必要基础。通过逐步扩展训练哈密顿量集,我们研究了使用给定数量样本哈密顿量训练的单个多任务模型是否能够成功完成来自同一哈密顿量空间但训练中未遇到的哈密顿量的状态转移任务。此外,我们的鲁棒性不保真度度量(RIM)分析表明,与GRAPE优化的控制相比,SAC训练的策略对脉冲幅度扰动和退相干率变化表现出更优越的鲁棒性。

英文摘要

We present a Multi-task Soft Actor-Critic (SAC) Reinforcement Learning framework designed for open-system quantum control across diverse Hamiltonians, which learns optimal pulse sequences while simultaneously discovering problem-specific evolution time T and number of control pulse segments N. Experimental results across 51 Hamiltonian variations demonstrate that the multi-task SAC model is able to generate control pulses that can drive a system, under environment noise, from its initial state to its target state with high fidelities, establishing essential foundations for universal quantum control applicable to realistic noisy quantum devices. Through progressive expansion of the training Hamiltonian set, we investigate if a single multi-task model trained using a given number of sample Hamiltonians can successfully accomplish state-transfer tasks for Hamiltonians drawn from the same Hamiltonian space but not encountered during training. In addition, our Robustness Infidelity Measure (RIM) analysis reveals that SAC trained policies exhibit superior robustness to pulse amplitude perturbations and decoherence rate variations compared to GRAPE-optimized controls.

2605.26898 2026-05-27 cs.SE cs.AI

Strategies for Guiding LLMs to Use Software Design Patterns: A Case of Singleton

引导LLM使用软件设计模式的策略:以单例模式为例

Viktor Kjellberg, Farnaz Fotrousi, Miroslaw Staron

AI总结 通过实验比较四种提示策略(指令、二元自动反馈、详细自动反馈、少样本详细反馈),评估13个LLM在164个Java编码挑战中生成遵循单例模式的代码的能力,发现迭代二元反馈在保持或提升功能性的同时最佳地实现了单例模式对齐。

Comments Accepted at PROMISE 2026

详情
AI中文摘要

大型语言模型(LLM)可以从自然语言提示生成功能性源代码,但往往无法一致地遵循更高级别的架构结构或设计模式。由于LLM在软件工程中的应用日益增多,它们将既定设计原则应用于生成代码的能力对于软件产品的长期成功至关重要。因此,本文的目标是确定引导LLM将设计模式融入生成源代码的策略。我们设计了一个计算实验,评估13个LLM生成遵循单例设计模式的代码的能力,使用了四种提示策略:指令、二元自动反馈、详细自动反馈以及带少样本提示的详细反馈,在HumanEval-X的164个Java编码挑战中进行。我们的结果表明,引导LLM包含设计模式的最佳策略在很大程度上取决于模型类型。尽管如此,总体而言,迭代二元反馈在保持或改善代码功能性的同时,提供了与单例模式的最佳对齐。通过指令引导,Llama 3.3在100%的情况下生成了单例类,并改善了代码功能性,使通过的测试数量增加了34.1个百分点。通过指令和二元反馈引导,它取得了类似的结果。Qwen 3(8B)使用二元反馈将单例模式对齐度提高到99.2%,功能性提高到58.6%。我们的结果表明,即使是简单的策略也可以用于引导LLM使用设计模式。

英文摘要

Large Language Models (LLMs) can generate functional source code from natural-language prompts, but often fail to consistently follow higher-level architectural structures or design patterns. Since LLMs are increasingly used in software engineering, their ability to apply established design principles to generated code is crucial to the long-term success of software products. Therefore, the goal of this paper is to identify strategies for guiding LLMs to incorporate design patterns into the generated source code. We designed a computational experiment to evaluate the ability of 13 LLMs to generate code that follows the Singleton design pattern, using four prompting strategies: instructions, binary automated feedback, extensive automated feedback, and extensive feedback with few-shot prompts, in 164 Java coding challenges from HumanEval-X. Our results shows that the optimal strategy to guide LLMs to include design patterns depends heavily on the type of model. Still, overall, iterative binary feedback provides the best alignment with Singleton while preserving or improving the code's functionality. With guiding with instructions, Llama 3.3 generated Singleton classes in 100% of cases and improved code functionality, increasing the number of tests passed by 34.1 percentage points. It achieved a similar result with guidance through instructions and binary feedback. Qwen 3 (8B) increased the alignment with Singleton to 99.2% and the functionality to 58.6% using binary feedback. Our result suggests that even simple strategies can be used to guide LLMs to use design patterns.

2605.26886 2026-05-27 cs.DS cs.LG

Parsimonious Learning-Augmented Online Metric Matching

简约学习增强的在线度量匹配

Yongho Shin, Phanu Vajanopath

AI总结 针对在线度量匹配问题,提出一种简约学习增强算法,通过虚拟预测填补缺失预测,并建立性能下界,实验验证了其有效性。

Comments To appear in ICML 2026

详情
AI中文摘要

近年来,学习增强算法受到了广泛关注,尤其是在在线优化领域。由于生成预测的高计算成本,越来越多的研究关注于学习增强算法中性能保证与预测使用数量之间的权衡,例如缓存和度量任务系统问题。在本文中,我们将这一研究方向扩展到在线度量匹配,开发了简约学习增强算法并建立了其性能下界。我们的方法将“跟随预测”框架扩展到简约设置,通过在缺乏实际预测时使用一种在线度量匹配算法来填充虚拟预测,该算法在执行过程中保持良好中间匹配。我们通过实证评估补充了理论结果,证明了我们方法的实际有效性。

英文摘要

Learning-augmented algorithms have received significant attention in recent years, particularly in the context of online optimization. Motivated by the high computational cost of generating predictions, a growing line of work studies the tradeoff between performance guarantees and the number of predictions used in learning-augmented algorithms for problems such as caching and metrical task systems. In this paper, we extend this line of research to online metric matching by developing parsimonious learning-augmented algorithms and establishing lower bounds on their performance. Our approach extends the Follow-the-Prediction framework to the parsimonious setting by filling in a virtual prediction in the absence of an actual prediction, using an online metric matching algorithm that maintains good intermediate matchings throughout its execution. We complement our theoretical results with an empirical evaluation, demonstrating the practical effectiveness of our approach.

2605.26870 2026-05-27 cs.MA cs.AI cs.HC

Persistent AI Agents in Academic Research: A Single-Investigator Implementation Case Study

学术研究中的持久性AI智能体:单研究者实施案例研究

Anas H. Alzahrani

AI总结 通过单研究者案例研究,分析了持久性AI智能体在真实学术环境中的架构、使用、产出和治理,发现缓存主导的工作流可能将经济单位从每token成本转向每完成工件成本。

Comments 19 pages, 2 figures, 3 main tables; supplementary appendix with 6 tables, 2 figures, and a reproducibility methods section. Describes 17 configured agents in a persistent research environment and introduces the PARE-M (Persistent Agentic Research Environment Measurement) framework

详情
AI中文摘要

背景:大型语言模型通常作为模型、基准或简短对话片段进行评估。当智能体持久嵌入真实学术研究环境,具有持久记忆、本地文件、外部工具、计划例程、委派角色和明确安全协议时,会发生什么知之甚少。方法:从2026年1月31日至5月25日进行了一项结构化自我观察的实施案例研究。分析单元是持久的人-智能体环境:研究者、智能体运行时、记忆层、工具、仓库、计划任务、专门智能体角色和治理规则。结果使用PARE-M(持久智能体研究环境测量)组织,这是一个涵盖架构、利用、工件生产、资源使用、可重复性和治理的测量框架。结果:可恢复的主智能体遥测包含96个活跃日中的75,671条去重记录,其中8,059条用户角色消息和23,710条助手角色消息。工作空间包括502个记忆相关文件、17个配置的智能体目录和57个技能文件。活跃系统时间为579.7小时(30分钟上限间隙估计)。记忆衍生记录识别出482个输出代理事件和889个失败、验证、纠正或协议代理事件。一个严格的2026年5月轨迹子集捕获了627个模型完成事件和73.95百万记录token,其中82.9%为缓存读取。结论:工作流以缓存为主导,表明持久智能体环境可能将经济单位从每token成本转向每完成工件成本。未来评估应使用工件级分母、可重复解析规则、纠正分类法和治理事件的独立编码。

英文摘要

Background: Large language models are typically evaluated as models, benchmarks, or short conversational episodes. Less is known about what happens when an agent is embedded persistently in a real academic research environment with durable memory, local files, external tools, scheduled routines, delegated roles, and explicit safety protocols. Methods: A structured self-observed implementation case study was conducted from January 31 to May 25, 2026. The unit of analysis was the persistent human-agent environment: researcher, agent runtime, memory layer, tools, repositories, scheduled jobs, specialized agent roles, and governance rules. Outcomes were organized using PARE-M (Persistent Agentic Research Environment Measurement), a measurement framework covering architecture, utilization, artifact production, resource use, reproducibility, and governance. Results: Recoverable main-agent telemetry contained 75,671 de-duplicated records across 96 active days, with 8,059 user-role and 23,710 assistant-role messages. The workspace included 502 memory-related files, 17 configured agent directories, and 57 skill files. Active system time was 579.7 hours (30-minute capped-gap estimate). Memory-derived records identified 482 output-proxy events and 889 failure, verification, correction, or protocol-proxy events. A strict May 2026 trajectory subset captured 627 model-completed events and 73.95 million recorded tokens, of which 82.9% were cache reads. Conclusions: The workflow was cache-dominant, suggesting that persistent agentic environments may shift the economic unit from cost per token to cost per completed artifact. Future evaluations should use artifact-level denominators, reproducible parsing rules, correction taxonomies, and independent coding of governance events.

2605.26856 2026-05-27 q-bio.NC cs.AI cs.RO

The Sensation Modulating Network:Haltability as the architectural ground for object-directed phenomenology

感觉调节网络:可停性作为对象导向现象学的架构基础

G. Nagarjuna, Durgaprasad Karnam

AI总结 本文提出感觉调节网络(SMN)作为具身认知的架构,通过对手动力学和可停性机制,将对象导向现象学(胡塞尔意义)的意向性建立在身体组织的结构特征上,从而调和认知主义与4E认知的争论。

Comments 64 pages, main body 38 pages + References 6, Appendices 20 pages, Tables 3, and Figures 21

详情
AI中文摘要

认知科学仍然分裂为认知主义——它解释了递归和语言,但无法将形式符号扎根于意义——和4E方法——它将认知扎根于身体,但很少详细说明身体的架构以支持生成性。我们认为这一僵局源于对具身代理架构的不完整描述,并提出一个架构:感觉调节网络(SMN),即认知代理被构想为整个身体,在每个解剖尺度上由对手动力学组织,由感觉调节器构建,这些调节器通过一个基底感知和行动,配对成协调动作区,由全身广播网络路由。三个承诺赋予了SMN其效力。可停性——将对抗性可供性招募到共激活平衡中——提供了对象导向现象学(在胡塞尔意义上)所需的架构位置:对手性使得共激活成为可能,共激活使得停止成为可能,停止使得注意成为可能,注意使得意向指向成为可能,而无需在顶层添加任何模块。可自我调节动作模式(SMAP)的双信号特性使得自我/世界区分成为布线的结构特征,而非代理应用的范畴。四级动作模式层级——基础、可停、可协商、交易——提供了从自主规律性到公共惯例化的单一轨迹,将基于语法的生成性条件定位为架构转变。SMN调和了认知主义与4E的争论:递归存在于可协商动作模式的可修改动力学中,具身性存在于支持它们的对手基底中。附录中给出了一个初步的形式化方法和八个预测寄存器(七个可测试,一个假设性),以及参考模拟。

英文摘要

Cognitive science remains split between cognitivism - which accounts for recursion and language but cannot ground formal symbols in meaning - and 4E approaches - which ground cognition in the body but rarely specify the body's architecture in enough detail to support generativity. We argue the impasse stems from an incomplete account of the embodied agent's architecture, and propose one: the Sensation Modulating Network (SMN), the cognitive agent conceived as the whole body, organized at every anatomical scale by opponent dynamics, built from Sensation Modulators that sense and act through one substrate, paired into Coordinated Action Zones routed by a body-wide broadcast network. Three commitments give the SMN its purchase. Haltability - the recruitment of antagonistic affordance into co-activated equilibrium - provides the architectural locus that object-directed phenomenology, in Husserl's sense, requires: opponency enables co-activation, co-activation enables halt, halt enables attention, attention enables intentional directedness, with no module added on top. The dual-signal property of self-modulatable action patterns (SMAPs) makes the self/world distinction a structural feature of the wiring rather than a category the agent applies. And a four-level action-pattern hierarchy - Basal, Haltable, Negotiable, Transactional - gives a single trajectory from autonomic regularity to public conventionalization, locating the conditions for grammar-grounded generativity as architectural transitions. The SMN reconciles the cognitivism-4E debate: recursion lives in the modifiable dynamics of Negotiable Action Patterns, embodiment in the opponent substrate that supports them. A tentative formalism and eight predicted registers (seven testable, one hypothetical), with reference simulations, are given in an appendix.

2605.26821 2026-05-27 hep-ph cs.LG hep-ex

Particle-Lund Multimodality in Jet Taggers

喷注标记器中的粒子-拉普兰多模态

Loukas Gouskos, Benedikt Maier

AI总结 提出PLuM多模态架构,联合处理粒子成分与拉普兰平面分裂,通过交叉注意力机制研究显式QCD层次结构是否补充原始粒子表示,发现对顶夸克和H→bb标记有系统性提升,在HH(4b)分析中背景抑制提高25%。

详情
AI中文摘要

拉普兰平面提供了喷注内QCD辐射的物理动机层次表示,而基于变换器的标记器通过直接从原始粒子成分及其成对关系中学习达到了最先进的性能。我们研究变换器是否从成分级输入隐式捕获层次QCD结构,或者显式物理表示是否仍然具有互补性。为了测试这一点,我们引入了PLuM,一种多模态架构,将粒子成分和拉普兰平面分裂投影到共享潜在空间,并用统一变换器联合处理两者。交叉注意力允许模型探测结构化QCD信息是否提供了超出粒子单独编码的区分能力。我们观察到顶夸克和H→bb标记的系统性增益,而在H→cc或H→4q拓扑中没有发现可比改进。这种选择性增强表明,即使在高度表达性的架构中,关于b喷注形成的显式层次信息仍然与原始粒子表示互补,而其他拓扑已经在成分级被很好地捕获。对于高影响LHC分析,如洛伦兹增强的双希格斯玻色子搜索中的四b夸克末态(HH(4b)),增益显著:在25%的双希格斯效率工作点,PLuM的背景抑制比基线高25%。我们的结果表明,在变换器时代,QCD辐射的物理结构化表示仍然保留区分价值,激励进一步研究深度学习算法如何编码喷注动力学的不同方面。

英文摘要

The Lund plane offers a physics-motivated, hierarchical representation of QCD radiation within jets, while transformer-based taggers have reached state-of-the-art performance by learning directly from raw particle constituents and their pairwise relations. We investigate whether transformers implicitly capture hierarchical QCD structure from constituent-level inputs, or whether explicit physics representations remain complementary. To test this, we introduce PLuM, a multimodal architecture that projects particle constituents and Lund plane splittings into a shared latent space, processing both jointly with a unified transformer. Cross-attention allows the model to probe whether structured QCD information provides discriminating power beyond what particles alone encode. We observe systematic gains for top-quark and $\mathrm{H}\to\mathrm{b}\bar{\mathrm{b}}$ tagging, while finding no comparable improvement for $\mathrm{H}\to\mathrm{c}\bar{\mathrm{c}}$ or $\mathrm{H}\to 4\mathrm{q}$ topologies. This selective enhancement suggests that explicit hierarchical information about b-jet formation remains complementary to raw particle representations even in highly expressive architectures, while other topologies are already well-captured at constituent level. For high-impact LHC analyses such as Lorentz-boosted di-Higgs searches in the four $\mathrm{b}$ quark final state ($\mathrm{H}\mathrm{H}(4\mathrm{b})$), the gains are substantial: at a $25\%$ di-Higgs efficiency working point, PLuM achieves $25\%$ higher background rejection than the baseline. Our results indicate that physically structured representations of QCD radiation retain discriminating value in the transformer era, motivating further study into how different aspects of jet dynamics are encoded by deep learning algorithms.

2605.26819 2026-05-27 cs.IR cs.AI

RAGEAR: Retrieval-Augmented Graph-Enhanced Academic Recommender

RAGEAR: 检索增强的图增强学术推荐器

Francesco Granata, Lorenzo Lamazzi, Misael Mongiovì, Francesco Poggi, Valeria Secchini

AI总结 提出RAGEAR,一种神经符号推荐系统,结合密集检索和知识图谱,通过图感知聚合函数将片段级证据传播到课程级推荐,在学术课程推荐中优于元数据基线。

详情
AI中文摘要

我们提出了RAGEAR(检索增强的图增强学术推荐器),一种用于学术课程推荐的神经符号推荐系统。RAGEAR将完整讲座转录本的密集检索与符号知识图谱相结合,该图谱建模课程、课程、转录本片段、学分、学习计划和课程信息。知识图谱支持基于结构化约束(如学分、学科、学习计划和先修课程)的符号过滤和情境化。与基于元数据的方法不同,它通过检索与学生查询语义对齐的转录本片段来利用细粒度的教学内容。主要贡献是一种图感知聚合函数,它将片段级证据传播到课程级推荐。得分结合了三个因素:与课程相关的检索相似性份额、其相关片段的基于排名的强度以及证据在课程间的分布。我们通过人工评估样本和大规模基于LLM的相关性评估,在152个学生类查询上评估了RAGEAR。结果表明,讲座转录本优于仅元数据检索,并且RAGEAR进一步提高了基于转录本的归一化SumP基线的排名质量,尤其是在排名靠前的推荐中。

英文摘要

We present RAGEAR (Retrieval-Augmented Graph-Enhanced Academic Recommender), a neurosymbolic recommender system for academic course recommendation. RAGEAR combines dense retrieval over full lecture transcripts with a symbolic Knowledge Graph modelling courses, lessons, transcript chunks, credits, study plans, and curricular information. The Knowledge Graph supports symbolic filtering and contextualisation based on structured constraints, such as credits, academic disciplines, study plans, and prerequisites. Unlike metadata-based approaches, it exploits fine-grained instructional content by retrieving transcript chunks semantically aligned with a student's query. The main contribution is a graph-aware aggregation function that propagates chunk-level evidence to course-level recommendations. The score combines three factors: the share of retrieved similarity associated with a course, the rank-based strength of its relevant chunks, and the distribution of evidence across lessons. We evaluate RAGEAR on 152 student-like queries through a human evaluation sample and a large-scale LLM-based relevance assessment. Results show that lecture transcripts improve over metadata-only retrieval, and that RAGEAR further improves ranking quality over a transcript-based normalized SumP baseline, especially for top-ranked recommendations.

2605.26807 2026-05-27 cs.SE cs.AI

HTMLCure: Turning Browser Experience into State Guided Repair for Interactive HTML

HTMLCure:将浏览器体验转化为面向交互式HTML的状态引导修复

Jiajun Wu, Jian Yang, Tuney Zheng, Wei Zhang, Haowen Wang, Yihang Lou, Xianglong Liu

AI总结 提出HTMLCure框架,通过浏览器交互执行、状态感知诊断和闭环修复引擎,从大规模HTML页面中筛选并修复可修复页面,显著提升SFT数据质量和模型性能。

Comments 27 pages, 11 figures. Code: https://github.com/wuyuVerse/HTMLCure

详情
AI中文摘要

LLM现在可以生成完整的HTML页面,但其中许多页面仅在表面上正确:它们渲染一次,然后在滚动、悬停、点击、调整大小或游戏过程中失败。基于截图的评估可能遗漏这些失败,而过滤会丢弃许多仍然可修复的页面。我们引入了HTMLCure,一个浏览器体验框架,在系统与页面交互后评估HTML。评估器跨视口和交互状态执行页面,记录确定性的浏览器证据,并向VLM提供来自执行轨迹的精选关键帧,而非孤立截图。相同的状态信号驱动闭环修复引擎:HTMLCure诊断当前页面,选择特定状态的修复家族,再次运行每个候选页面,并导出质量清理后的页面用于SFT。在97K提示语料库上,这将直接可用的种子扩展为63703个质量清理页面的候选池,从中我们构建了最终的40K页面精炼SFT集。在相同骨干和训练方案下,HTMLCure-27B-Refined在HTMLBench-400上达到50.6分,确定性测试用例通过率为45.2%,与Kimi-K2.6和GPT-5.4等强参考行处于相同性能区间。在发布的MiniAppBench验证集上,它达到81.2的平均分,比原始27B SFT提高15.3分,接近强参考系统的水平。

英文摘要

LLMs can now produce full HTML pages, but many of those pages are only superficially correct: they render once, then fail under scroll, hover, click, resize, or gameplay. Evaluation from screenshots can miss these failures, and filtering discards many pages that are still repairable. We introduce HTMLCure, a browser experience framework that evaluates HTML after the system has interacted with it. The evaluator executes the page across viewports and interaction states, records deterministic browser evidence, and gives the VLM curated keyframes from the executed trajectory rather than isolated screenshots. The same state signal drives a closed loop repair engine: HTMLCure diagnoses the current page, chooses a state specific repair family, runs each candidate again, and exports quality cleared pages for SFT. On a 97K prompt corpus, this expands the directly usable seed into a candidate pool of 63703 quality cleared pages, from which we construct the final refined SFT set of 40K pages. Under the same backbone and training recipe, HTMLCure-27B-Refined reaches 50.6 on HTMLBench-400 with 45.2% deterministic test case pass, placing it in the same performance band as strong reference rows such as Kimi-K2.6 and GPT-5.4. On the released MiniAppBench validation split, it reaches 81.2 average, improving raw 27B SFT by 15.3 points and approaching the level of strong reference systems.

2605.26786 2026-05-27 cs.CY cs.AI cs.LG

Implementation of Big Data Analytics for Diabetes Management: Needs Assessment in the Rwanda Healthcare System

大数据分析在糖尿病管理中的应用:卢旺达医疗系统需求评估

Silas Majyambere, Tony Lindgren, Workneh Y. Ayele, Celestin Twizere

AI总结 本研究通过利益相关者研讨会评估卢旺达医疗系统采用大数据分析管理糖尿病的准备情况,并提出了一个基于可解释机器学习模型的实用框架。

详情
AI中文摘要

糖尿病是一种慢性代谢疾病,如果不及早诊断和管理,可能导致严重的健康问题。大数据分析和机器学习为分析大型健康数据集、支持早期发现和更好的治疗决策提供了实用工具。然而,它们在常规临床实践中的使用仍然有限。本研究考察了卢旺达医疗系统采用大数据分析管理糖尿病的准备情况。随着该国不断扩大电子病历和健康信息系统的使用,改善预测、监测和临床决策的新机遇随之出现。我们举办了一个为期五天的研讨会,涉及25名关键利益相关者,包括临床医生、数据管理员、政策制定者、医学研究人员、营养学家和技术提供商,以评估准备情况并识别现有差距。研究结果突出了大数据分析实施的潜力和主要挑战。基于这些结果,本文提出了一个实用的大数据分析框架,利用可解释的机器学习模型支持糖尿病管理策略。

英文摘要

Diabetes is a chronic metabolic disease that can lead to serious health problems if not diagnosed and managed early. Big Data Analytics (BDA) and machine learning offer practical tools for analyzing large health datasets and supporting early detection and better treatment decisions. However, their use in routine clinical practice is still limited. This study examines the readiness of Rwanda's healthcare system to adopt big data analytics for diabetes management. As the country continues to expand its use of electronic medical records and health information systems, new opportunities arise for improving prediction, monitoring, and clinical decision-making. A five-day workshop involving 25 key stakeholders, including clinicians, data managers, policymakers, medical researchers, nutritionists, and technology providers, was conducted to assess preparedness and identify existing gaps. The findings highlight both the potential and the main challenges of BDA implementation. Based on these results, the paper proposes a practical BDA framework to support diabetes management strategies using explainable machine learning models.

2605.26769 2026-05-27 cs.CY cs.AI

Generative artificial intelligence and the marginalization of minoritized knowledges in higher education: the case of disability

生成式人工智能与高等教育中少数群体知识的边缘化:以残疾为例

Fatiha Tali-Otmani

AI总结 研究通过教育科学、批判技术研究和残疾研究,揭示生成式人工智能如何通过以英语和西方为中心的训练数据集强化认知殖民性,导致残疾人群体的双重边缘化,并探讨研究者与机器混合以维护认知多样性的可能性及其结构性限制。

详情
AI中文摘要

生成式人工智能通过重构科学知识的生产和验证过程,重新定义了高等教育。这些系统并非中立;它们积极促进了非霸权认识论的边缘化。本研究借鉴教育科学、批判技术研究和残疾研究,证明训练数据集(主要来自英语和西方中心)强化了认知殖民性。残疾人的情况特别清晰地说明了这一现象。技术架构常常将这些个体限制在刻板的刻板印象中,或将他们排除在设计过程之外,导致双重边缘化。本文探讨了研究者与机器之间的混合是否可能维护认知多样性,同时承认当算法校正作为纯粹姑息策略时固有的结构性限制。

英文摘要

Generative artificial intelligence redefines higher education by restructuring the processes through which scientific knowledge is produced and validated. These systems are not neutral; they actively contribute to the marginalization of non-hegemonic epistemologies. This research draws upon educational sciences, critical technology studies, and disability studies to demonstrate that training datasets, which remain predominantly Anglophone and Western-centric, reinforce epistemic coloniality. The situation of persons with disabilities provides a particularly clear illustration of this phenomenon. Technological architectures frequently confine these individuals to reductive stereotypes or exclude them from the design process, leading to a double marginalization. This article examines whether a hybridization between the researcher and the machine might preserve epistemic plurality, while acknowledging the structural limitations inherent in algorithmic correction when used as a purely palliative strategy.

2605.26754 2026-05-27 cs.CR cs.AI

Cordon-MAS: Defending RAG against Knowledge Poisoning via Information-Flow Control

Cordon-MAS:通过信息流控制防御 RAG 的知识投毒

Zhe Yu, Wenpeng Xing, Gaolei Li, Shuguang Xiong, Hongzhi Wang, Xuyang Teng, Meng Han

AI总结 针对检索增强生成(RAG)中的 Confundo 式投毒攻击,提出 Cordon-MAS 框架,通过分离证据提取、跨源审计和答案合成到具有非对称内存权限的智能体中,将攻击成功率相对降低 92.4%,将投毒问题从检测重新定义为信息流控制。

详情
AI中文摘要

检索增强生成(RAG)日益支撑着高风险应用,但仍易受到 Confundo 式投毒攻击,其中对抗性优化的文档操纵生成的输出。现有防御假设检测到中毒证据即可防止危害。我们证明这一假设不正确:模型存在监控-控制差距——它们可以检测到检索证据中的矛盾,但仍会依据中毒声明行动。我们引入 Cordon 原则——任何能够进行最终合成的智能体都不得访问不可信的自然语言证据——并通过 CORDON-MAS 实现该原则,这是一个隔离框架,通过将证据提取、跨源审计和答案合成分离到具有非对称内存权限的智能体中,在架构上强制执行该原则。在五个 BEIR 数据集上,CORDON-MAS 相对于未防御的 RAG 将攻击成功率降低了 92.4%。这将 RAG 投毒问题从检测问题重新定义为信息流控制问题。

英文摘要

Retrieval-augmented generation (RAG) increasingly underpins high-stakes applications, yet remains vulnerable to Confundo-style poisoning where adversarially optimized documents manipulate generated outputs. Existing defenses assume that detecting poisoned evidence prevents harm. We show this assumption is incorrect: models exhibit a monitoring-control gap -- they can detect contradictions in retrieved evidence yet still act on poisoned claims. We introduce the Cordon Principle -- no agent capable of final synthesis may access untrusted natural-language evidence -- and realize it through CORDON-MAS, a compartmentalized framework that enforces this principle architecturally by separating evidence extraction, cross-source audit, and answer synthesis into agents with asymmetric memory privileges. Across five BEIR datasets, CORDON-MAS reduces attack success rate by 92.4\% relative to undefended RAG. This reframes RAG poisoning from a detection problem to an information-flow control problem.