arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.02643 2026-06-03 cs.CR cs.AI cs.DB

Inference Cost Attacks for Retrieval-Augmented Large Language Models

检索增强型大语言模型的推理成本攻击

Chengliang Liu, Liangbo Ning, Yujuan Ding, Wenqi Fan

发表机构 * The Hong Kong Polytechnic University（香港理工大学）

AI总结提出RA-ICA攻击范式，通过向外部知识库注入恶意文档，利用CREEP框架和MA-GRPO算法，使RAG增强的LLM系统推理时token消耗增加高达13.12倍且成功率超过90%。

Comments Accepted at The ACM Web Conference 2026 (WWW '26)

详情

DOI: 10.1145/3774904.3792683
Journal ref: Proceedings of the ACM Web Conference 2026 (WWW '26), April 13-17, 2026, Dubai, United Arab Emirates

AI中文摘要

检索增强生成（RAG）增强的LLM系统虽然强大，但由于包含额外的多阶段流水线（动态检索和综合外部知识源的信息），引入了大量的推理成本。这种高运营成本暴露了一个关键漏洞，即推理成本攻击（ICA）。然而，现有的ICA通常依赖于直接提示操纵的不切实际的假设。我们认为，对RAG增强的LLM系统更可行且更强大的威胁来自污染外部知识库（例如，来自互联网的网络知识）。在这项工作中，我们引入了检索增强推理成本攻击（RA-ICA），这是一种新颖的攻击范式，通过向外部知识语料库注入恶意文档来针对RAG增强的LLM系统的计算成本。为了实现这种攻击，我们提出了通过外部投毒耗尽计算资源（CREEP），这是一种新颖的框架，利用LLM代理自动制作恶意文档，这些文档在语义上相关以便检索，并且能够有效诱导推理阶段token消耗的异常增加。为了提高攻击的有效性，我们引入了记忆增强组相对策略优化（MA-GRPO），这是一种新颖的强化学习算法，通过从历史最佳对抗文档的动态记忆中学习来微调代理。在三个真实世界数据集上的大量实验表明，RA-ICA在不降低生成答案完整性的情况下，将token消耗增加了高达13.12倍，成功率超过90%。

英文摘要

Retrieval-Augmented Generation (RAG)-enhanced LLM systems, while powerful, introduce substantial inference costs due to the inclusion of an extra multi-stage pipeline that dynamically retrieves and synthesizes information from external knowledge sources. This high operational cost exposes a critical vulnerability to Inference Cost Attacks (ICAs). However, existing ICAs often rely on the impractical assumption of direct prompt manipulation. We argue that a more feasible and potent threat to RAG-enhanced LLM systems arises from poisoning external knowledge bases (e.g., web knowledge from the Internet). In this work, we introduce the Retrieval-Augmented Inference Cost Attack (RA-ICA), a novel attacking paradigm that targets the computational cost of RAG-enhanced LLM systems by injecting malicious documents into external knowledge corpus. To operationalize this attack, we propose Computational Resource Exhaustion via External Poisoning (CREEP), a novel framework that leverages LLM agents to automatically craft malicious documents that are both semantically relevant for retrieval and potent for inducing an abnormal increase in token consumption during the inference phase. To enhance the attack's effectiveness, we introduce Memory-Augmented Group Relative Policy Optimization (MA-GRPO), a novel reinforcement learning algorithm that fine-tunes the agents by learning from a dynamic memory of historical best adversarial documents. Extensive experiments across three real-world datasets demonstrate that RA-ICA increases token consumption by up to 13.12 times with an over 90% success rate, without degrading the integrity of the generated answer.

URL PDF HTML ☆

赞 0 踩 0

2606.02630 2026-06-03 cs.CR cs.AI

MultiTurnPSB: Evaluating Multi-Turn Jailbreak Attacks an dClassifier-Based Defenses for Medical AI Safety

MultiTurnPSB：评估多轮越狱攻击与基于分类器的防御在医疗AI安全中的应用

Anushka Sheoran, Yiduo Hao

发表机构 * University of Pennsylvania（宾夕法尼亚大学）

AI总结提出多轮对抗基准MultiTurnPSB，通过四轮对话评估医疗聊天机器人的安全漏洞，发现多轮攻击下不安全响应率从35%升至近80%，并验证了轻量级输入分类器可降低52个百分点的不安全响应但存在高误报率。

详情

AI中文摘要

面向患者的医疗聊天机器人通常在单轮提示上进行评估，但真实用户在被拒绝后会继续追问、增加紧迫感并援引权威。我们引入了MultiTurnPSB，这是PatientSafetyBench的一个四轮对抗扩展，并在固定模板、模板自适应和实时对抗攻击下评估了GPT-4.1-mini。在实时攻击下，不安全响应从第1轮的35%上升到第4轮的近80%。在相同的攻击者下，GPT-4.1-mini和Claude Sonnet 4.5在基线时统计上无差异，但到第4轮时差距扩大到19倍，这种差异在单轮评估中不可见。我们描述了四种退化轨迹特征，并识别出一个导致大多数灾难性失败的双元素攻击公式。一个轻量级的输入侧分类器将第4轮不安全响应降低了52个百分点，尽管准确性严重下降，但对良性查询的45%误报率是主要的部署限制。还出现了一个方法论发现：Claude Sonnet在超过一半的后期对话中拒绝生成对抗性消息，尽管有明确的红队框架，这表明安全训练可能泛化到攻击者角色。

英文摘要

Patient-facing medical chatbots are commonly evaluated on single-turn prompts, yet real users push back after refusals, add urgency, and invoke authority. We introduce MultiTurnPSB, a four-turn adversarial extension of PatientSafetyBench, and evaluate GPT-4.1-mini under fixed template, template-adaptive, and live adversarial attacks. Unsafe responses rise from 35% to nearly 80% by Turn 4 under live attack. Under the same adversary, GPT-4.1-mini and Claude Sonnet 4.5 are statistically indistinguishable at baseline but diverge to a 19x gap by Turn 4, a difference invisible to single-turn evaluation. We characterize four degradation trajectory signatures and identify a two-element attack formula responsible for most catastrophic failures. A lightweight input-side classifier reduces Turn 4 unsafe responses by 52 percentage points despite severe accuracy degradation, but the 45% false alarm rate on benign queries is the primary deployment constraint. A methodological finding also emerges: Claude Sonnet refused to generate adversarial messages in over half of late-turn conversations despite explicit red team framing, suggesting safety training may generalize to the attacker role.

URL PDF HTML ☆

赞 0 踩 0

2606.02623 2026-06-03 cs.NE cs.AI cs.LG

Oscillatory State-Space Models as Inductive Biases for Physics-Informed Neural PDE Solvers

振荡状态空间模型作为物理信息神经PDE求解器的归纳偏置

Abhishek Chandra, Taniya Kapoor

发表机构 * KTH Royal Institute of Technology（皇家理工学院）； Wageningen University & Research（瓦赫宁根大学与研究中心）

AI总结提出一种结合振荡状态空间动力学和PDE感知空间谱的PINN方法，以改进时变PDE求解的精度和内存效率。

详情

AI中文摘要

求解时变偏微分方程（PDE）是计算科学与工程中的一个重要问题。物理信息神经网络（PINN）从控制方程中学习PDE解。然而，准确捕捉时间演化仍然具有挑战性。最近的基于序列模型的方法使用通用序列模型参数化时间演化，这些模型捕捉时间依赖性，但没有显式编码PDE解的结构化动力学。此外，它们的内存需求可能随序列长度和分辨率而不利地扩展，限制了在大规模或高维设置中的适用性。本文介绍了一种PINN方法，该方法结合了振荡状态空间动力学来表示PDE解的模态结构。所提出的方法利用基于线性振荡器的时间演化，以及空间上的PDE感知谱基。这种设计实现了闭式空间微分和边界条件的一致强制执行。该方法在前向、逆和高维PDE问题上进行了评估，包括高达100个空间维度的情况。结果表明，与最近基于序列模型的PINN方法相比，该方法提高了精度并减少了内存使用。总体而言，本文强调了将结构化动力学先验纳入神经PDE求解器的时间演化中的好处，并建议设计更符合物理和计算高效的PINN架构。

英文摘要

Solving time-dependent partial differential equations (PDEs) is an important problem in computational science and engineering. Physics-informed neural networks (PINNs) learn PDE solutions from governing equations. However, accurately capturing temporal evolution remains challenging. Recent sequence-model-based approaches parameterize time evolution using general-purpose sequence models, which capture temporal dependencies but do not explicitly encode the structured dynamics of PDE solutions. In addition, their memory requirements can scale unfavorably with sequence length and resolution, limiting applicability in large-scale or high-dimensional settings. This work introduces a PINN approach that incorporates oscillatory state-space dynamics to represent the modal structure of PDE solutions. The proposed method leverages a linear-oscillator-based temporal evolution, together with a PDE-aware spectral basis in space. This design enables closed-form spatial differentiation and consistent enforcement of boundary conditions. The method is evaluated on forward, inverse, and high-dimensional PDE problems, including cases up to 100 spatial dimensions. The results show improved accuracy and reduced memory usage compared to recent sequence-model-based PINN approaches. Overall, this work highlights the benefits of incorporating structured dynamical priors into the temporal evolution of neural PDE solvers and suggests designing more physics-aligned and computationally efficient PINN architectures.

URL PDF HTML ☆

赞 0 踩 0

2606.02618 2026-06-03 cs.CE cs.AI cs.MA physics.chem-ph

Closed-Loop Molecular Design with Calibrated Deference

闭环分子设计中的校准式退让

Newman Cheng, Gordon Broadbent, Jason Dong, Syed Mohammed Ali Hussaini, Farman Ullah, Morris Sharp, Gabrielle Barnes, Nanlin Guo, Deyu Zou, Karin Strauss, William Chappell, David G. Kwabi, Bichlien H. Nguyen, Jake A. Smith

发表机构 * Microsoft Discovery & Quantum（微软发现与量子）； Microsoft Research（微软研究院）； Department of Chemical and Environmental Engineering, Yale University（耶鲁大学化学与环境工程系）； CanAm Bioresearch Inc.（CanAm 生物研究公司）

AI总结提出CLIO智能体，通过持续更新的信念状态图和递归计划-行动循环实现校准式退让，在闭环人机协作中成功设计出性能优于文献基准的AORFB负极电解液。

详情

AI中文摘要

我们提出了通过原位优化实现认知循环（CLIO），这是一种将持续更新的信念状态图与递归计划-行动循环相结合的智能体。结果产生了一个推理智能体，能够贡献某种定性的不同之处，我们称之为“校准式退让”：即识别自身工具或假设何时失败、相应调整策略、并生成指导实验修订的机制性假设的能力。我们在一个闭环人机协作活动中测试了CLIO，以设计一种水性有机氧化还原液流电池（AORFB）负极电解液，CLIO在与合成、表征并参与设计选择的化学家密切合作中主导了提议和解释。在三轮共17个候选分子中，CLIO收敛于一个最佳的膦酸酯候选物；表征证实其氧化还原电位比文献基准提高了130 mV。随后表征揭示了出乎意料的差电化学可逆性——这是所有性质预测器都未能标记的回归。CLIO生成了相互竞争的机制性假设，优先安排了诊断性实验，将失败归因于膦酸酯-钾离子配对，并建议用磺酸酯替代。所得化合物显示出显著改善的电化学可逆性，并保持了90 mV的氧化还原电位提升，从而闭环了设计-制造-测试-再设计循环。

英文摘要

We present Cognitive Loop via In-Situ Optimization (CLIO), an agent that couples a continuously-updated belief-state graph with a recursive plan-then-act loop. The result is a reasoning agent that can contribute something qualitatively different, which we term \emph{calibrated deference}: the capacity to recognize when its own tools or assumptions are failing, to adapt its strategy in response, and to generate mechanistic hypotheses that guide experimental revision. We tested CLIO in a closed-loop human-AI campaign to design an aqueous organic redox flow battery (AORFB) negolyte, with CLIO leading proposal and interpretation in close partnership with chemists who synthesized, characterized, and weighed in on design choices. Across 17 candidates over three rounds, CLIO converged on a top phosphonate candidate; characterization confirmed a 130~mV improvement in redox potential over the literature baseline. Characterization then revealed unexpectedly poor electrochemical reversibility -- a regression no property predictor had flagged. CLIO generated competing mechanistic hypotheses, prioritized discriminating diagnostics, traced the failure to phosphonate-potassium ion pairing, and prescribed a sulfonate replacement. The resulting compound showed substantially improved electrochemical reversibility and maintained a 90~mV improvement in redox potential, closing the design-make-test-redesign loop.

URL PDF HTML ☆

赞 0 踩 0

2606.02614 2026-06-03 cs.CE cs.AI

Margin Play: A Multi-Agent System For Public Policy Analysis In The Brazilian Equatorial Margin

边际博弈：巴西赤道边缘地区公共政策分析的多智能体系统

Antonio de Sousa Leitão Filho, Fabrício Saul Lima, Selby Mykael Lima dos Santos, Rejani Bandeira Vieira Sousa, Luís Jorge Mesquita de Jesus, Dennys Correia da Silva, Allan Kardec Duailibe Barros Filho

发表机构 * Aia Context ； Universidade Federal do Maranhão — UFMA（佛罗里达州立大学马纳汉分校）； Universidade Estadual de Campinas — UNICAMP（坎皮纳斯州立大学）

AI总结针对巴西赤道边缘地区石油勘探对马拉尼昂州福利影响的问题，提出基于多智能体强化学习（MARL）的仿真系统Margin Play，通过CTDE范式和BRO-MARL训练六个智能体，发现福利增益取决于制度安排，MA-Prospero配置可显著提升福利并降低环境负债。

详情

AI中文摘要

巴西赤道边缘（BEM）是巴西下一个海上石油前沿，预计于2026年在亚马逊福斯盆地开始运营。其资产在财政和领土上主要与马拉尼昂州相关联——该州在联邦中人类发展指数最低（0.676，IBGE 2022）。这引出了核心政策问题：在什么条件下，BEM的勘探能为马拉尼昂州产生净正外部性？问题本质上是多智能体的：联邦政府寻求收入和能源安全；州政府在宪法规定的特许权使用费专用下寻求区域福利；运营商在风险下最大化利润；ANP和IBAMA持有冲突的职责；亚马逊社区优先考虑领土和环境因素而非货币收入。我们提出Margin Play，一个多智能体强化学习（MARL）系统，在巴西经验校准和经典经济学文献下模拟这些张力。它实现了CTDE范式下的六个智能体，使用BRO-MARL进行训练。来自六个场景中60,000个回合的结果表明，答案取决于制度安排：在参考基线之下，福利增益微乎其微（Waval约1.68），而MA-Prospero配置产生Delta W = +17.5%和Delta Rcom = +21.3%，同时环境负债较低（Eamb = 0.048 vs. 0.076）。根本问题并非生产与福利之间的权衡，而是与勘探相关的公共政策制度的选择。

英文摘要

The Brazilian Equatorial Margin (BEM) is Brazil's next offshore oil frontier, with operations expected to begin in 2026 in the Foz do Amazonas basin. Its assets are fiscally and territorially linked primarily to Maranhao -- the state with the lowest HDI in the Federation (0.676, IBGE 2022). This raises the central policy question: under what conditions does BEM exploration generate net positive externalities for Maranhao? The problem is intrinsically multi-agent: the Federal Government seeks revenue and energy security; the state seeks regional welfare under constitutional royalty earmarking; the operator maximizes profit under risk; ANP and IBAMA hold conflicting mandates; and Amazonian communities prioritize territorial and environmental vectors over monetary income. We present Margin Play, a Multi-Agent Reinforcement Learning (MARL) system simulating these tensions under Brazilian empirical calibration and classical economic literature. It implements six agents under the CTDE paradigm, trained with BRO-MARL. Results from 60,000 episodes across six scenarios indicate the answer is conditional on the institutional regime: under the reference baseline, the welfare gain is marginal (Waval approx. 1.68), whereas the MA-Prospero configuration yields Delta W = +17.5% and Delta Rcom = +21.3%, with a lower environmental liability (Eamb = 0.048 vs. 0.076). The fundamental problem is not a trade-off between production and welfare, but the choice of public policy regime linked to exploration.

URL PDF HTML ☆

赞 0 踩 0

2606.02610 2026-06-03 cs.CE cs.AI cs.LG physics.ao-ph

Samudra 2: Scaling Ocean Emulators across Resolutions

Samudra 2: 跨分辨率扩展海洋仿真器

Yuan Yuan, Jesse Rusak, Alexander Merose, Adam Subel, Pavel Perezhogin, Alistair Adcroft, Carlos Fernandez-Granda, Laure Zanna

发表机构 * Courant Institute School of Mathematics, Computing, and Data Science, New York University（Courant学院数学、计算与数据科学系，纽约大学）； Open Athena AI Foundation, Inc.（开放Athena人工智能基金会）； Program in Atmospheric and Oceanic Sciences, Princeton University（大气与海洋科学项目，普林斯顿大学）

AI总结针对现有海洋神经仿真器在长期自回归滚动中出现的方差崩溃和印记伪影问题，提出Samudra 2，通过改进U-Net骨干网络和动态损失函数，在1°分辨率下将上层海洋全球平均温度R²从0.56提升至0.87，并将深层海洋温度误差降低约七倍，且可扩展至1/2°和1/4°分辨率。

详情

AI中文摘要

海洋环流模式（OGCM）对气候科学至关重要，但计算成本高，限制了集合规模和强迫情景。神经仿真器有望实现数量级的加速，然而现有的海洋仿真器未能将精细空间分辨率与多年自回归滚动相结合。Samudra是第一个产生多十年全球滚动的自回归神经海洋仿真器，但仅限于$1^\\\circ$分辨率，并表现出两种长期故障模式：\\emph{方差崩溃}，即时间变异性的丧失，以及\\emph{印记伪影}，即速度模式泄漏到深海场中。我们提出Samudra 2，它引入了更宽的U-Net骨干网络，采用修改后的ConvNeXt风格块和减小的块内扩展因子，以及一个动态损失函数，根据预测误差重新加权输出通道，从而增强缓慢演变的深海场的梯度。在$1^\\\circ$分辨率下，Samudra 2将上层海洋全球平均温度$R^2$从0.56提高到0.87，并将深海温度误差降低约七倍。相同的架构可扩展到$1/2^\\\circ$和$1/4^\\\circ$分辨率，在大约8年的自回归滚动中恢复中尺度涡旋和尖锐的西边界流。在单个GPU上运行，Samudra 2能够为海平面预测、海洋热吸收和气候变率研究提供更大的集合。我们在此https URL提供代码、文档和基准资源。

英文摘要

Ocean general circulation models (OGCMs) are essential to climate science but computationally expensive, limiting ensemble size and forcing scenarios. Neural emulators promise orders-of-magnitude speedups, yet existing ocean emulators have not combined fine spatial resolution with multi-year autoregressive rollouts. Samudra, the first autoregressive neural ocean emulator to produce multi-decade global rollouts, is limited to $1^\circ$ resolution and exhibits two long-horizon failure modes: \emph{variance collapse}, the loss of temporal variability, and \emph{imprinting artifacts}, in which velocity patterns leak into deep-ocean fields. We present Samudra 2, which introduces a wider U-Net backbone with modified ConvNeXt-style blocks and a reduced block-internal expansion factor, together with a dynamic loss that reweights output channels according to their prediction errors, strengthening gradients for slow-evolving deep-ocean fields. At $1^\circ$, Samudra 2 increases upper-ocean global-mean temperature $R^2$ from 0.56 to 0.87 and reduces deep-ocean temperature error by roughly sevenfold. The same architecture scales to $1/2^\circ$ and $1/4^\circ$ over approximately 8-year autoregressive rollouts, recovering mesoscale eddies and sharp western boundary currents. Running on a single GPU, Samudra 2 enables larger ensembles for sea-level projections, ocean heat uptake, and climate variability studies. We provide code, documentation, and benchmark resources at https://openathena.ai/Ocean_Emulator/.

URL PDF HTML ☆

赞 0 踩 0

2606.02588 2026-06-03 cs.LO cs.AI cs.PL

Lean-GAP: A Dataset of Formalized Graduate Algebra Problems

Lean-GAP：形式化研究生代数问题数据集

Seewoo Lee, Byung-Hak Hwang, Hyojae Lim, Jihoon Hyun, Ilkyoo Choi, Yeachan Park, Jineon Baek, Hyukpyo Hong, Keewoo Lee, Jaeseong Heo, Hyungryul Baik, Chul-hee Lee, Kyu-Hwan Lee

发表机构 * University of California, Berkeley（加州大学伯克利分校）； Korea Advanced Institute of Science and Technology（韩国科学技术院）； Hanyang University（翰阳大学）； Hufs University（Hufs大学）； Sungkyunkwan University（成均馆大学）； University of Wisconsin - Madison（威斯康星大学麦迪逊分校）； Sejong University（世宗大学）； University of Connecticut（康涅狄格大学）

AI总结本文提出Lean-GAP数据集，包含430个来自Dummit和Foote《抽象代数》的形式化研究生代数问题，并开发了从PDF预处理到自动形式化再到验证的可扩展流水线。

2606.02582 2026-06-03 cs.CE cs.LG cs.NA math.NA

Applying Two-Grid Preconditioner for Subsurface Flow Simulation using Attention-enhanced Hybrid Network to Accelerate Multiscale Discretization in High-contrast Media

应用注意力增强混合网络的两网格预条件子进行高对比度介质中地下流动模拟以加速多尺度离散化

Peiqi Li, Jie Chen, Shubin Fu

发表机构 * xjtlu.edu.cn（XTL大学）

AI总结提出一种结合学习与多尺度数值方法的混合框架，利用注意力增强混合网络预测多尺度基函数，并通过两网格预条件求解器加速高对比度介质中达西方程的数值求解。

详情

AI中文摘要

本文研究了强非均质、高对比度渗透率介质中达西方程的高效数值求解，提出了一种结合学习与多尺度数值方法的混合框架。学习组件用于预测混合广义多尺度有限元方法（混合GMsFEM）中的多尺度基函数，旨在减少离线阶段所需的重复局部计算。一旦预测出这些基函数，全局系统被组装，并通过两网格预条件求解器计算压力场。所提方法加速了昂贵的局部基函数构建阶段，同时保留了底层求解器的多尺度离散化和预条件迭代结构。在二维非均质达西问题上的数值实验表明，与几种代表性基于学习的方法相比，所提框架能获得更准确的最终压力重构，并在强非均质和高对比度系数下保持稳定。与传统混合GMsFEM相比，其主要优势在于基函数生成阶段的效率，而全局求解的质量仍由两网格预条件子保证。这些结果表明，通过学习加速多尺度基函数构建，同时保留成熟的全局问题数值求解器，为高分辨率达西型模拟提供了一种可行方法。

英文摘要

In this paper, we study the efficient numerical solution of Darcy equations in strongly heterogeneous media with high-contrast permeability and propose a hybrid framework that combines learning with multiscale numerical methods. The learning component is used for the prediction of multiscale basis functions in the mixed generalized multiscale finite element method (mixed GMsFEM), with the goal of reducing the repeated local computations required in the offline stage. Once these basis functions are predicted, the global system is assembled and the pressure field is computed by a two-grid preconditioned solver. The resulting method accelerates the costly local basis-construction stage while retaining the multiscale discretization and preconditioned iterative structure of the underlying solver. Numerical experiments on two-dimensional heterogeneous Darcy problems show that the proposed framework yields more accurate final pressure reconstruction than several representative learning-based methods and remains stable under strong heterogeneity and high-contrast coefficients. In comparison with the traditional mixed GMsFEM, its main advantage lies in the efficiency of the basis-generation stage, while the quality of the global solve is still ensured by the two-grid preconditioner. These results indicate that accelerating multiscale basis construction through learning, while preserving a mature numerical solver for the global problem, provides a viable approach for high-resolution Darcy-type simulations.

URL PDF HTML ☆

赞 0 踩 0

2606.02758 2026-06-03 math.DG cs.LG math.CT

Theoretical Aspects of Lie Groupoid and Lie Algebroid Equivariant Convolutional Neural Networks

李群胚与李代数胚等变卷积神经网络的理论方面

Michael Astwood

发表机构 * Department of Mathematics, University of Manitoba（曼尼托巴大学数学系）

AI总结本文引入李群胚等变神经网络作为拓扑范畴等变神经网络在可微情形的特化，证明其与李代数胚等变神经网络的等价性，并推广了群不变全局池化。

Comments 28 pages, 2 figures. Preliminary version. Comments and criticism welcome!

2606.03517 2026-06-03 quant-ph cs.AI cs.LG

Scalable On-Hardware Training of Quantum Neural Networks and Application to Clinical Data Imputation

可扩展的量子神经网络片上训练及其在临床数据填补中的应用

Natansh Mathur, Panagiotis Kl. Barkoutsos, Masako Yamada, Martin Roetteler, Iordanis Kerenidis

发表机构 * IRIF, CNRS and Université Paris Cité（巴黎-萨克雷大学 IRIF 实验室、法国国家科学研究中心和巴黎-萨克雷大学）； QC Ware, France（法国 QC Ware 公司）； IonQ（IonQ 公司）； Quantum Signals（量子信号）

AI总结提出一种结合蝴蝶电路架构、逐层训练策略和并行化参数位移规则的训练框架，将梯度估计成本从O(n^2)降至O(log n)，并在MIMIC-III数据集上验证了其可扩展性和性能。

Comments 13 pages, 9 figures

详情

AI中文摘要

在量子硬件上训练量子神经网络（QNN）目前受限于梯度估计的成本：标准参数位移方法所需的电路评估次数随可训练参数数量二次增长，使得在小型系统之外难以进行基于硬件的优化。在这项工作中，我们引入了一个训练框架，将该成本降低到量子比特数的对数级别，使得在近期硬件上以更大规模进行基于梯度的QNN优化成为可能。我们的框架结合了三个协同设计的要素：（i）一种结构化的、保持子空间的蝴蝶电路架构，具有$O(n \log n)$个参数和对数深度；（ii）一种逐层训练策略，将片上优化限制在每次一个小型、结构良好的层上；（iii）一种并行化的参数位移规则，利用每个蝴蝶层内的交换结构，在恒定数量的电路执行中提取所有梯度。这些共同将每个优化步骤所需的独立电路评估次数从$O(n^2)$减少到$O(\log n)$。我们使用MIMIC-III电子健康记录数据集在临床数据填补上验证了该框架，这是一个对优化不稳定性和模型方差敏感的高要求基准。混合经典-量子模型直接在IonQ Forte Enterprise离子阱硬件上以16量子比特进行训练，性能相对于理想或噪声模拟没有下降，并通过张量网络模拟以32量子比特进行训练，32量子比特推理在硬件上执行。得到的模型在下游患者生存预测中匹配或超过强经典神经网络基线，同时表现出跨运行的低方差，证明了所提出的框架在现实硬件约束下实现了实用、可扩展的QNN训练。

英文摘要

Training quantum neural networks (QNNs) on quantum hardware is currently bottlenecked by the cost of gradient estimation: standard parameter-shift methods require a number of circuit evaluations that grows quadratically with the number of trainable parameters, making hardware-based optimisation impractical beyond small system sizes. In this work, we introduce a training framework that reduces this cost to logarithmic in the number of qubits, making gradient-based QNN optimisation feasible on near-term hardware at increasing scales. Our framework combines three co-designed ingredients: (i) a structured, subspace-preserving Butterfly circuit architecture with $O(n \log n)$ parameters and logarithmic depth; (ii) a layer-wise training strategy that confines on-hardware optimisation to one small, well-structured layer at a time; and (iii) a parallelised parameter-shift rule that exploits the commuting structure within each Butterfly layer to extract all gradients in a constant number of circuit executions. Together these reduce the number of distinct circuit evaluations per optimisation step from $O(n^2)$ to $O(\log n)$. We validate the framework on clinical data imputation using the MIMIC-III electronic health record dataset, a demanding benchmark sensitive to optimisation instability and model variance. Hybrid classical-quantum models are trained directly on IonQ Forte Enterprise trapped-ion hardware at 16 qubits without performance degradation relative to ideal or noisy simulation and via tensor-network simulation at 32 qubits, with 32-qubit inference executed on hardware. The resulting models match or exceed strong classical neural baselines in downstream patient survival prediction while exhibiting reduced variance across runs, demonstrating that the proposed framework enables practical, scalable QNN training under realistic hardware constraints.

URL PDF HTML ☆

赞 0 踩 0

2606.02655 2026-06-03 quant-ph cs.GT cs.LG math.OC

Coherent Swap Regret and Channel-Proof Learning

相干交换遗憾与信道证明学习

Sohail Sarkar

发表机构 * Sohail (Neel) Sarkar

AI总结针对量子博弈中局部CPTP偏差，提出相干交换遗憾作为基准，并通过熵镜像上升算法实现O(√(dT log d))的遗憾界，揭示了非幺正使用推荐寄存器是困难根源，并应用于有限量子博弈达到ε-近似可分量子相关均衡。

Comments 23 pages

详情

AI中文摘要

外部遗憾仅保证相对于固定替代行为的稳定性。在量子博弈中，这遗漏了一个自然的物理操作：玩家可以对其实际接收或制备的状态应用局部完全正迹保持（CPTP）映射。我们引入相干交换遗憾作为针对所有此类局部CPTP偏差的遗憾基准，并给出一种算法，通过熵镜像上升在CPTP Choi切片上结合不动点博弈规则，实现O(√(dT log d))的相干交换遗憾。主要结果是一个三级偏差类景观。替换通道以Θ(√(T log d))的速率恢复普通外部遗憾。幺正通道（包括幺正偏差和幺正混合）具有零极小极大遗憾。确定性测量-制备通道在中等时间范围内已迫使Ω(√(dT log d))的遗憾，且该速率对所有CPTP偏差也是充分的。因此，困难源于对推荐寄存器的非幺正使用，而非仅量子相干性。作为应用，有限量子博弈中的去中心化完全信息学习在T=O(max_i d_i log d_i/ε^2)轮后达到ε-近似可分量子相关均衡。我们将这些均衡与中介量子推荐协议的信道证明性等同，给出适用于任意有限维状态的局部CPTP可剥削性的SDP审计，并包含一个在Haar随机纯态探测下具有伪遗憾O(d^{4/3}T^{2/3}(log d)^{1/3})的探测-赌博机扩展。

英文摘要

External regret certifies stability only against replacing one's behavior by a fixed alternative. In a quantum game, this misses a natural physical move: a player can apply a local completely positive trace-preserving (CPTP) map to the state it actually received or prepared. We introduce coherent swap regret as the regret benchmark against all such local CPTP deviations, and give an algorithm achieving $O(\sqrt{dT\log d})$ coherent swap regret via entropic mirror ascent on the CPTP Choi slice with a fixed-point play rule. The main result is a three-level deviation-class landscape. Replacement channels recover ordinary external regret at rate $Θ(\sqrt{T\log d})$. Unital channels, including unitary deviations and mixtures of unitaries, have zero minimax regret. Deterministic measurement-and-preparation channels already force $Ω(\sqrt{dT\log d})$ regret in the moderate-horizon regime, and this rate is also sufficient for all CPTP deviations. Thus the hardness comes from non-unital use of the recommendation register, not from quantum coherence alone. As an application, decentralized full-information learning in finite quantum games reaches an $\varepsilon$-approximate separable quantum correlated equilibrium after $T=O(\max_i d_i\log d_i/\varepsilon^2)$ rounds. We identify these equilibria with channel-proofness of mediated quantum recommendation protocols, give an SDP audit for local CPTP exploitability applicable to arbitrary finite-dimensional states, and include a probing-bandit extension with pseudo-regret $O(d^{4/3}T^{2/3}(\log d)^{1/3})$ under Haar-random pure-state probes.

URL PDF HTML ☆

赞 0 踩 0

2606.03917 2026-06-03 physics.app-ph cs.LG

Beyond Gradient Descent: Adam for Analog Ising Machines

超越梯度下降：用于模拟伊辛机的Adam优化器

Stijn Van Vooren, Guy Van der Sande, Guy Verschaffelt

发表机构 * Applied Physics research group, Vrije Universiteit Brussel（应用物理研究组，布鲁塞尔自由大学）

AI总结研究将动量法和Adam优化器应用于模拟连续时间伊辛机，通过推导连续时间版本，在Max-Cut基准测试中显著缩短求解时间并提高解质量，并引入一阶连续时间近似作为物理实现的简化起点。

Comments submitted to Physical Review E

详情

AI中文摘要

随着摩尔定律达到极限，伊辛机为难优化问题提供了一种有前景的替代计算方法。然而，许多模拟、时间连续的伊辛机依赖类似梯度下降的动力学来寻找解，这可能限制速度和鲁棒性。我们研究了动量法和Adam优化是否能改进这些系统。由于这些优化器传统上以离散时间形式表述，我们推导了适用于模拟、时间连续伊辛机动力的连续时间版本。在Max-Cut基准测试中，我们发现基于Adam的动力学相比基于梯度下降和动量的动力学，显著减少了达到目标的时间并提高了解质量。我们进一步引入了Adam的一阶连续时间近似，旨在作为未来物理实现的更简单起点，并且在连续时间设置中表现优于完整的Adam公式。我们还研究了纯算法离散时间设置，其中在较容易的问题实例上性能差距缩小，而在较难的加权问题实例上基于Adam的更新规则表现最佳。这些结果将连续时间Adam动力学确定为模拟伊辛机的一个强大设计原则。

英文摘要

As Moore's law reaches its limits, Ising machines offer a promising alternative computing approach for difficult optimization problems. However, many analog, time-continuous Ising machines rely on gradient-descent-like dynamics to find solutions, which can limit speed and robustness. We investigate whether momentum and Adam optimization can improve these systems. Since these optimizers are traditionally formulated in discrete time, we derive continuous-time versions suitable for analog, time-continuous Ising-machine dynamics. On Max-Cut benchmarks, we find that Adam-based dynamics substantially reduce time-to-target and improve solution quality compared with gradient-descent- and momentum-based dynamics. We further introduce a first-order continuous-time approximation of Adam that is intended as a simpler starting point for future physical implementations and while performing better than the full Adam formulation in a continuous-time setting. We also study a purely algorithmic discrete-time setting, where the performance gap is reduced on easier problem instances, while the Adam-based update rule performs best on harder weighted problem instances. These results identify continuous-time Adam dynamics as a powerful design principle for analog Ising machines.

URL PDF HTML ☆

赞 0 踩 0

2606.02646 2026-06-03 physics.soc-ph cs.AI cs.MA

The Ringelmann Effect in Multi-Agent LLM Systems: A Scaling Law for Effective Team Size

多智能体大语言模型系统中的林格曼效应：有效团队规模的缩放定律

Blaž Bertalanič, Carolina Fortuna

发表机构 * Jozef Stefan Institute（乔泽夫·斯蒂芬研究所）

AI总结本文推导出两参数缩放定律 $R(N) = N_\text{eff}/N = 1/(1+c(N-1)N^{-\beta})$，将多智能体LLM系统分为三种渐近状态，并通过44个实验单元验证了该定律，发现密集辩论无法增加答案多样性，噪声安慰剂可模拟自我修正效果，且仅异构团队能突破硬上限。

Comments 41 pages, 9 figures, 20 tables

详情

AI中文摘要

推理时多智能体大语言模型缩放缺乏共享单位：计数名义智能体混淆了成本与独立证据。我们推导出一个两参数缩放定律 $R(N) = N_\text{eff}/N = 1/(1+c(N-1)N^{-\beta})$，其中状态指数 $\beta$ 将任何配置分类为三种渐近状态之一——硬上限为 $1/c$（$\beta = 0$）、亚线性为 $N^\beta/c$（$0 < \beta < 1$）或线性（$\beta \ge 1$），并且平均场定理预测智能体辩论中的同伴数量 $k$ 和轮次 $\tau$ 仅通过其乘积 $k\tau$ 进入动力学。该定律适用于两个层面：答案多样性和正确性冗余。在44个（模型 $\times$ 任务 $\times$ 条件）单元中，涵盖同伴辩论、自我修正、随机噪声安慰剂、自一致性、三个开放权重系列（Qwen、Llama、Ministral）从7B到32B规模，并辅以前沿API检查（Gemini）、思维模型、异构团队和稀疏通信，函数形式在每个条件下拟合 $R^2 > 0.99$；仅 $(c, \beta)$ 发生偏移。在自由形式数学问题上，密集同伴影响将答案层面状态从亚线性坍缩为硬上限；正确性层面拟合始终保持硬上限。三个发现具有实际意义。 (i) 三十个密集辩论智能体在MMLU-Hard上产生的答案多样性不超过一个智能体。 (ii) 噪声安慰剂在自由形式数学问题及4倍规模下追踪自我修正，因此在同质团队中，通常归因于“辩论”的收益来自重新评估，而非同伴内容。 (iii) 单个 $N \le 5$ 的试点预测了 $N=30$ 的结构上限，并且在测试的配置中，只有架构多样性（异构团队）降低了 $c$ 并逃离了硬上限状态，通信模式干预则不能。

英文摘要

Inference-time multi-agent LLM scaling lacks a shared unit: counting nominal agents conflates cost with independent evidence. We derive a two-parameter scaling law $R(N) = N_\text{eff}/N = 1/(1+c(N-1)N^{-β})$ where the regime exponent $β$ classifies any configuration into one of three asymptotic regimes -- hard-ceiling at $1/c$ ($β= 0$), sublinear at $N^β/c$ ($0 < β< 1$), or linear ($β\ge 1$), and a mean-field theorem predicts that peer count $k$ and rounds $τ$ during agent debate enter the dynamics only through their product $kτ$. The law applies at two levels: answer diversity and correctness redundancy. Across 44 (model $\times$ task $\times$ condition) cells spanning peer debate, self-correction, random-noise placebo, self-consistency, three open-weight families (Qwen, Llama, Ministral) at scales from 7B to 32B with a frontier API check (Gemini), thinking models, heterogeneous teams, and sparse communication, the functional form fits every condition at $R^2 > 0.99$; only $(c, β)$ shifts. On free-form math, dense peer influence collapses the answer-level regime from sublinear into hard-ceiling; correctness-level fits remain hard-ceiling throughout. Three findings have practical implications. \emph{(i)}~Thirty dense debating agents produce no more answer diversity than one on MMLU-Hard. \emph{(ii)}~A noise placebo tracks self-correction on free-form math and at $4\times$ scale, so within homogeneous teams the gain commonly attributed to ``debate'' comes from re-evaluation, not peer content. \emph{(iii)}~A single $N \le 5$ pilot predicts the $N=30$ structural ceiling, and within the configurations tested only architectural diversity (heterogeneous teams) lowers $c$ and escapes the hard-ceiling regime, communication-mode interventions do not.

URL PDF HTML ☆

赞 0 踩 0

2606.03735 2026-06-03 nlin.CD cs.MA cs.RO

On dynamic multi-agent pathfinding methods: review, simulations and modifications

动态多智能体路径规划方法：综述、仿真与改进

Gabriel Fejziaj, Salama Hassona, Wieslaw Marszalek

发表机构 * Department of Computer Science, Opole University of Technology（计算机科学系，奥波尔技术大学）

AI总结本文系统研究动态多智能体路径规划（D-MAPF）中的六种代表性算法，并提出一种基于模板的A**算法，通过离线几何路径生成与在线时间适应解耦，在频繁变化和有限感知环境中提高解质量。

2606.02600 2026-06-03 cond-mat.dis-nn cs.LG

High-Dimensional Latents Should Be Diagnosed Through Phase Structure

高维潜在变量应通过相结构进行诊断

Alejandro Ascarate, Leo Lebrat, Rodrigo Santa Cruz, Clinton Fookes, Olivier Salvado

发表机构 * Queensland University of Technology（昆士兰技术大学）

AI总结本文通过自旋玻璃理论分析自编码器和变分自编码器的潜在空间，提出基于相结构的诊断方法，并展示其在生成和异常检测任务中的实际效益。

Comments 9+22 pages, 4+6 figures, under review

详情

AI中文摘要

我们通过自旋玻璃理论的视角研究自编码器和变分自编码器的潜在空间。本文包含两个部分。首先，我们形式化了一个潜在空间自旋玻璃字典：对于固定的解码器，重建项与超球坐标先验共同在潜在球面上诱导出一个哈密顿量，其中潜在坐标扮演连续自旋的角色，先验则充当外部磁场。这使我们能够引入可操作的自旋玻璃诊断——重叠分布、磁化率和块自旋粗粒化——来检测训练后潜在表示中的有序、无序和边缘稳定相。其次，我们表明，有意将潜在系统推向拓扑平凡化区域的边缘稳定状态会带来具体的下游后果。在生成方面，超球压缩改善了CIFAR-10和CelebA64上的重建-生成权衡，在保持或改善重建的同时降低了自FID。在异常检测方面，相同的半有序潜在几何提高了完全无监督和条件性OOD检测的性能，包括真实世界的火星车和Galaxy Zoo数据集，以及基于CIFAR-10/100和Imagenette的OOD基准。因此，我们倡导对AE/VAE采用相感知的评估范式，其中自旋玻璃可观测量补充标准机器学习指标，并揭示在许多情况下决定下游成功或失败的潜在区域。

英文摘要

We study autoencoder and variational-autoencoder latent spaces through the lens of spin-glass theory. The paper has two components. First, we formalize a latent-space spin-glass dictionary: for a fixed decoder, the reconstruction term together with a hyperspherical coordinates prior induces a Hamiltonian on the latent sphere, where latent coordinates play the role of continuous spins and the prior acts as an external magnetic field. This allows us to import operational spin-glass diagnostics -- overlap distributions, susceptibility, and block-spin coarse-graining -- to detect ordered, disordered, and edge-of-stability phases in trained latent representations. Second, we show that deliberately driving the latent system toward the edge-of-stability of the topological trivialization regime has concrete downstream consequences. In generation, hyperspherical compression improves the reconstruction-generation trade-off on CIFAR-10 and CelebA64, yielding lower self-FID while preserving or improving reconstruction. In anomaly detection, the same semi-ordered latent geometry improves both fully unsupervised and conditional OOD detection, including real-world Mars Rover and Galaxy Zoo datasets, as well as CIFAR-10/100 and Imagenette-based OOD benchmarks. We therefore advocate a phase-aware evaluation paradigm for AEs/VAEs, in which spin-glass observables complement standard ML metrics and expose the latent regimes that underlie downstream success or failure in many cases.

URL PDF HTML ☆

赞 0 踩 0

2606.02912 2026-06-03 astro-ph.IM cs.LG gr-qc physics.geo-ph

Data-Driven Forecasting of three-Component Seismograms Using Transformer Architectures

基于Transformer架构的三分量地震图数据驱动预测

Waleed Esmail, Stuart Russell, Jana Klinge, Alexander Kappes, Christine Thomas

发表机构 * Institut für Kernphysik, Universität Münster（穆斯特大学核物理研究所）； Institut für Geophysik, Universität Münster（穆斯特大学地质物理研究所）； James Cook University（詹姆斯·库克大学）； Geological Survey of Denmark and Greenland（丹麦和格陵兰地质调查局）

AI总结提出基于Transformer的自回归模型SeismoGPT，通过物理约束的延续问题框架直接预测三分量地震波形，在合成数据上实现中位数归一化互相关>0.93，证明了Transformer序列模型可学习地震波场的稳定动力学延续。

Comments 35 pages, 13 figures and 4 tables

详情

AI中文摘要

由于地震波传播的非线性、色散和多尺度特性，预测超出观测数据的地震波形仍然具有挑战性。在这项工作中，我们引入了 extsc{SeismoGPT}，一种基于Transformer的自回归模型，旨在直接在时域中预测三分量地震波形。预测被表述为一个物理约束的延续问题，其中模型接收从P波到达开始并延伸至S波到达后定义时间的波形上下文，之后在没有真实样本的情况下递归生成未来运动。在合成地震图上进行评估，这些地震图覆盖了5--100 km的震源深度、10--90$^\circ$的震中距离以及$3 \leq M_w \leq 7$的震级。为了区分上下文长度和预测范围的影响，我们使用距离归一化上下文比率和固定的120秒及240秒预测范围定义了三种评估配置。在所有配置中，模型的中位数归一化互相关均高于0.93。对代表性预测的分析表明，成功的预测保留了相位一致性和频谱能量分布。在出现失败案例时，主要原因是自回归展开过程中的逐渐相位漂移，而非非物理的信号生成。这些结果表明，基于Transformer的序列模型可以学习地震波场的稳定动力学延续，凸显了基础模型方法在物理驱动时间序列预测中的潜力。该方法在地震预警和减灾中具有潜在应用，特别是对于下一代引力波观测站，如爱因斯坦望远镜。

英文摘要

Forecasting seismic waveforms beyond observed data remains challenging due to the nonlinear, dispersive, and multi-scale nature of seismic wave propagation. In this work, we introduce \textsc{SeismoGPT}, a transformer-based autoregressive model designed to forecast three-component seismic waveforms directly in the time domain. Forecasting is formulated as a physically constrained continuation problem in which the model receives waveform context beginning at the P-wave arrival and extending a defined time beyond the S-wave arrival, after which future motion is generated recursively without access to ground-truth samples. Evaluation is performed on synthetic seismograms spanning source depths of 5--100\,km, epicentral distances of 10--90$^\circ$, and magnitudes $3 \leq M_w \leq 7$. To disentangle the effects of context length and prediction horizon, we define three evaluation configurations using a distance-normalized context ratio and fixed prediction horizons of 120 and 240\,s. Across all configurations, the model achieves median normalized cross correlation above 0.93. Analysis of representative forecasts shows that successful predictions preserve both phase coherence and spectral energy distribution. Where failure cases arise, this is primarily due to gradual phase drift during autoregressive rollout rather than unphysical signal generation. These results demonstrate that transformer-based sequence models can learn stable dynamical continuation of seismic wavefields, highlighting the potential of foundation-model approaches for physics-driven time-series forecasting. There are potential applications of this methodology in seismic warning and hazard mitigation, particularly for next-generation gravitational-wave observatories, such as the Einstein Telescope.

URL PDF HTML ☆

赞 0 踩 0

2606.02788 2026-06-03 astro-ph.IM cs.LG

Neutrino Fingerprints: Image-Based Encodings of IceCube Events for CNN Direction Reconstruction

中微子指纹：基于图像的 IceCube 事件编码用于 CNN 方向重建

Floriano Tori, Brecht Verbeken, Vincent Ginis

发表机构 * Data Analytics Lab, Vrije Universiteit Brussel（自由大学布鲁塞尔数据分析实验室）； imec-SMIT, Vrije Universiteit Brussel（imec-SMIT，自由大学布鲁塞尔）； School of Engineering and Applied Sciences, Harvard University（哈佛大学工程与应用科学学院）

AI总结提出将 IceCube 中微子事件编码为紧凑的 72×72×3 图像（中微子指纹），利用 ResNet18 卷积网络实现方向重建，平均角误差为 1.10 rad，性能媲美更复杂架构。

Comments 6 pages, 1 figure

2606.02240 2026-06-03 cs.CR cs.AI cs.CL cs.ET

AgentRedBench: Dynamic Redteaming and Integration-Aware Defense for LLM Agents over SaaS Integrations

AgentRedBench: 针对SaaS集成的LLM代理的动态红队测试与集成感知防御

Hiskias Dingeto, William Leeney

发表机构 * StackOne Technologies（StackOne技术公司）

AI总结针对LLM代理在工具使用中面临的间接提示注入威胁，提出动态红队基准AGENTREDBENCH（覆盖24个企业集成、5种攻击类型）和基于集成多样语料训练的防御模型AGENTREDGUARD，将攻击成功率从69.9%降至2.4%，误报率仅0.37%。

详情

AI中文摘要

工具使用代理中的间接提示注入是一个具体的生产威胁：LLM代理读取来自集成（通过工具调用访问的第三方服务，如Gmail、Salesforce或Jira）的响应内容，用户既未编写也无法控制这些内容。现有基准低估了该威胁：大多数仅覆盖少量集成，且每次运行重复相同的攻击载荷，而开源防护模型是在聊天风格数据而非工具响应内容上训练的。我们引入了AGENTREDBENCH，这是一个动态的LLM驱动的红队测试基准，包含215个微妙的未明确授权场景（在用户请求授权边界上的攻击），涵盖9个功能家族、24个企业集成和5种攻击类型。在八模型面板（Anthropic、OpenAI、Google）上，无防护的攻击成功率（ASR）范围从32%（Claude Sonnet 4.6）到81%（Gemini 3 Flash）。为了保持场景集不在训练语料中，并随时间保持标题ASR的意义，我们开源了代码库、集成模式和AGENTREDGUARD模型；规范场景通过维护者中介渠道进行评估，具有不可变版本控制。我们随基准发布了AGENTREDGUARD：一个在集成多样化的对抗性工具响应内容语料上训练的防护模型。AGENTREDGUARD将面板ASR从69.9%降至2.4%，误报率为0.37%，在两个指标上均优于所有具有非平凡检测能力的开源基线（Llama Guard、PromptGuard 2、ProtectAI）。跨集成和跨攻击类型的保留测试均证实了增益在训练子集之外具有迁移性。

英文摘要

Indirect prompt injection in tool-use agents is a concrete production threat: LLM agents read from integrations (third-party services such as Gmail, Salesforce, or Jira accessed through tool calls) whose response content the user neither writes nor controls. Existing benchmarks under-measure the threat: most cover only a handful of integrations with the same attack payload replayed across runs, and open-source guards are trained on chat-style data rather than tool-response content. We introduce AGENTREDBENCH, a dynamic LLM-driven redteaming benchmark of 215 subtle underspecified authorization (attacks at the boundary of what the user's request authorises) scenarios across 24 enterprise integrations in nine functional families and five attack types. Across an eight-model panel (Anthropic, OpenAI, Google), no-guard ASR (attack success rate) ranges from 32% (Claude Sonnet 4.6) to 81% (Gemini 3 Flash). To keep the scenario set out of training corpora and preserve headline ASR meaning over time, we release the codebase, integration schemas, and AGENTREDGUARD model openly; the canonical scenarios are evaluated through a maintainer-mediated channel with immutable versioning. We release AGENTREDGUARD alongside the benchmark: a guard trained on an integration-diverse corpus of adversarial tool-response content. AGENTREDGUARD cuts panel ASR from 69.9% to 2.4% at 0.37% false-positive rate, outperforming every open-source baseline with non-trivial detection (Llama Guard, PromptGuard 2, ProtectAI) on both axes. Cross-integration and cross-attack type holdouts both confirm the gain transfers beyond the training subset.

URL PDF HTML ☆

赞 0 踩 0

2606.01472 2026-06-03 cs.DC cs.AI cs.LG

Hierarchical Online Prompt Mutation with Dual-Loop Feedback for Guardrailed Evidence Document Generation: A Production-Evaluation Case Study

分层在线提示变异与双环反馈用于有护栏的证据文档生成：生产评估案例研究

Nataraj Agaram Sundar, Tejas Morabia

发表机构 * eBay Inc.（eBay公司）

AI总结提出分层在线提示变异框架HOPM，通过双环反馈（人工审核与自动评判）优化提示策略，在真实市场纠纷证据生成中显著提升胜率和质量。

Comments 7 pages. Production-evaluation case study of guardrailed LLM evidence-document generation

详情

AI中文摘要

高风险生产文档生成系统要求语言模型具有适应性、基于证据且可审计。我们提出HOPM，一种分层在线提示变异框架，在真实市场纠纷证据工作流上评估。HOPM将提示视为在线策略：一个家族/版本路由器选择提示，确定性护栏将失败归因于可变的提示-令牌类别，来自人工审核和自动评判的双重反馈更新路由和变异优先级。主要证据是观察到的匹配生产评估消融：七个变体在相同的600个案例上评估，实现组件比较：静态提示、手动迭代、仅bandit路由、仅变异适应、仅人工反馈、仅自动评判反馈和全双环HOPM。全HOPM将计数胜率从34.7%提升至45.7%（+11.0个百分点；配对McNemar p=1.31e-11），金额加权胜率从22.3%提升至41.4%（+19.1个百分点；95%配对bootstrap CI [10.3, 28.9]个百分点）。它还将平均Likert质量从3.18提高到4.40，并将问题标记率从15.3%降低到5.2%。支持性审查工件涵盖770篇生成文本审查、318份标记审查员导出、一个10案例/61评分的校准切片和一个70案例/350评分的OCR基准；这些工件校准评分标准、护栏、标题风险和OCR风险解释，而非替代生产消融。论文包括控制设置、样本量、置信区间、配对检验、提示-令牌类别、伪代码、模式、评分标准、护栏分类法以及一个构造示例，以便在不暴露专有证据的情况下重现评估结构。

英文摘要

High-stakes production document-generation systems require language models to be adaptive, evidence-grounded, and auditable. We present HOPM, a hierarchical online prompt mutation framework evaluated on a real marketplace dispute-evidence workflow. HOPM treats prompts as online policies: a family/version router selects a prompt, deterministic guardrails attribute failures to mutable prompt-token categories, and dual feedback from human review and an automated judge updates both routing and mutation priorities. The primary evidence is an observed matched production-evaluation ablation: seven variants are evaluated on the same 600 cases each, enabling component comparisons against static prompting, manual iteration, bandit-only routing, mutation-only adaptation, human-only feedback, auto-judge-only feedback, and full dual-loop HOPM. Full HOPM improves count win rate over a static control from 34.7% to 45.7% (+11.0 pp; paired McNemar p = 1.31e-11) and amount-weighted win rate from 22.3% to 41.4% (+19.1 pp; 95% paired bootstrap CI [10.3, 28.9] pp). It also increases mean Likert quality from 3.18 to 4.40 and reduces issue-flag rate from 15.3% to 5.2%. Supporting review artifacts cover 770 generated-text reviews, 318 labeled reviewer exports, a 10-case/61-rating calibration slice, and a 70-case/350-rating OCR benchmark; these artifacts calibrate rubric, guardrail, title-risk, and OCR-risk interpretation rather than substituting for the production ablation. The paper includes control setup, sample sizes, confidence intervals, paired tests, prompt-token categories, pseudocode, schema, rubric, guardrail taxonomy, and a constructed example so the evaluation structure can be reproduced without exposing proprietary evidence.

URL PDF HTML ☆

赞 0 踩 0

2606.01166 2026-06-03 cs.CR cs.CL

BraveGuard: From Open-World Threats to Safer Computer-Use Agents

BraveGuard: 从开放世界威胁到更安全的计算机使用代理

Yunhao Feng, Xiaohu Du, Xinhao Deng, Yifan Ding, Ming Wen, Yixu Wang, Yuxiang Xie, Baihui Zheng, Yingshui Tan, Yige Li, Yutao Wu, Kerui Cao, Wenke Huang, Yanming Guo, Xingjun Ma, Yu-Gang Jiang

发表机构 * Fudan University（复旦大学）； Ant Group（蚂蚁集团）； Hunan Institute of Advanced Technology（湖南高级技术研究所）； Alibaba Group（阿里巴巴集团）； Singapore Management University（新加坡管理大学）； Deakin University（德肯大学）； Nanyang Technological University（南洋理工大学）； Shanghai Innovation Institute（上海创新研究院）

AI总结提出BraveGuard框架，通过从开放世界威胁信号和真实代理轨迹中训练防护模型，实现轨迹级别的安全检测，显著提升计算机使用代理的安全性。

详情

AI中文摘要

计算机使用代理将语言模型从文本生成扩展到与文件、终端、浏览器和外部工具的持续交互。这种转变带来了安全风险，这些风险难以从孤立的提示或最终响应中检测出来，因为危害通常只在多步执行轨迹中显现，而单个动作在局部看似无害。我们引入了BraveGuard，一个自我进化的防御框架，用于从开放世界威胁信号和真实代理轨迹中训练防护模型。BraveGuard挖掘近期研究来源以识别新兴风险和攻击模式，将其实例化为可执行的计算机使用任务，收集代理轨迹，并为防护模型训练提供轨迹级别的监督。随着新威胁和验证失败的出现，可以重复该流程，形成一个自适应防御循环，而不是静态的、基准驱动的训练过程。我们通过训练多个防护骨干模型（包括Qwen3-Guard和Llama-Guard变体）来实例化BraveGuard，并在轨迹级别的代理安全基准上评估生成的防护模型。BraveGuard在计算机使用轨迹上持续提高了安全检测能力。在AgentHazard上，与现成的防护模型相比，它显著提高了检测准确性，在平均防护模型设置下，准确率从38.79%提升到82.38%。这些结果表明，基于开放世界威胁发现和真实代理执行的防护监督可以超越固定分类法和合成提示级数据，改进安全监控。BraveGuard为面对不断变化的现实世界风险的计算机使用代理提供了一条可扩展的自适应防御路径。

英文摘要

Computer-use agents extend language models from text generation to sustained interaction with files, terminals, browsers, and external tools. This shift creates safety risks that are difficult to detect from isolated prompts or final responses, because harm often emerges only through multi-step execution traces whose individual actions appear locally benign. We introduce BraveGuard, a self-evolving defense framework for training guard models from open-world threat signals and realistic agent trajectories. BraveGuard mines recent research sources to identify emerging risks and attack patterns, instantiates them as executable computer-use tasks, collects agent rollouts, and derives trajectory-level supervision for guard model training. As new threats and validation failures appear, the pipeline can be repeated, yielding an adaptive defense loop rather than a static, benchmark-driven training process. We instantiate BraveGuard by training multiple guard backbones, including Qwen3-Guard and Llama-Guard variants, and evaluate the resulting guards on trajectory-level agent-safety benchmarks. BraveGuard consistently improves safety detection across computer-use trajectories. On AgentHazard, it substantially improves detection accuracy over off-the-shelf guard models, with accuracy increasing from 38.79% to 82.38% under the averaged guard-model setting. These results show that guard supervision grounded in open-world threat discovery and realistic agent execution can improve safety monitoring beyond fixed taxonomies and synthetic prompt-level data. BraveGuard offers a scalable path toward adaptive defenses for computer-use agents facing evolving real-world risks.

URL PDF HTML ☆

赞 0 踩 0

2606.00188 2026-06-03 cs.GR cs.CV cs.LG

PaintBench: Deterministic Evaluation of Precise Visual Editing

PaintBench: 精确视觉编辑的确定性评估

Kai Xu, Ellis Brown, Shrikar Madhu, Rob Fergus, He He, Saining Xie

发表机构 * New York University（纽约大学）

AI总结提出PaintBench基准，通过程序化生成20种基本视觉编辑操作，实现确定性像素级评估，发现当前模型性能低（最高mIoU 17.1%），并揭示任务分解和场景变化的影响。

Comments Project Page: https://paintbench.github.io/

详情

AI中文摘要

虽然当前的多模态模型在开放式视觉编辑方面表现熟练，但执行精确的单答案编辑仍然是一个重要障碍。为了探究这一挑战，我们引入了PaintBench，一个动态可扩展的基准测试，针对四个类别的20种基本精确视觉编辑操作：几何变换、结构操作、颜色变化和符号推理。具有可配置复杂性的程序化生成实现了有效无限、抗污染的评估套件，而确定性像素级评估消除了对易偏见的评判模型的依赖。在11个图像编辑模型中，我们发现整体性能较低，当前表现最佳的行业领先者仅得17.1%（mIoU）。任务分解揭示了特别具有挑战性的操作类型（几何变换、大多数结构操作、基于公式的颜色变化）和模型特定的专长。细粒度的基准诊断进一步显示了由对象数量、背景复杂性、配色方案和编辑区域大小等场景变化引起的性能下降。为了测试PaintBench分数对应用任务性能的泛化能力，我们创建了一个用于数据可视化编辑的程序化确定性评估（TinyGrafixBench），并发现其与PaintBench分数之间存在强线性相关性（$R^2 = 0.91$, $p < 0.001$）。总之，PaintBench为衡量和推动精确多模态视觉编辑的进展提供了严格的基础。

英文摘要

While current multimodal models are proficient at open-ended visual editing, executing precise single-answer edits remains an important obstacle. To probe this challenge, we introduce PaintBench, a dynamically scalable benchmark targeting 20 fundamental precise visual editing operations across four categories: geometric transformation, structural manipulation, color change, and symbolic reasoning. Procedural generation with configurable complexity enables an effectively infinite, contamination-resistant evaluation suite, and deterministic pixel-level evaluation eliminates reliance on bias-prone judge models. Across 11 image editing models, we find overall low performance, with the current highest-performing industry leader scoring only 17.1% (mIoU). Task decomposition reveals especially challenging operation types (geometric transformation, most structural manipulation, formula-based color change) and model-specific specializations. Fine-grained benchmark diagnostics further show performance degradations induced by scene variations in object count, background complexity, color scheme, and edit-region size. To test generalization of PaintBench scores to applied task performance, we create a procedural, deterministic evaluation for data visualization editing (TinyGrafixBench) and find strong linear correlation with PaintBench scores ($R^2 = 0.91$, $p < 0.001$). Altogether, PaintBench provides a rigorous foundation for measuring and driving progress in precise multimodal visual editing.

URL PDF HTML ☆

赞 0 踩 0

2605.31530 2026-06-03 eess.AS cs.SD

UNISON: A Unified Sound Generation and Editing Framework via Deep LLM Fusion

UNISON: 通过深度LLM融合的统一声音生成与编辑框架

Zhaoqing Li, Haoning Xu, Jingran Su, Yaofang Liu, Zhefan Rao, Huimeng Wang, Jiajun Deng, Tianzi Wang, Zengrui Jin, Rui Liu, Haoxuan Che, Xunying Liu

发表机构 * The Chinese University of Hong Kong（香港中文大学）； The Hong Kong Polytechnic University（香港理工大学）； City University of Hong Kong（香港城市大学）； The Hong Kong University of Science and Technology（香港科学与技术大学）； Tsinghua University（清华大学）； Huawei Research Hong Kong（华为香港研究）

AI总结提出UNISON，一个基于潜在扩散的统一框架，通过层间深度LLM融合和多任务架构，实现语音生成、声音生成和音频编辑，在多个任务上达到或超越专业模型性能，且参数量减少约4倍。

详情

AI中文摘要

我们提出UNISON，一个潜在扩散框架，将语音生成、声音生成和音频编辑统一在单个模型中。单个模型处理文本到音频、文本到语音、零样本说话人克隆、混合语音与声音生成、场景级音频编辑、场景中语音编辑以及定时时间组合，所有这些任务共享一组权重。我们的架构具有两个核心设计：(1) 层间深度LLM融合，通过学习的投影将来自冻结MLLM均匀采样层的隐藏状态注入对应的MM-DiT块，提供深度匹配的语义条件，改善指令遵循能力，优于单层基线；(2) 统一的多任务架构，其中任务身份仅由通道掩码编码，源音频通过VAE编码的通道拼接提供。训练通过在线GPU端多任务数据合成流水线（具有任务同质批处理和两阶段课程）稳定进行。拥有621M至732M可训练参数，UNISON在评估的各个领域取得了与任务专业模型竞争或超越的结果，同时比类似统一系统小约4倍。

英文摘要

We present UNISON, a latent diffusion framework that unifies speech generation, sound generation, and audio editing within a single model. A single model handles text-to-audio, text-to-speech, zero-shot speaker cloning, mixed speech-and-sound generation, scene-level audio editing, speech-in-scene editing, and timed temporal composition, all of which share a single set of weights. Our architecture features two core designs: (1) Layer-wise deep LLM fusion, which injects hidden states from uniformly sampled layers of a frozen MLLM into corresponding MM-DiT blocks via learned projections, providing depth-matched semantic conditioning that improves instruction following over single-layer baselines; and (2) a unified multi-task architecture where task identity is encoded solely by a channel-wise mask and source audio is provided through VAE-encoded channel concatenation. Training is stabilized by an online GPU-side multi-task data synthesis pipeline with task-homogeneous batching and a two-stage curriculum. With 621M--732M trainable parameters, UNISON achieves results competitive with or exceeding task-specialist models across evaluated domains, while being roughly $4\times$ smaller than comparable unified systems.

URL PDF HTML ☆

赞 0 踩 0

2605.27454 2026-06-03 eess.IV cs.CV

NL-MambaXCT: Self-Supervised Nested-Learning Mamba for Nomex Honeycomb X-ray CT Defect Classification

NL-MambaXCT：用于Nomex蜂窝X射线CT缺陷分类的自监督嵌套学习Mamba

Ghaleb Aldoboni, Lobna Nassar, Fakhri Karray, Reem Alshamsi

发表机构 * Aurak Academy of Arts and Sciences（阿劳克艺术与科学学院）； Machine Intelligence Institute（人工智能研究所）； University of Waterloo（滑铁卢大学）

AI总结提出NL-MambaXCT框架，结合自监督掩码图像建模和嵌套学习，实现Nomex蜂窝XCT缺陷的高效分类，在测试集上达到96.91%准确率。

详情

AI中文摘要

X射线计算机断层扫描（XCT）广泛应用于航空航天制造中Nomex蜂窝结构的无损检测，但工业检测仍严重依赖人工解读和基于有限标注数据训练的监督模型。本文提出NL-MambaXCT，一个基于Mamba的框架，结合自监督掩码图像建模和嵌套学习（NL）公式，用于从生产XCT切片中进行自动化、标签高效的缺陷分类。骨干网络是一个四阶段2D编码器，早期阶段使用RegNet卷积块，深层阶段使用基于Mamba的序列混合与注意力。该网络在19,961张未标注的工业XCT切片上通过掩码图像建模进行预训练，并在按生产顺序划分的2,000张重新标注的Nomex XCT切片上进行微调。NL通过双时间尺度参数动态实现：选定投影保持慢速指数移动平均轨迹与快速权重并行，而深度动量优化器引入额外的慢速参数更新轨迹。在保留测试集上，MIM预训练的NL-MambaXCT模型达到96.91%的准确率和96.8%的宏F1分数，在准确率上比CNN、注意力和单时间尺度Mamba基线高出3.11-10.31个百分点。结果表明，将掩码自监督与NL风格的快/慢学习动态相结合，是Nomex蜂窝XCT检测中鲁棒缺陷分类的一种有前景的策略。

英文摘要

X-ray computed tomography (XCT) is widely used for non-destructive testing of Nomex honeycomb structures in aerospace manufacturing, but industrial inspection still relies heavily on manual interpretation and supervised models trained on limited labeled data. This work introduces NL-MambaXCT, a Mamba-based framework that combines self-supervised masked image modelling with a Nested Learning (NL) formulation for automated, label-efficient defect classification from production XCT slices. The backbone is a four-stage 2D encoder with RegNet convolutional blocks in the early stages and Mamba-based sequence mixing with attention in the deeper stages. It is pretrained by masked image modelling on 19,961 unlabeled industrial XCT slices and fine-tuned on 2,000 relabeled Nomex XCT slices split by production order. NL is instantiated through two-timescale parameter dynamics: selected projections maintain slow exponential-moving-average traces alongside fast weights, while a deep-momentum optimizer introduces an additional slow parameter-update trajectory. On the held-out test set, the MIM-pretrained NL-MambaXCT model achieves 96.91% accuracy and 96.8% macro F1, outperforming CNN, attention, and single-timescale Mamba baselines by 3.11--10.31 percentage points in accuracy. The results suggest that combining masked self-supervision with NL-style fast/ slow learning dynamics is a promising strategy for robust defect classification in Nomex honeycomb XCT inspection.

URL PDF HTML ☆

赞 0 踩 0

2605.30253 2026-06-03 stat.ML cs.LG math.FA math.OC math.PR stat.CO

Wasserstein Contraction of Coordinate Ascent Variational Inference

坐标上升变分推断的Wasserstein收缩

Rocco Caprio, Adrien Corenflos, Sam Power

发表机构 * Department of Statistics, University of Warwick（沃里克大学统计系）； School of Mathematics, University of Bristol（布里斯托大学数学学院）

AI总结研究坐标上升变分推断算法在Wasserstein距离下的收缩性，通过不动点处的传输-信息不等式和函数光滑性条件给出局部收敛保证，并应用于贝叶斯高斯混合模型、高维贝叶斯Probit回归及Pólya-Gamma逻辑回归。

Comments 17 pages + 3 pages appendix, 3 figures. V2 fixes some citations not displaying properly in the appendix. No content change compared to prior version

2605.30166 2026-06-03 cs.SI cs.LG

SAHG: Sector-Anisotropic Hyperbolic Graph Model for Social Bot Detection

SAHG：用于社交机器人检测的扇区各向异性双曲图模型

Hanning Lu, Yingguang Yang, Jinwei Su, Yang Liu, Zhaoqian Yao, Yaoming Li, Taoran Liang, Ziyi Zhang, Ran Ran, Kefu Xu, Bin Chong

发表机构 * University of Leeds（利兹大学）； University of Science and Technology of China（中国科学技术大学）； South China Normal University（华南师范大学）； Tsinghua University（清华大学）； The Chinese University of Hong Kong（香港中文大学）； Harbin University of Commerce（哈尔滨商业大学）； Beijing University of Posts and Telecommunications（北京邮电大学）； Peking University（北京大学）； University of California, Berkeley（加州大学伯克利分校）

AI总结提出扇区各向异性双曲图模型SAHG，通过方向依赖曲率场和扇区原型解决欧几里得GNN在层次无标度社交图中的失真问题以及异质连接导致的信号污染问题，在三个基准上取得最佳性能。

详情

AI中文摘要

LLM驱动的社交机器人能生成流畅类人文本，降低了纯内容检测的判别优势。然而，协调活动仍留下关系模式——交互、行为相似性、共享邻居、社区位置和协调活动——图方法可利用这些模式。现有图检测器在利用此类证据时面临两个挑战。首先，欧几里得GNN扭曲了层次和无标度社交图；虽然双曲几何解决了这种体积增长不匹配，但固定曲率模型仍对不同密度和分离需求的结构方向分配均匀的几何分辨率。其次，关系证据并不总是可靠：复杂机器人与真实用户伪造异质连接，导致邻域聚合混合机器人和人类信号，稀释账户级证据。我们提出SAHG（扇区各向异性双曲图），解决这两个挑战。SAHG学习方向依赖的曲率场γ(u)，适应结构方向上的几何分辨率，并使用扇区原型将角度集中和对齐转换为分类器可读特征。为防止受污染的聚合淹没账户级证据，SAHG在两个独立的SAH通道中编码每个账户特征和图邻域表示，仅在分类器处融合。在Fox8-23、BotSim-24和MGTAB上的实验表明，SAHG在所有三个基准上实现了最高准确率和F1，优于基于特征、基于图、基于LLM和各向同性双曲基线。消融和几何分析证实了各向异性几何和双通道设计的有效性。

英文摘要

LLM-driven social bots can generate fluent, human-like text, reducing the discriminative advantage of content-based detection alone. However, coordinated campaigns still leave relational patterns -- interactions, behavioral similarity, shared neighborhoods, community positions, and coordinated activity -- that graph-based methods can exploit. Existing graph detectors face two challenges when exploiting such evidence. First, Euclidean GNNs distort hierarchical and scale-free social graphs; while hyperbolic geometry addresses this volume-growth mismatch, fixed-curvature models still assign uniform geometric resolution to structural directions with different densities and separation needs. Second, relational evidence is not always reliable: sophisticated bots forge heterophilic connections with genuine users, causing neighborhood aggregation to mix bot and human signals and dilute account-level evidence. We propose SAHG (Sector-Anisotropic Hyperbolic Graph), addressing both challenges. SAHG learns a direction-dependent curvature field $γ(u)$ that adapts geometric resolution across structural directions, and uses sector prototypes to convert angular concentration and alignment into classifier-readable features. To prevent contaminated aggregation from overwhelming account-level evidence, SAHG encodes per-account features and graph-neighborhood representations in two independent SAH channels, fusing them only at the classifier. Experiments on Fox8-23, BotSim-24, and MGTAB show that SAHG achieves the highest accuracy and F1 on all three benchmarks, outperforming feature-based, graph-based, LLM-based, and isotropic hyperbolic baselines. Ablation and geometric analyses confirm the effectiveness of the anisotropic geometry and dual-channel design.

URL PDF HTML ☆

赞 0 踩 0

2605.12925 2026-06-03 cs.SE cs.AI

AgentLens: Revealing The Lucky Pass Problem in SWE-Agent Evaluation

AgentLens: 揭示 SWE-Agent 评估中的幸运通过问题

Priyam Sahoo, Gaurav Mittal, Xiaomin Li, Shengjie Ma, Benjamin Steenhoek, Pingping Lin, Yu Hu

发表机构 * University of Illinois, Urbana-Champaign（伊利诺伊大学厄巴纳-香槟分校）； Microsoft（微软）

AI总结针对软件工程智能体评估中仅依赖最终补丁是否通过测试的二元信号问题，提出AgentLens框架进行过程级评估，通过构建前缀树接受器参考和上下文敏感意图标注器，识别出10.7%的通过轨迹存在“幸运通过”行为，并基于质量分数将轨迹分为幸运、扎实和理想三个等级。

详情

AI中文摘要

以下是更新后的摘要：软件工程（SWE）智能体的评估主要依赖一个二元信号：最终补丁是否通过测试。这种仅关注结果的观点将原则性解决方案与混乱的试错过程视为等价。我们证明这种等价性在经验上是错误的。我们在60个SWE-bench验证任务上评估了来自八个模型后端的2,614条OpenHands轨迹。其中，47个任务有足够多的通过轨迹来构建任务级过程参考，从而得到一个包含1,815条轨迹的评估子集。在该子集的通过轨迹中，10.7%表现出我们称之为“幸运通过”的行为：回归循环、盲目重试、缺少验证，或探索、实现和验证在时间上无序。我们引入AgentLens，一个用于SWE智能体轨迹过程级评估的框架，并定义AgentLens-Bench，一个包含1,815条轨迹的数据集，这些轨迹标注有质量分数、浪费信号、分歧点以及47个任务级前缀树接受器（PTA）参考。AgentLens通过合并同一任务的多个通过解决方案来构建PTA参考，并使用上下文敏感的意图标注器，基于轨迹历史而非仅工具身份将动作分配给探索、实现、验证或编排。在AgentLens-Bench上，质量分数将通过轨迹分为幸运、扎实和理想三个等级，并进一步将幸运通过分解为五种重复出现的机制。在八个模型后端中，幸运率从0.5%到23.2%不等，当按质量分数而非通过率排序时，一些模型的排名变动多达五位。我们计划很快发布项目仓库，包括AgentLens-Bench工件、AgentLens SDK和分析工具。

英文摘要

Evaluation of software engineering (SWE) agents is dominated by a binary signal: whether the final patch passes the tests. This outcome-only view treats a principled solution and a chaotic trial-and-error process as equivalent. We show that this equivalence is empirically false. We evaluate 2,614 OpenHands trajectories from eight model backends on 60 SWE-bench Verified tasks. Of these, 47 have enough passing trajectories to construct task-level process references, yielding a 1,815-trajectory evaluation subset. Among passing trajectories in this subset, 10.7% exhibit behavior we call a Lucky Pass: regression cycles, blind retries, missing verification, or temporally disordered exploration, implementation, and verification. We introduce AgentLens, a framework for process-level assessment of SWE-agent trajectories, and define AgentLens-Bench, a dataset of 1,815 trajectories annotated with quality scores, waste signals, divergence points, and 47 task-level Prefix Tree Acceptor (PTA) references. AgentLens builds PTA references by merging multiple passing solutions for the same task, and uses a context-sensitive intent labeler to assign actions to Exploration, Implementation, Verification, or Orchestration based on trajectory history rather than tool identity alone. On AgentLens-Bench, the quality score separates passing trajectories into Lucky, Solid, and Ideal tiers and further decomposes Lucky Passes into five recurring mechanisms. Across the eight model backends, Lucky rates range from 0.5% to 23.2%, and some models move by as many as five rank positions when ranked by quality score instead of pass rate. We plan to release the project repository soon, including AgentLens-Bench artifacts, the AgentLens SDK, and the analysis tooling.

URL PDF HTML ☆

赞 0 踩 0

2605.24391 2026-06-03 cs.AR cs.AI

MX-SAFE: Versatile Inference- and Training-Proof Microscaling Format with On-the-Fly Exponent and Mantissa Bit Allocation

MX-SAFE：具有即时指数和尾数位分配的多功能推理与训练验证微缩放格式

Dahoon Park, Jahyun Koo, Sangwoo Hwang, Jaeha Kung

发表机构 * Institute of Information & Communications Technology Planning & Evaluation (IITP)（信息与通信技术规划与评估院）； Korea government (MSIT)（韩国政府）； National Research Foundation of Korea (NRF)（韩国国家研究基金会）； Ministry of Science and ICT（科学技术信息通信部）； IC Design Education Center (IDEC)（集成电路设计教育中心）

AI总结提出一种名为MX-SAFE的微缩放格式，通过自适应切换宽尾数模式和亚正规FP模式，同时支持训练和直接推理，并采用基于瓦片的块设计提高硬件效率，在推理和训练中相比MXFP8 E2M5和MXFP8 E4M3分别平均提升0.05%/11.1%和3.55%/3.57%的准确率，且能耗降低24.9%。

Comments Accepted to DATE 2026 (7 pages, 7 figures). Typo updates for Fig. 3 and Table 4, 5 are reflected

详情

AI中文摘要

随着深度学习需求的增长，通过量化降低训练和推理成本变得至关重要。2022年，开放计算项目（OCP）联盟标准化了用于深度学习的窄精度格式，称为微缩放（MX）格式。MX格式是一种硬件友好的动态量化方案，通过在多个操作数之间共享8位指数来有效减小数据大小。MX格式可分为两类，各有优势：（i）MXINT，仅由尾数位组成，注重高精度；（ii）MXFP，通过允许局部指数位来提供更宽的动态范围。本文提出了一种多功能的MXFP格式，称为MX-SAFE（简称MXSF），它自适应地使用两种模式，即宽尾数模式（FP8 E2M5）和亚正规FP模式（FP5 E3M2），以支持训练和直接推理。此外，我们提出了一种基于瓦片的块设计，通过减少使用MXSF格式训练期间重量化过程的负担来提高硬件效率。由于采用了所提出的MXSF格式，与MXFP8 E2M5和MXFP8 E4M3相比，推理/全训练的平均准确率分别提高了0.05%/11.1%和3.55%/3.57%。此外，我们提出了一种支持MXSF格式的训练推理加速器，在实现与BF16基线相似准确率的同时，总能耗降低了24.9%。

英文摘要

As the demand for deep learning grows, cost reduction through quantization has become essential for both training and inference. In 2022, the Open Compute Project (OCP) consortium standardized narrow precision formats for deep learning, called the microscaling (MX) format. The MX format is a hardware-friendly dynamic quantization scheme that effectively reduces the data size by sharing an 8-bit exponent across multiple operands. The MX format can be categorized into two types with their own strengths: (i) MXINT which focuses on a high precision consisting only of mantissa bits and (ii) MXFP which focuses on a wider dynamic range by allowing local exponent bits. In this work, we present a versatile MXFP format, called MX-SAFE (MXSF in short), that adaptively uses two modes, i.e., a wider mantissa mode (FP8 E2M5) and a subnormal FP mode (FP5 E3M2), to support both training and direct-cast inference. Furthermore, we propose a tile-based block design to increase hardware efficiency by reducing the burden of re-quantization process during the training with the MXSF format. Owing to the use of the proposed MXSF format, 0.05%/11.1% and 3.55%/3.57% improvements in accuracy, on average, for inference/full-training compared to MXFP8 E2M5 and MXFP8 E4M3 are observed, respectively. Moreover, we present a training-inference accelerator that supports the MXSF format and it achieves similar accuracy to the BF16 baseline while using 24.9% less total energy consumption.

URL PDF HTML ☆

赞 0 踩 0

2601.00990 2026-06-03 eess.IV cs.CV

Uncertainty-Calibrated Explainable Artificial Intelligence for Fetal Ultrasound Plane Classification: A Systematic Review

不确定性校准的可解释人工智能用于胎儿超声平面分类：系统综述

Gustav Olaf Yunus Laitinen-Fredriksson Lundström-Imanov, Ozkan Gunalp

发表机构 * Department of Life Sciences and Medicine, Faculty of Science, Technology and Medicine, University of Luxembourg（卢森堡大学生命科学与医学系，科学、技术与医学学院）； Department of Biostatistics and Medical Informatics, Institute of Health Sciences, Ege University（伊兹密尔大学健康科学学院生物统计学与医学信息学系）

AI总结通过系统综述78项研究，提出CALIB-XFUS框架，强调校准、解释忠实性和公平性，以满足监管要求。

Comments 12 pages, 5 figures, 1 table, 75 references; systematic review (PRISMA 2020); manuscript prepared for submission to The Lancet Digital Health (Reviews section)

详情

AI中文摘要

胎儿超声是产前护理的基石，准确识别一小组标准解剖平面支撑着生物测量、生长监测和结构异常检测。深度学习分类器现在在精心策划的基准上达到或超过专家准确性，但大多数仍然不透明且校准不良，使临床医生缺乏安全决策支持所需的校准置信度或忠实解释。我们按照PRISMA 2020系统综述了2015年1月1日至2026年4月30日期间发表的78项研究，这些研究将自动胎儿平面分类与可解释性或预测不确定性量化相结合。六个标准平面的合并平衡准确率为0.93（95% CI 0.91至0.95），但只有19项研究（24%）报告了校准，14项（18%）报告了选择性预测。我们提出了CALIB-XFUS，一个22项报告框架，将校准、解释忠实性和公平性操作化，用于受监管的胎儿超声人工智能。该框架涵盖六个领域：临床任务和使用指征；数据集来源和代表性；模型和训练流程；校准和选择性预测；解释忠实性和临床医生验证；以及上市后监测。我们认为，根据FDA良好机器学习实践原则和欧盟AI法案高风险义务，不确定性校准、忠实解释和公平审计的胎儿超声人工智能现在在技术上可行且在监管上被期望。

英文摘要

Fetal ultrasound is the cornerstone of antenatal care, and accurate recognition of a small set of standard anatomical planes underpins biometry, growth surveillance, and detection of structural anomalies. Deep learning classifiers now match or exceed expert accuracy on curated benchmarks, but most remain opaque and miscalibrated, leaving clinicians without the calibrated confidence or faithful explanations needed for safe decision support. We systematically reviewed 78 studies published between January 1, 2015 and April 30, 2026 that paired automated fetal plane classification with explainability or predictive uncertainty quantification, following PRISMA 2020. Pooled balanced accuracy across six standard planes was 0.93 (95% CI 0.91 to 0.95), but only 19 studies (24%) reported calibration and 14 (18%) reported selective prediction. We propose CALIB-XFUS, a 22-item reporting framework that operationalises calibration, explanation faithfulness, and fairness for regulated fetal ultrasound artificial intelligence. The framework spans six domains: clinical task and indication for use; dataset provenance and representativeness; model and training pipeline; calibration and selective prediction; explanation faithfulness and clinician validation; and post-market surveillance. We argue that uncertainty-calibrated, faithfully explained, and fairness-audited fetal ultrasound AI is now both technically feasible and regulatorily expected under the FDA Good Machine Learning Practice principles and the EU AI Act high-risk obligations.

URL PDF HTML ☆

赞 0 踩 0

2605.18106 2026-06-03 math.OC cs.AI cs.LG stat.ML

Symmetry-Compatible Principle for Optimizer Design: Embeddings, LM Heads, SwiGLU MLPs, and MoE Routers

优化器设计的对称性兼容原理：嵌入、LM头、SwiGLU MLP和MoE路由器

Tim Tsz-Kit Lau, Weijie Su

发表机构 * University of Pennsylvania（宾夕法尼亚大学）； Wharton School（沃顿商学院）

AI总结针对现代神经网络参数空间的对称性与坐标级优化器之间的几何不匹配，提出对称性兼容的优化器设计原则，并针对嵌入矩阵、LM头、SwiGLU MLP投影和MoE路由器等特殊参数块导出相应更新规则，实验证明其改善验证损失、负载平衡和训练稳定性。

详情

AI中文摘要

深度学习实践中长期存在一种显著的几何差异。现代神经网络架构自然展现出丰富的对称性和等变性，而流行的优化器如Adam及其变体本质上是坐标级的，无法尊重参数空间的等变结构。我们通过引入优化器设计的对称性兼容原则来解决这一差异：梯度更新规则应在作用于相应权重块的对称群下等变。遵循这一原则，我们首先为一般矩阵层提供了双正交等变更新的统一视角，如随机谱下降、Muon、Scion和极梯度方法所采用的。更重要的是，通过从正交群转向置换和共享移位对称性，我们为参数块（其对称性与一般矩阵层不同）推导了对称性兼容的优化器：嵌入和LM头矩阵、SwiGLU MLP投影以及MoE路由器矩阵。这些构造包括单边谱、行范数、混合行范数/谱、行感知、列感知、中心行范数和左谱更新。它们产生了一个端到端的逐层优化器堆栈，其中每个主要的矩阵值参数类被分配一个更新，其等变性与其对称群匹配。我们通过在密集和稀疏MoE语言模型上的预训练实验验证了这一原则，包括Qwen3-0.6B风格、Gemma 3 1B风格、OLMoE-1B-7B风格和缩小版gpt-oss架构。在这些实验中，对称性兼容的更新规则一致地改善了最终验证损失，减少了稀疏MoE模型中的负载不平衡，并在若干情况下比相应的AdamW更新提高了训练稳定性。

英文摘要

A striking geometric disparity has long persisted in the practice of deep learning. While modern neural network architectures naturally exhibit rich symmetry and equivariance properties, popular optimizers such as Adam and its variants operate inherently coordinate-wise, rendering them unable to respect the equivariance structures of the parameter space. We address this disparity by introducing a symmetry-compatible principle for optimizer design: the gradient update rule should be equivariant under the symmetry group acting on the corresponding weight block. Following this principle, we first provide a unified perspective on bi-orthogonally equivariant updates for general matrix layers, as employed by stochastic spectral descent, Muon, Scion, and polar gradient methods. More importantly, by moving from orthogonal groups to permutation and shared-shift symmetries, we derive symmetry-compatible optimizers for parameter blocks whose symmetries differ from those of general matrix layers: embedding and LM head matrices, SwiGLU MLP projections, and MoE router matrices. These constructions include one-sided spectral, row-norm, hybrid row-norm/spectral, row-aware, column-aware, centered row-norm, and left-spectral updates. They yield an end-to-end layerwise optimizer stack in which each major matrix-valued parameter class is assigned an update whose equivariance matches its symmetry group. We corroborate this principle through pre-training experiments on dense and sparse MoE language models, including Qwen3-0.6B-style, Gemma 3 1B-style, OLMoE-1B-7B-style, and downsized gpt-oss architectures. Across these experiments, symmetry-compatible update rules consistently improve final validation loss, reduce load imbalance in sparse MoE models, and in several cases improve training stability over the corresponding AdamW updates.

URL PDF HTML ☆

赞 0 踩 0

2605.17219 2026-06-03 cs.CR cs.AI cs.LG cs.NI eess.SP

Integration of AI in Cybersecurity: Current Trends with a Focused Look at Intrusion Detection Applications

AI在网络安全中的集成：当前趋势及入侵检测应用的聚焦分析

S. Tazili, A. Mansour, M. Y. Chkouri

发表机构 * SIGL Laboratory, ENSATE, Abdelmalek Essaâdi University, Tetouan, Morocco（SIGL实验室、ENSATE、阿卜杜勒马利克·埃萨迪大学、突塔努安、摩洛哥）

AI总结本文综述了当前基于AI的网络安全趋势，重点分析入侵检测方法，通过比较不同AI技术和性能指标揭示有意义见解。

Comments Accepted at AI2SD 2025. Forthcoming in Springer Lecture Notes in Networks and Systems (2026). Please cite this preprint as indicated in the paper!

详情

Journal ref: https://conferences.academyskills.net/ai2sd/2025/PapersManagement/all.php#:~:text=643174

AI中文摘要

人工智能（AI）如今被广泛采用，因其能够检测模式、自动化任务并减少各种应用中的时间和成本。AI与网络安全的整合引起了广泛关注，特别是在入侵检测、恶意软件分析以及钓鱼或垃圾邮件检测等领域。随着AI和网络安全的发展，新的方法和途径不断涌现。当前趋势包括使用生成式AI、自然语言处理、用于隐私保护协作训练的联邦学习以及可解释AI以确保可解释性和信任，这些在网络安全中至关重要。本文对当前基于AI的网络安全趋势进行了有趣的综述，重点聚焦入侵检测方法，旨在通过基于所采用的AI技术和报告性能的比较分析，揭示有意义的见解。

英文摘要

Artificial Intelligence (AI) is widely adopted today for its ability to detect patterns, automate tasks, and reduce time and cost across various applications. Its integration into Cybersecurity has garnered significant attention, particularly in areas such as intrusion detection, malware analysis, and phishing or spam detection. As AI and cybersecurity evolve, new methods and approaches emerge regularly. Current trends include the use of Generative AI, Natural Language Processing, Federated Learning for privacy-preserving collaborative training, and eXplainable AI to ensure interpretability and trust, which are vital in cybersecurity. This paper presents an interesting review of current AI-based cybersecurity trends, focusing on intrusion detection approaches and aiming to uncover meaningful insights through comparative analysis based on the employed AI techniques and reported performance.

URL PDF HTML ☆

赞 0 踩 0