arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 3406
2605.25058 2026-05-26 cs.HC cs.AI

Intent Signal Theory: A Computational Framework for Intent-State Control in Human-AI Interaction

意图信号理论:人机交互中意图状态控制的计算框架

Gang Peng

发表机构 * Huizhou Lateni AI Technology Co., Ltd.(惠州莱尼人工智能技术有限公司) Huizhou University(惠州大学)

AI总结 提出意图信号理论(IST),通过区分潜在源意图、可观测意图代理、编码载体和模型输出四个对象,形式化意图丢失定理,并基于六种大语言模型、三种语言和三个任务领域的实验验证了结构-保真度分裂等预测,将提示工程重新定义为意图协议设计。

Comments 10 pages, 2 figures. Theoretical framework paper grounded in four companion empirical studies. Data and code repository: https://github.com/PGlarry/prompt-protocol-specification

详情
AI中文摘要

当前的人工智能交互模型将提示视为主要的交换对象,忽略了一个关键层面:用户的潜在源意图,即提示之前并激发提示的目标状态。这里我们引入意图信号理论(IST),这是一个形式化这一缺失意图层的计算框架。IST区分了四个通常被混淆的对象:潜在源意图(I*)、可观测意图代理(I-hat)、编码载体(P)和模型输出(O)。它形式化了维度权重、编码掩码、结构和保真度恢复分数以及公私意图分解。不可逆意图丢失定理确立了:载体中缺失的私有意图无法通过通用替换恢复。来自四项配套研究的证据(涵盖六种大语言模型、三种语言和三个任务领域)显示了与IST预测一致的结构-保真度分裂、人类验证的度量分离以及权重容忍平台。IST将提示工程重新定义为意图协议设计,并识别了当前人工智能系统所缺乏的一个计算层面。

英文摘要

Current AI interaction models treat the prompt as the primary object of exchange, omitting a critical layer: the user's latent source intent, the goal state preceding and motivating the prompt. Here we introduce Intent Signal Theory (IST), a computational framework that formalises this missing intent layer. IST distinguishes four objects routinely conflated: latent source intent (I*), observable intent proxy (I-hat), encoded carrier (P), and model output (O). It formalises dimensional weights, encoding masks, structural and fidelity recovery scores, and public-private intent decomposition. The Theorem of Irreversible Intent Loss establishes that private intent absent from the carrier cannot be recovered beyond generic substitution. Evidence from four companion studies spanning six LLMs, three languages and three task domains shows structural-fidelity splits, human-validated metric dissociation, and weight-tolerance plateaus consistent with IST's predictions. IST reframes prompt engineering as intent-protocol design and identifies a computational layer that current AI systems lack.

2605.25057 2026-05-26 math.NA cs.LG cs.NA

Random Neural Network Expressivity for Non-Linear Partial Differential Equations

随机神经网络对非线性偏微分方程的表达能力

Muhammed Ali Mehmood, Lukas Gonon

发表机构 * Department of Mathematics(数学系) Imperial College London(帝国理工学院伦敦分校) School of Computer Science(计算机科学学院) University of St. Gallen(圣加尔登大学)

AI总结 研究随机生成隐藏权重的神经网络(RaNNs)对非线性偏微分方程解的逼近能力,推导了误差界并得到维数无关的逼近率1/2,应用于多孔介质方程和可压缩Navier-Stokes方程。

详情
AI中文摘要

随机生成隐藏权重的神经网络(RaNNs)已被广泛研究,既作为独立的机器学习方法,也作为全可训练深度学习方法的初始化。本文研究RaNNs在学习非线性偏微分方程(PDEs)解方面的表达能力。尽管在实际应用中广泛使用,但对此背景下RaNNs逼近性质的严格理论理解仍然有限。本文推导了RaNNs对时间依赖Sobolev函数的误差界,并对足够正则的函数获得了维数无关的逼近率$ rac{1}{2}$。我们将结果应用于两类重要的非线性PDEs:多孔介质方程和可压缩Navier-Stokes方程,表明RaNNs能够有效逼近这些复杂非线性PDEs的解。我们的理论分析得到了数值实验的支持,表明所获得的收敛速率超出了所考虑的设置。

英文摘要

Neural networks with randomly generated hidden weights (RaNNs) have been extensively studied, both as a standalone learning method and as an initialization for fully trainable deep learning methods. In this work, we study RaNN expressivity for learning solutions to non-linear partial differential equations (PDEs). Despite their widespread use in practical applications, a rigorous theoretical understanding of the approximation properties of RaNNs in this context remains limited. Here, we derive error bounds for RaNN approximations to time-dependent Sobolev functions and obtain a dimension-free approximation rate $\frac{1}{2}$ for sufficiently regular functions. We apply our results to two important classes of non-linear PDEs: Porous Medium Equations and Compressible Navier-Stokes Equations, showing that RaNNs are capable of efficiently approximating solutions to these complex, non-linear PDEs. Our theoretical analysis is supported by numerical experiments, showing that the obtained convergence rates extend beyond the considered setting.

2605.25050 2026-05-26 stat.AP cs.LG q-bio.QM stat.ML

Multimodality Stacking with Blockwise missing values and application to the PIONeeR biomarkers study for prediction of resistance to immunotherapy

具有分块缺失值的多模态堆叠及其在预测免疫治疗耐药性的PIONeeR生物标志物研究中的应用

Mohamed Boussena, Florence Monville, Jacques Fieschi-Meric, Frederic Vely, Pierre Milpied, Julien Mazieres, Maurice Perol, Eric Vivier, Laurent Greillier, Fabrice Barlesi, Sebastien Benzekry

发表机构 * Inria – Inserm team COMPO, COMPutational pharmacology and clinical Oncology, Centre Inria Sophia Antipolis - Méditerranée, Centre de Recherches en Cancérologie de Marseille, Inserm U1068, CNRS UMR7258, Institut Paoli-Calmettes, Pharmacy faculty, Aix-Marseille University(Inria - Inserm COMPO团队,计算药理学和临床肿瘤学,Inria Sophia Antipolis -地中海, Marseille癌症研究中心,Inserm U1068,CNRS UMR7258,Paoli-Calmettes研究所,药学系,Aix-Marseille大学) Veracyte SAS, Marseille, France(Veracyte SAS,法国马赛) Assistance Publique-Hôpitaux de Marseille (APHM), Marseille, France(马赛公共医院(APHM),法国马赛) Toulouse University Hospital, Toulouse, France(图卢兹大学医院,法国图卢兹) Centre Leon Berard, Lyon, France(Leon Berard中心,法国里昂) Innate Pharma, Marseille, France(Innate Pharma,法国马赛) Université Paris Saclay, Gustave Roussy, Inserm, Prédicteurs Moléculaires et nouvelles cibles en oncologie (U981), F-94805, Villejuif, France(巴黎萨克雷大学,Gustave Roussy,Inserm,分子预测与肿瘤学新靶点(U981),法国维尔若,F-94805)

AI总结 提出多模态堆叠框架MSB,通过独立建模各模态特征并利用交叉验证堆叠元学习器聚合预测,解决高维和分块缺失问题,在PIONeeR研究中预测非小细胞肺癌免疫治疗无进展生存期,性能优于基线算法。

详情
AI中文摘要

在临床肿瘤学中,整合多模态数据集常受到高维性和分块缺失的阻碍,即特定患者子集无法获得完整数据源。标准生存模型通常难以处理这些缺失,导致结果偏倚或患者排除。我们提出具有分块缺失值的多模态堆叠(MSB),一种用于生存分析的晚期融合框架,它独立建模模态特定特征,然后通过交叉验证的堆叠元学习器聚合预测。MSB在PIONeeR研究(n=443名患者,来自八个异质来源的378个生物标志物)中进行了验证,以预测接受免疫治疗的晚期非小细胞肺癌患者的无进展生存期。MSB产生了比基线算法更高的预测性能(C-index)。改进幅度因基线强度而异:线性模型提高了15.9%(Wilcoxon符号秩检验p<0.001),随机生存森林提高了5.4%(p=0.002),梯度提升方法提高了2.1%(p=0.030)。除了区分能力外,MSB还缩小了泛化差距(5折交叉验证重复3次的训练-测试差异:0.055 vs 线性模型的0.380)。置换重要性分析确定了常规实验室标志物、临床特征和PD-L1表达为主要预测驱动因素。缺失块指示器的重要性可忽略,表明模型从生物标志物值而非数据可用性模式中学习。MSB为具有分块缺失的多模态生存预测提供了一个统计验证的框架。通过无需完整数据即可进行系统性生物标志物评估,MSB为生物医学研究中的预测建模提供了实用工具,有待外部验证。实现代码可在https://github.com/MohamedBoussena/MSB 根据Inria许可证获取。

英文摘要

Integrating multimodal datasets in clinical oncology is frequently hindered by high dimensionality and blockwise missingness, where entire data sources are unavailable for specific patient subsets. Standard survival models often struggle with these gaps, leading to biased results or patient exclusion. We introduce Multimodality Stacking with Blockwise missing values (MSB), a late-fusion framework for survival analysis that independently models modality-specific features before aggregating predictions via a cross-validated stacking meta-learner. MSB was validated on the PIONeeR study (n=443 patients, 378 biomarkers across eight heterogeneous sources) to predict progression-free survival in advanced non-small cell lung cancer patients receiving immunotherapy. MSB yielded higher predictive performance (C-index) than baseline algorithms. Improvements varied by baseline strength: linear models showed a 15.9% increase (p<0.001 for the Wilcoxon signed-rank test), random survival forests gained 5.4% (p=0.002), and gradient boosting methods improved by 2.1% (p=0.030). Beyond discrimination, MSB reduced the generalization gap (train-test difference in 5 folds cross-validation repeated 3 times: 0.055 vs 0.380 for linear models). Permutation importance analysis identified routine laboratory markers, clinical features, and PD-L1 expression as primary predictive drivers. Missing block indicators showed negligible importance, suggesting the model learned from biomarker values rather than data availability patterns. MSB provides a statistically validated framework for multimodal survival prediction with blockwise missingness. By enabling systematic biomarker evaluation without requiring complete data, MSB offers a practical tool for predictive modeling in biomedical research, pending external validation. Implementation is available at https://github.com/MohamedBoussena/MSB under Inria license.

2605.24999 2026-05-26 q-bio.NC cs.AI cs.MA

Interpretation, Learning, and Empathy as One Constraint: A Residual-Adequacy Architecture with Accountable Abstention

解释、学习与共情作为单一约束:具有可问责弃权的残差充分性架构

Chainarong Amornbunchornvej

发表机构 * National Electronics and Computer Technology Center (NECTEC)(国家电子与计算机技术中心)

AI总结 提出一种认知架构,通过单一残差量统一处理解释、学习和共情,当情境超出表征能力时产生带类型和见证的弃权。

Comments First draft for journal submission. The code is at https://github.com/DarkEyes/RC-Arch

详情
AI中文摘要

一个智能体必须对当前情境采取行动,学习它尚无法表征的内容,并充分建模其他智能体以进行协调。这些能力通常由独立的机制实现,但它们共享一种失败模式:情境可能超出智能体当前能表征的范围,此时诚实的回应是原则性的拒绝,并说明缺失了什么。我们开发了一个小型认知架构,其中这些限制源于单一量。一个解释-决策单元(IDU)通过一组体制(具有私有基的局部表征框架)解释内容向量,并决定其许可哪些行动;内容相对于活跃体制表征范围的标量残差驱动该单元。低残差且许可清晰时发出行动;否则单元重新解释、尝试描述长度合理的扩展,或停止并给出带类型和见证的终止。我们证明该单元是总且确定性的:对于任何内容和固定配置,它在有限有界步数内停止,并带有唯一终止见证,因此弃权由构造携带其原因。通过绑定架构的开放参数而不改变其机制,相同的残差-范围约束在三个范围上恢复了三个有记录的现象:不知的类型学(类型化弃权);智能体之间的强制误解,局限于一个共享概念且对犯错的智能体不可见(有界共情);以及学习中的先决条件依赖,源于有界关注窗口而非假设(发展先决条件)。每个实例化都针对自然智能体和人工智能体进行了阐述,并提出了可证伪的预测,因此一个约束可以模拟人类和机器认知中的限制。该工作提供了一种统一和一种可问责弃权的概念,通过构造带有类型和见证。

英文摘要

An agent must act on the situation before it, learn what it cannot yet represent, and model other agents well enough to coordinate. These faculties are usually realized by separate mechanisms, yet they share a failure mode: the situation can exceed what the agent can currently represent, and the honest response is then a principled refusal that says what was missing. We develop a small cognitive architecture in which these limits arise from a single quantity. An Interpretation-Decision Unit (IDU) interprets a content vector through a family of regimes - local representational frames with private bases - and decides which actions it licenses; a scalar residual of the content against the active regimes' representational scope drives the unit. Low residual with a clean licensing emits an action; otherwise the unit re-interprets, attempts a description-length-justified expansion, or halts with a typed, witnessed terminal. We prove the unit is total and deterministic: for any content and fixed configuration it halts in finitely many bounded-cost steps with a unique terminal witness, so abstention carries its cause by construction. By binding the architecture's open parameters without changing its mechanics, the same residual-against-scope constraint recovers three documented phenomena at three scopes: the typology of not-knowing (typed abstention); a forced misunderstanding between agents, localized to one shared concept and invisible to the agent committing it (bounded empathy); and prerequisite dependence in learning derived from a bounded focus window rather than posited (developmental prerequisites). Each instantiation is worked for a natural and an artificial agent and states a falsifiable prediction, so one constraint can model limits in both human and machine cognition. The account contributes a unification and a notion of accountable abstention, typed and witnessed by construction.

2605.24992 2026-05-26 cs.NI cs.AI cs.LG cs.MA

Scaling up Energy-Aware Multi-Agent Reinforcement Learning for Mission-Oriented Drone Networks with Individual Reward

面向任务驱动无人机网络的能量感知多智能体强化学习扩展与个体奖励

Changling Li, Ying Li

发表机构 * Department of Computer Science, ETH Zurich(苏黎世联邦理工学院计算机科学系) Department of Computer Science, Colby College(科尔比学院计算机科学系)

AI总结 提出基于个体奖励函数的能量感知多智能体强化学习模型,利用深度Q网络解决无人机网络动态环境和电池容量限制下的轨迹规划问题,实验表明在任务密度高时成功率接近100%,且扩展性优于共享奖励模型。

Comments IEEE Internet of Things Journal

详情
Journal ref
volume=12, number=8, year=2025, pages=10640-10654
AI中文摘要

多智能体强化学习(MARL)因其通过交互学习的能力,在自动驾驶和智慧城市等协作系统中显示出广泛适用性。随着无人机网络的最新发展,研究人员也应用MARL来解决轨迹规划问题。然而,动态环境和有限的电池容量仍然是使用MARL实现高效协作任务执行的挑战。在本文中,我们提出了一种能量感知的MARL模型作为应对这些挑战的尝试,利用深度Q网络(DQN)和由任务执行进度及无人机剩余电量驱动的个体奖励函数。我们对所提出的模型进行了一系列仿真研究,并将其与共享奖励MARL进行比较,以探索MARL中信用分配的影响。结果表明,无论任务位置和长度如何,我们提出的模型都能达到至少80%的成功率。与共享奖励模式类似,个体奖励模式在任务密度高时可以获得更好的成功率,并且当任务密度接近40%时,几乎可以达到100%的成功率。我们提出的个体奖励模型的真正优势在环境扩展时得以显现。与共享奖励MARL的比较表明,我们提出的模型对环境大小和智能体数量的变化更加鲁棒。由于目标的清晰性,它可以用更少的步骤实现更高的成功率,从而更好地提高能源效率。

英文摘要

Multi-agent reinforcement learning (MARL) has shown wide applicability in collaborative systems such as autonomous driving and smart cities for its ability of learning through interaction. With the recent development of drone networks, researchers have also applied MARL to address the trajectory planning problems. However, the dynamic environment and the limited battery capacity are still challenging for using MARL to achieve efficient collaborative task execution. In this paper, we propose an energy-aware MARL model as an attempt to tackle these challenges, leveraging Deep Q-Networks (DQN) with \emph{individual reward functions} driven by the task execution progress and the remaining battery of drones. We conduct a set of simulation studies for the proposed mode and compare it with the shared reward MARL~\cite{Li2022MARL} to explore the impact of credit assignment in MARL. The results indicate that our proposed model can achieve at least 80\% success rate regardless of the task locations and lengths. Similar to the shared reward mode, the individual reward mode can achieve a better success rate when the task density is high, and it can hit nearly a 100\% success rate when task density gets close to 40\%. The true advantage of our proposed model with individual reward is revealed when scaling up the environment. The comparison to the shared reward MARL shows that the our proposed model is more robust towards the change of the environment size and agent numbers. It can achieve higher success rate with fewer steps due to the clarity of the goal which improves energy efficiency even better.

2605.24986 2026-05-26 cs.IR cs.LG

Self-Balancing Gradient Allocation for Heterogeneity-Aware Feature Generation in Click-Through Rate Prediction

点击率预测中面向异构感知特征生成的自平衡梯度分配

Moyu Zhang, Yun Chen, Yujun Jin, Jinxin Hu, Yu Zhang, Xiaoyi Zeng

发表机构 * Alibaba Group(阿里巴巴集团)

AI总结 针对生成式CTR方法中重建目标忽略特征场异构性导致难场欠拟合的问题,提出HeteGenCTR,通过可学习的场难度参数联合训练去噪网络,实现自平衡损失和难度引导注意力机制,在五个基准和在线A/B测试中取得显著提升。

Comments 12 pages, 5 figures, 4 tables

详情
AI中文摘要

通过离散扩散的生成式预训练在所有特征场上同时提供密集的重建监督,缓解了CTR预测中数据稀疏导致的表示崩溃。然而,所有现有的生成式CTR方法都有一个根本限制:重建目标对每个特征场赋予相同的训练权重,忽略了高基数ID字段、稀疏分类属性、数值和行为序列之间重建难度的深刻异质性。这导致容易的场主导训练梯度,而最难但信息最丰富的场长期欠拟合,我们将这个问题称为生成难度不平衡。我们提出HeteGenCTR,通过每个场可学习的难度参数与去噪网络联合训练来解决这种不平衡。这个统一信号驱动两个协调组件,无需额外超参数:一个自平衡损失,自动将梯度预算重新分配给更难的场,具有可证明的稳定均衡;以及一个难度引导的注意力机制,抑制已经收敛的容易场的影响,同时放大向难场的跨场信息流。两个组件共享相同的学习信号,并在整个训练过程中保持相互一致。在五个CTR基准和一个为期七天的在线A/B测试中,实验表明相对于最先进的基线具有一致且统计显著的改进,对冷启动和长尾用户有不成比例的增益。

英文摘要

Generative pre-training via discrete diffusion provides dense reconstruction supervision across all feature fields simultaneously, mitigating representation collapse from data sparsity in CTR prediction. However, all existing generative CTR methods share a fundamental limitation: the reconstruction objective assigns equal training weight to every feature field, ignoring the profound heterogeneity of reconstruction difficulty across high-cardinality ID fields, sparse categorical attributes, numerical values, and behavioral sequences. This causes easy fields to dominate training gradients while the hardest but most informative fields remain chronically underfit, a problem we term the generative difficulty imbalance.We propose HeteGenCTR, which resolves this imbalance through per-field learnable difficulty parameters jointly trained with the denoising network. This unified signal drives two coordinated components without additional hyperparameters: a self-balancing loss that automatically reallocates gradient budget toward harder fields with a provably stable equilibrium, and a difficulty-guided attention mechanism that suppresses the influence of already-converged easy fields while amplifying cross-field information flow toward hard fields. Both components share the same learned signal and remain mutually consistent throughout training. Experiments on five CTR benchmarks and a seven-day online A/B test demonstrate consistent, statistically significant improvements over state-of-the-art baselines, with disproportionate gains for cold-start and long-tail users.

2605.24949 2026-05-26 cs.CR cs.AI

APT-Agent: Automated Penetration Testing using Large Language Models

APT-Agent:利用大语言模型的自动化渗透测试

William Guanting Li, Alsharif Abuadbba, Kristen Moore, Dan Dongseong Kim

发表机构 * University of Queensland(昆士兰大学)

AI总结 提出APT-Agent框架,通过混合修正模块和命令特定记忆架构解决大语言模型在渗透测试中的幻觉和长期记忆问题,在Metasploitable 2上实现84.29%的端到端利用成功率。

Comments 11 pages, 8 figures

详情
AI中文摘要

渗透测试对于保护现代网络基础设施至关重要,然而传统的手动方法难以跟上其规模和复杂性。大语言模型(LLMs)为自动化这些任务提供了新的机会,但现有方法面临两个持续挑战:技术实体的幻觉和长期上下文记忆不足。为了解决这些问题,我们提出了APT-Agent,一个完全自动化的LLM驱动的渗透测试框架,系统性地协调侦察、利用和数据窃取。APT-Agent引入了一个混合修正模块来恢复幻觉命令,以及一个命令特定的记忆架构来跨多步攻击序列保留操作上下文。我们在Metasploitable 2上针对涵盖Web、数据库和网络协议的七个脆弱服务评估了我们的APT-Agent。APT-Agent实现了84.29%的端到端利用成功率,而在匹配条件下,Script Kiddie和PentestGPT分别为48.57%和18.57%。通过减少认知负担和最小化对人类干预的依赖,APT-Agent代表了向可扩展、可靠且认知高效的渗透测试自动化迈出的一步。

英文摘要

Penetration testing is essential to securing modern web infrastructures, yet traditional manual methods struggle to keep pace with their scale and complexity. Large Language Models (LLMs) offer new opportunities for automating these tasks, but existing approaches face two persistent challenges: hallucination of technical entities and insufficient long-term contextual memory. To address these issues, we present APT-Agent, a fully automated LLM-driven penetration testing framework that systematically orchestrates reconnaissance, exploitation, and exfiltration. APT-Agent introduces a hybrid rectification module to recover hallucinated commands and a command-specific memory architecture to preserve operational context across multi-step attack sequences. We evaluate our APT-Agent on Metasploitable 2 against seven vulnerable services spanning web, database, and network protocols. APT-Agent achieves an 84.29% end-to-end exploitation success rate, compared to 48.57% (Script Kiddie) and 18.57% (PentestGPT) under matched conditions. By reducing cognitive burden and minimizing reliance on human intervention, APT-Agent represents a step toward scalable, reliable, and cognitively efficient automation for penetration testing.

2605.24941 2026-05-26 cs.CR cs.LG

Memory-Induced Tool-Drift in LLM Agents

LLM代理中的记忆诱导工具漂移

Mahavir Dabas, Jihyun Jeong, Ming Jin, Ruoxi Jia

发表机构 * Virginia Tech(弗吉尼亚理工大学)

AI总结 研究LLM代理中长期记忆存储的个性偏见(如成本意识、不耐烦等)在不适用情境下静默影响工具调用的问题,提出MEMDRIFT基准测试,发现偏置记忆导致工具参数偏离基线,且现有防御措施无法消除该现象。

详情
AI中文摘要

现代LLM代理将用于个性化的长期记忆与用于在现实世界中采取行动的工具调用接口相结合——这一组合支撑着当代生产系统。我们研究了这种组合的一个先前未被检查的失败:当存储在记忆中的个性驱动偏见(成本意识、不耐烦、风险承受能力等)在不适用情境下静默影响工具调用时。我们称此为记忆诱导工具漂移,并通过MEMDRIFT将其操作化,MEMDRIFT是一个包含105个场景的基准测试,涵盖五个偏见维度和七个专业领域,通过自动化对抗性流水线生成。在七个前沿模型(包括具有扩展推理能力的模型)中,偏置记忆使偏转分数(一种由评判者评分的参数偏离无偏基线的度量)在1-5分制上提高高达+3.6分。当记忆管理由三种生产记忆架构处理时,工具漂移持续存在。该现象影响现实世界的工具:扫描288个经过验证的MCP服务器上的6,062个工具,我们标记了608个具有易受影响参数的工具,并在一个经过验证的子集上确认了工具漂移。从机制上讲,偏置记忆充当隐式引导向量,将激活沿与显式行为指令相同的潜在方向推动。它们还将注意力从任务相关上下文重新分配到与目标参数具有表面关键词重叠的记忆条目上。标准防御——基于提示的相关性指令和记忆过滤器——减少了漂移但未能消除它。随着代理以用户名义采取越来越重要的行动,记忆诱导工具漂移代表了当前安全措施未能解决的一个系统性漏洞,这激发了在记忆管理和工具调用生成交叉点上的专用防御。

英文摘要

Modern LLM agents combine long-term memory for personalization with tool-calling interfaces for taking actions in the world -- a combination underpinning contemporary production systems. We study a previously unexamined failure of this combination: when personality-driven biases stored in memory (cost-consciousness, impatience, risk tolerance, etc.) silently affect tool calls in contexts where they are not applicable. We call this memory-induced tool-drift and operationalize it through MEMDRIFT, a benchmark of 105 scenarios spanning five bias dimensions and seven professional domains, generated through an automated adversarial pipeline. Across seven frontier models -- including those with extended reasoning -- biased memories raise deflection scores (a judge-scored measure of parameter deviation from unbiased baselines) by up to $+3.6$ points on a 1--5 scale. Tool-drift persists when memory management is handled by three production memory architectures. The phenomenon affects real-world tools: scanning 6{,}062 tools across 288 verified MCP servers, we flag 608 with susceptible parameters and confirm tool-drift on a validated subset. Mechanistically, biased memories act as implicit steering vectors, pushing activations along the same latent directions as explicit behavioral instructions. They also redistribute attention from task-relevant context toward memory entries with surface-level keyword overlap to the target parameter. Standard defenses -- prompt-based relevance instructions and memory filters -- reduce drift but do not eliminate it. As agents take increasingly consequential actions on a user's behalf, memory-induced tool-drift represents a systematic vulnerability that current safeguards do not address, motivating dedicated defenses at the intersection of memory management and tool-call generation.

2605.24938 2026-05-26 cs.IR cs.AI cs.CV

Your Embedding Model is SMARTer Than You Think

你的嵌入模型比你想象的更聪明

Jianrui Zhang, Hyun Jung Lee, Sukanta Ganguly, Tae-Eui Kam, Donghyun Kim, Yong Jae Lee

发表机构 * UW-Madison(威斯康星大学麦迪逊分校) Korea University(韩国大学) NetApp, Inc.(NetApp公司)

AI总结 提出SMART框架,通过利用标准单向量模型的隐式多向量能力,在推理时应用后期交互,无需额外训练即可提升多模态检索性能。

详情
AI中文摘要

多模态检索严重依赖单向量检索器,它将丰富的顺序令牌序列压缩为单个全局表示。虽然高效,但它们丢弃了密集检索任务所需的关键细粒度局部证据。多向量方法作为解决方案被引入,但严格需要训练,且许多忽略了全局总结表示的必要性。为解决这一问题,我们引入SMART,一个释放标准单向量模型潜在多向量能力的框架。我们首先证明,在池化嵌入上的标准对比训练通过梯度流隐式塑造了前序隐藏状态的检索几何结构。通过在推理时对这些冻结的隐藏状态应用直接后期交互,SMART作为一种即插即用的升级,持续提升跨多种模态的性能,甚至在MMEB-V2上进一步改进了最先进的模型。我们还揭示了SMART的优越性能,简单的轻量级后训练不仅节省时间和计算,还在视觉文档检索上带来进一步改进,使单向量模型能够超越最先进的多向量对应模型。最终,SMART为多模态检索提供了高效的推理增强和强大的微调技术。我们在https://github.com/HanSolo9682/SMART开源了代码和权重。

英文摘要

Multimodal retrieval relies heavily on single-vector retrievers, which compress rich, sequential token sequences into one single global representation. While efficient, they discard fine-grained, local evidence critical for dense retrieval tasks. Multi-vector approaches were introduced as a solution, but they strictly require training and many ignore the necessity of a globally summarizing representation. To address this, we introduce SMART, a framework that unlocks the latent multi-vector capabilities of standard single-vector models. We first demonstrate that standard contrastive training on the pooled embedding implicitly shapes the retrieval geometry of preceding hidden states via gradient flow. By applying direct late-interaction over these frozen hidden states during inference, SMART acts as a plug-and-play upgrade that consistently improves performance across diverse modalities, improving even the state-of-the-art models further on MMEB-V2. We also reveal SMART's superior performance, as simple lightweight post-training not only saves time and compute, but also brings forth further improvement on Visual Document retrieval, allowing a single-vector model to outperform SoTA multi-vector counterparts. Ultimately, SMART offers both a highly efficient inference enhancement and a powerful finetuning technique for multimodal retrieval. We open source our code and weights at https://github.com/HanSolo9682/SMART.

2605.24929 2026-05-26 stat.ML cs.IT cs.LG math.IT

Estimating Mixture Distributions via Stochastic Mirror Descent

通过随机镜像下降估计混合分布

Mohammadreza Ahmadypour, Tara Javidi, Farinaz Koushanfar

发表机构 * Department of Electrical and Computer Engineering(电气与计算机工程系) University of California San Diego(加州大学圣地亚哥分校)

AI总结 针对从样本中估计未知分布的问题,提出基于随机镜像下降(SMD)的混合模型估计器族,通过选择Bregman散度实现灵活估计,在大规模候选分量下保持高效,并在KL散度和ℓ2范数下达到近最优收敛率。

详情
AI中文摘要

我们重新审视了从样本中估计未知分布的经典问题,通过拟合最小化交叉熵损失的混合模型。将该任务视为在$M$分量混合分布空间上的随机凸优化问题,我们提出了一族源自随机镜像下降(SMD)算法的估计器。这种基于优化的方法提供了一个原则性且灵活的框架,它推广了传统估计器,并通过选择Bregman散度提出了多种新颖的估计器。我们方法的一个关键优势是它能够随着候选分量$f_i$的数量高效扩展;也就是说,可以在混合模型中使用大量基分布,而不会产生显著的计算开销。这使得能够实现更丰富的近似和改进的估计精度。此外,在类别分布(离散结果)的情况下,我们的估计器不需要严格的下界,换句话说,我们的框架不需要精确知道分布的支持集。我们证明,在温和条件下,所提出的$φ$-SMD估计器在Kullback-Leibler(KL)散度和$\ell_2$范数下均能达到近最优的收敛速率,并在计算昂贵时提供实际优势。我们的数值分析突出了相对于经典估计器在样本效率和可扩展性方面的改进性能保证。

英文摘要

We revisit the classical problem of estimating an unknown distribution from its samples by fitting a mixture model that minimizes cross-entropy loss. Framing the task as a stochastic convex optimization problem over the space of $ M $-component mixture distributions, we propose a family of estimators derived from the stochastic mirror descent (SMD) algorithm. This optimization-based approach provides a principled and flexible framework that generalizes traditional estimators and proposes a variety of novel estimators through the choice of Bregman divergences. A key advantage of our method is that it scales efficiently with the number of candidate components $ f_i $; that is, one can employ a large set of basis distributions in the mixture model without incurring significant computational overhead. This enables richer approximations and improved estimation accuracy. Moreover, in the case of categorical distribution (discrete outcomes) our estimators do not require a strict lower bound, in other words our framework does not require the precise knowledge of the support of the distribution. We demonstrate that, under mild conditions, the proposed $ φ$-SMD estimators achieve near-optimal convergence rates in both Kullback-Leibler (KL) divergence and $ \ell_2 $-norm and offer practical benefits when computation is expensive. Our numerical analysis highlights improved performance guaranties over classical estimators, particularly in terms of sample efficiency and scalability.

2605.24915 2026-05-26 cs.GR cs.CV

Snapshot Polarimetric Display Inverse Rendering

快照偏振显示逆渲染

Seokjun Choi, Yunseong Moon, Kaizhang Kang, Hoon-Gyu Chung, Jin-Nyeong Kim, Giljoo Nam, Seung-Hwan Baek

发表机构 * POSTECH

AI总结 本文提出一种快照偏振显示逆渲染方法,利用LCD投影线性偏振RGB图案和偏振相机获取光谱偏振测量,通过前馈Transformer预测每像素法线、反照率、粗糙度和金属度,在真实桌面场景中优于现有方法。

详情
AI中文摘要

逆渲染仍然是图形学和视觉领域的核心挑战,尤其是在轻量级桌面工作流程所需的快照配置中,每帧信息预算高度受限。以往的逆渲染工作探索了各种可用的维度来丰富每次拍摄的信息,包括时间调制、光谱编码和偏振。在这项工作中,我们引入了偏振显示逆渲染,使用LCD投影线性偏振RGB二值图案,并配备四分之一波片的RGB偏振相机在单次拍摄中获取光谱偏振测量。一个前馈Transformer将这些测量映射到每像素法线、反照率、粗糙度和金属度。为了克服训练数据稀缺,我们通过生成流形扩展了一组有限的实测偏振双向反射分布函数。在真实桌面设置上的评估表明,该方法在多种场景中实现了准确的逆渲染,优于现有方法。

英文摘要

Inverse rendering remains a core challenge in graphics and vision, especially in the snapshot configurations required for lightweight desktop workflows, where the per-frame information budget is highly constrained. Previous inverse rendering work explores various available dimensions for enriching the per-shot information, including temporal modulation, spectral encoding, and polarization. In this work, we introduce polarimetric display inverse rendering, using an LCD to project a linearly polarized RGB binary pattern and an RGB polarization camera augmented with a quarter-wave plate to acquire spectro-polarimetric measurements in a single shot. A feed-forward transformer maps these measurements to per-pixel normal, albedo, roughness, and metallicity. To overcome training data scarcity, we expand a limited set of measured polarimetric bidirectional reflectance distribution functions via a generative manifold. Evaluations on a real desktop setup demonstrate accurate inverse rendering across diverse scenes, outperforming existing approaches.

2605.24914 2026-05-26 cs.IR cs.DB cs.LG

MVR-cache: Optimizing Semantic Caching via Multi-Vector Retrieval and Learned Prompt Segmentation

MVR-cache:通过多向量检索和学习型提示分割优化语义缓存

Ali Noshad, Zishan Zheng, Yinjun Wu

发表机构 * School of Computer Science, Peking University, Beijing, China(北京大学计算机科学学院,北京,中国) School of Information, Renmin University of China, Beijing, China(中国人民大学信息学院,北京,中国)

AI总结 提出MVR-cache方法,利用多向量检索和学习型提示分割模型,通过强化学习优化缓存命中率,在保证正确性的前提下将缓存命中率提升高达37%。

Comments Published in ICML 2026

详情
AI中文摘要

为了降低LLM的成本和延迟,语义缓存系统必须准确识别新提示是否与缓存提示匹配。当前方法通常依赖简单的相似性度量,限制了其有效性。我们提出MVR-cache,一种新颖的语义缓存方法,通过集成多向量检索(MVR)显著提高了检索准确性。MVR-cache基于一个可学习的分割模型,智能地分割提示,通过MaxSim实现细粒度的相似性比较。我们从严格的理论分析中推导出模型的训练目标,确保优化该目标能在严格正确性约束下直接最大化缓存命中率。为了解决由此产生的非可微组合优化问题,我们利用基于强化学习的训练策略,以理论推导的目标作为奖励。在跨不同任务的已有基准上的实验结果表明,与最先进方法相比,MVR-cache在保持相同正确性保证的同时,一致地将缓存命中率提高了高达37%。MVR-cache可在https://github.com/PKU-SDS-lab/MVR-Cache获取。

英文摘要

To reduce LLM costs and latency, semantic caching systems must accurately identify when a new prompt matches a cached one. Current methods often rely on simplistic similarity measures, which limit their effectiveness. We introduce MVR-cache, a novel semantic caching approach that significantly improves retrieval accuracy by integrating Multi-Vector Retrieval (MVR). MVR-cache is built upon a learnable segmentation model that intelligently splits prompts, enabling fine-grained similarity comparisons via MaxSim. We derive the model's training objective from a rigorous theoretical analysis. This can ensure that optimizing this objective directly maximizes cache hits under strict correctness constraints. To solve the resulting non-differentiable combinatorial optimization problem, we leverage a reinforcement learning-based training strategy with the theoretically grounded objectives as the reward. Experimental results on established benchmarks across diverse tasks confirm that in comparison to the state-of-the-art, MVR-cache consistently increases the cache hit rates by up to 37% while maintaining the same correctness guarantees. MVR-cache is available at https://github.com/PKU-SDS-lab/MVR-Cache

2605.24913 2026-05-26 eess.IV cs.AI q-bio.QM

Explainable Multi-Task Retinal Imaging Reveals Microvascular Signals for Systemic Risk Stratification in Type 2 Diabetes: A Pilot Study

可解释多任务视网膜成像揭示2型糖尿病系统性风险分层的微血管信号:一项初步研究

Mini Han Wang, Liting Huang, Wei Hong, Boonthawan Wingwon

发表机构 * Faculty of Computer Science and Artificial Intelligence, Shenzhen University of Advanced Technology(深圳先进技术大学计算机科学与人工智能学院) Frontier Science Computing Center, Zhuhai Institute of Advanced Technology Chinese Academy of Sciences(中国科学院珠海先进技术研究院前沿科学计算中心) Chinese University of Hong Kong(香港中文大学) Zhuhai People's Hospital (The Affiliated Hospital of Beijing Institute of Technology, Zhuhai Clinical Medical College of Jinan University)(珠海人民医院(北京理工大学珠海临床医学院附属医院)) Lampang Inter-Tech College, Lampang Thailand(泰国 Lampang 职业技术学院)

AI总结 本研究开发了一个可解释的多任务深度学习框架,通过分析视网膜微血管特征与系统性异常(如肾脏异常)的关联,验证了视网膜成像作为糖尿病系统性风险分层生物标志物的潜力。

Comments 18 pages, 4 figures

详情
AI中文摘要

视网膜成像提供了进入系统性微血管健康的非侵入性窗口,并已成为系统性疾病的潜在生物标志物。然而,视网膜特征是否编码了生物学上有意义的系统性信号,并且可以使用可解释人工智能(XAI)可靠地解释,仍不清楚。我们开发了一个可解释的多任务深度学习框架,以研究视网膜微血管特征与2型糖尿病系统性异常之间的关联。使用共享神经网络和针对血糖状态、肾脏异常和多系统参与的任务特定头部,分析了来自2,719名个体的11,011张眼底图像。使用梯度加权类激活映射(Grad-CAM)、解剖掩膜和血管对齐分析评估模型可解释性。该框架展示了任务依赖的预测性能,对肾脏异常的最佳区分度(AUC高达0.63),而血糖状态预测性能有限(AUC = 0.49-0.61)。可解释性分析一致地将模型注意力定位到视网膜血管和视盘周围区域。掩膜实验表明,遮挡血管区域导致性能下降最大,表明视网膜血管是主要的预测来源。不同架构表现出异质的注意力模式,提示存在多种系统性信号编码的表征路径。这项初步研究表明,视网膜微血管特征包含与系统性异常(尤其是微血管损伤)相关的可测量信号。通过将多任务学习与定量XAI验证相结合,该框架推动视网膜成像向用于糖尿病系统性风险分层的可解释数字生物标志物发展。

英文摘要

Retinal imaging provides a non-invasive window into systemic microvascular health and has emerged as a potential biomarker for systemic diseases. However, whether retinal features encode biologically meaningful systemic signals that can be reliably interpreted using explainable artificial intelligence (XAI) remains unclear. An explainable multi-task deep learning framework was developed to investigate associations between retinal microvascular features and systemic abnormalities in Type 2 Diabetes Mellitus. A total of 11,011 fundus images from 2,719 individuals were analysed using a shared neural network with task-specific heads for glycaemic status, kidney abnormality, and multi-system involvement. Model interpretability was evaluated using Gradient-weighted Class Activation Mapping (Grad-CAM), anatomical masking, and vessel alignment analysis. The framework demonstrated task-dependent predictive performance, with the best discrimination observed for kidney abnormality (AUC up to 0.63), whereas glycaemic status prediction showed limited performance (AUC = 0.49-0.61). Explainability analyses consistently localized model attention to retinal vessels and peripapillary regions. Masking experiments showed that occlusion of vascular regions caused the greatest performance decline, indicating that retinal vessels were the primary predictive source. Different architectures exhibited heterogeneous attention patterns, suggesting multiple representational pathways for systemic signal encoding. This pilot study demonstrates that retinal microvascular features contain measurable signals associated with systemic abnormalities, particularly microvascular damage. By integrating multi-task learning with quantitative XAI validation, this framework advances retinal imaging toward interpretable digital biomarkers for systemic risk stratification in diabetes.

2605.24903 2026-05-26 cs.CR cs.LG

SEED: Semi-supervised Continual MalwarE Detection for Tackling ConcEpt Drift on a BuDget

SEED: 预算约束下应对概念漂移的半监督持续恶意软件检测

Suresh Kumar Amalapuram, Bikraj Shresta, Siva Ram murthy Chebiyam, Bheemarjuna Reddy Tamma, Sumohana S Channappayya

发表机构 * Indian Institute of Technology Ropar(印度理工学院罗帕尔) Indian Institute of Technology Hyderabad(印度理工学院海得拉巴)

AI总结 提出SEED方法,结合定制二元交叉熵损失与半监督持续学习和主动学习,在有限标注下有效检测未知恶意软件,平均AUT提升40%(BODMAS)和14%(AndroZoo)。

详情
AI中文摘要

基于机器学习的恶意软件检测器会随着良性应用和恶意应用中的概念漂移而随时间变得过时。最近的方法依赖完全标注数据,并利用层次对比损失(HCL)与主动学习,通过利用恶意软件表示中的语义结构来提高对漂移的鲁棒性。然而,在安全领域获取标注数据很困难。在部分标注设置下,HCL在检测未知恶意软件时性能显著下降,尤其是在BODMAS等可能缺乏强语义结构的数据集上。本文提出SEED,一种在有限监督下进行恶意软件检测的语义结构无关方法。SEED将定制的二元交叉熵目标与半监督持续学习和主动学习相结合。对于部分标注的已见任务,未标注样本通过奇异值分解投影到从先前已见数据构建的表示空间中,并与合适的标注样本配对以鼓励表示一致性。对于完全未标注的未见任务,使用表示空间中的余弦距离量化不确定性,并选择最不确定的样本供分析师标注。我们在Windows和Android恶意软件数据集上评估SEED。在已见任务上仅使用20%标注数据,与HCL*(HCL的半监督适应)相比,SEED在未知恶意软件检测上平均AUT提升40%(BODMAS)和14%(AndroZoo),同时在APIGraph上保持竞争力。最后,我们引入延迟缓冲区更新策略以减少重放期间的标签噪声传播并提高学习稳定性。

英文摘要

Machine learning based malware detectors become obsolete over time due to concept drift in benign and malware applications. Recent methods rely on fully labeled data and use hierarchical contrastive loss (HCL) with active learning to improve robustness against drift by exploiting semantic structure in malware representations. However, obtaining labeled data in the security domain is difficult. Under partially labeled settings, HCL suffers significant performance degradation in detecting unseen malware, especially on datasets such as BODMAS where strong semantic structure may not exist. In this paper, we propose SEED, a semantic-structure-agnostic method for malware detection under limited supervision. SEED combines a tailored binary cross-entropy objective with semi-supervised continual learning and active learning. For partially labeled seen tasks, unlabeled samples are projected into a representation space constructed from previously seen data using singular value decomposition, and paired with suitable labeled samples to encourage representation consistency. For unseen tasks with fully unlabeled data, uncertainty is quantified using cosine distance in representation space, and the most uncertain samples are selected for analyst labeling. We evaluate SEED on both Windows and Android malware datasets. Using only 20% labeled data on seen tasks, SEED achieves average AUT improvements of 40% on BODMAS and 14% on AndroZoo for unseen malware detection compared to HCL* (the semi-supervised adaptation of HCL), while remaining competitive on APIGraph. Finally, we introduce a delayed buffer update strategy to reduce label noise propagation during replay and improve learning stability.

2605.24876 2026-05-26 math.NA cs.LG cs.NA

IV-Net: A neural network for elliptic PDEs with random and highly varying coefficients

IV-Net: 用于随机和高变系数椭圆型偏微分方程的神经网络

Shan Zhong, George Biros

发表机构 * Oden Institute for Computational Science and Engineering, The University of Texas at Austin(计算科学与工程院,德克萨斯大学奥斯汀分校) Walker Department of Mechanical Engineering, The University of Texas at Austin(机械工程系,德克萨斯大学奥斯汀分校)

AI总结 提出一种受V-cycle多重网格求解器启发的神经算子架构IV-Net,用于逼近高对比度空间变系数线性椭圆型偏微分方程的解,在高度异质系数问题上优于POD和现有神经算子,在光滑系数低频振荡Helmholtz问题上与Fourier神经算子性能相当。

Comments 36 pages

详情
AI中文摘要

我们提出了一种新颖的神经算子架构,旨在逼近具有高对比度、空间变化系数的线性椭圆型偏微分方程的解。该网络称为迭代V形网络(IV-Net),实现了从输入系数和右端项到相应解场的映射。IV-Net的架构受V-cycle多重网格求解器启发,并与之高度相似。IV-Net模型通过物理域中定义的卷积层进行参数化。对于具有高度异质系数的强制问题,所提出的网络相对于本征正交分解(POD)方法和几种现有的神经算子架构表现出优越的性能。对于具有光滑系数的低频振荡Helmholtz问题,其性能与Fourier神经算子相似。我们分析了IV-Net的逼近误差和收敛行为、其数据效率以及对底层离散网格的依赖性。此外,我们通过一系列数值实验展示了该架构的实际有效性,包括在不确定性量化、反问题和感兴趣量预测中的应用。

英文摘要

We introduce a novel neural operator architecture designed to approximate solutions of linear elliptic partial differential equations with high-contrast, spatially varying coefficients. The network, termed the Iterated V-shaped Net (IV-Net), realizes a mapping from the input coefficients and righthand side to the corresponding solution field. The architecture of IV-Net is informed by, and closely resembles, a V-cycle multigrid solver. The IV-Net model is parameterized via convolutional layers defined in the physical domain. For coercive problems with highly heterogeneous coefficients, the proposed network exhibits superior performance relative to a proper orthogonal decomposition (POD) approach and several existing neural operator architectures. For low-frequency oscillatory Helmholtz problems with smooth coefficients, its performance is similar to that of a Fourier neural operator. We analyze the approximation error and convergence behavior of IV-Net, its data efficiency, and its dependence on the underlying discretization mesh. Furthermore, we demonstrate the practical effectiveness of the architecture through a series of numerical experiments, including applications to uncertainty quantification, inverse problems, and prediction of quantities of interest.

2605.24860 2026-05-26 eess.SY cs.AI cs.ET cs.LG cs.RO cs.SY

DBPnet: Damper Characteristics-Based Bayesian Physics-Informed Neural Network for Wheel Load Estimation

DBPnet:基于阻尼特性的贝叶斯物理信息神经网络用于车轮载荷估计

Tianyi Wang, Tianyi Zeng, Zimo Zeng, Feiyang Zhang, Yujin Wang, Xiangyu Li, Yiming Xu, Sikai Chen, Junfeng Jiao, Christian Claudel, Xinbo Chen

发表机构 * Department of Civil, Architectural, and Environmental Engineering, The University of Texas at Austin(德克萨斯大学奥斯汀分校土木、建筑与环境工程系) School of Automation and Intelligent Sensing, Shanghai Jiao Tong University(上海交通大学自动化与智能感知学院) College of Electrical Engineering, Zhejiang University(浙江大学电气工程学院) School of Automotive Studies, Tongji University(同济大学汽车学院) School of Architecture, The University of Texas at Austin(德克萨斯大学奥斯汀分校建筑学院) Department of Civil and Environmental Engineering, University of Wisconsin-Madison(威斯康星大学麦迪逊分校土木与环境工程系)

AI总结 提出DBPnet,一种结合阻尼特性嵌入模块的贝叶斯物理信息神经网络,通过悬架连杆级建模和物理信息损失函数,实现鲁棒的车轮载荷估计。

Comments 14 pages, 12 figures, 6 tables

详情
AI中文摘要

高级驾驶辅助系统(ADAS)在现代汽车智能化中扮演重要角色,显著提升车辆安全性和稳定性。ADAS的性能关键依赖于准确可靠的车辆状态估计,特别是来自车辆动态传感器的信号。在这些信号中,车轮载荷是底盘控制和安全关键功能的关键变量,但由于复杂的悬架几何结构、非线性动力学和测量噪声,难以鲁棒估计。为解决此问题,我们提出DBPnet,一种贝叶斯物理信息神经网络(PINN),其具有受阻尼特性启发的物理感知嵌入模块。首先,本文提出一种悬架连杆级建模(SLLM)方法,通过显式考虑悬架的复杂几何结构,构建非线性瞬时动态模型。在SLLM基础上,将贝叶斯推断集成到PINN中,有效应对车辆底盘系统中的噪声和不确定性,从而提高模型的鲁棒性。然后,采用物理信息损失函数确保与基本物理原理的一致性,同时受阻尼特性启发的嵌入模块提取输入信号的时间变化特征,并将其融入PINN的每一层,确保物理观测指导神经网络而不受固定物理模型的约束。在高保真仿真和真实世界实验上的广泛评估表明,我们的DBPnet在RMSE和MaxError上始终低于基线方法。这些结果凸显了我们的DBPnet在推进车轮载荷估计和为更可靠的ADAS执行器功能发展做出贡献的潜力。

英文摘要

Advanced driver assistance systems (ADAS) play an important role in modern automotive intelligence, significantly enhancing vehicle safety and stability. The performance of ADAS critically relies on accurate and reliable vehicle state estimation, particularly from vehicle dynamic sensors. Among these signals, wheel load is a key variable for chassis control and safety-critical functions, yet it remains difficult to estimate robustly due to complex suspension geometry, nonlinear dynamics, and measurement noise. To address this issue, we propose DBPnet, a Bayesian physics-informed neural network (PINN) with a physics-aware embedding module inspired by damper characteristics. First, this paper presents a suspension linkage-level modeling (SLLM) approach that constructs a nonlinear instantaneous dynamic model by explicitly considering the complex geometric structure of the suspension. Building upon SLLM, Bayesian inference is integrated into the PINN to effectively cope with noise and uncertainty in the vehicle chassis system, thereby improving the model's robustness. Then, a physics-informed loss function is employed to ensure consistency with fundamental physical principles, while the damper characteristics-inspired embedding module extracts temporal variation features of input signals and incorporates them into each layer of the PINN, ensuring that physical observations guide the neural network without being constrained by fixed physical models. Extensive evaluations on high-fidelity simulations and real-world experiments demonstrate that our DBPnet consistently achieves lower RMSE and MaxError than baseline methods. These results highlight the potential of our DBPnet to advance wheel load estimation and contribute to the development of more reliable ADAS actuator functions.

2605.24834 2026-05-26 cs.CR cs.AI

Reflect-Guard: Enhancing LLM Safeguards against Adversarial Prompts via Logical Self-Reflection

Reflect-Guard: 通过逻辑自我反思增强大语言模型对对抗性提示的防护

Lixing Lin, Juli You, Yue Li, Luyun Lin, Yiqing Wang, Zhen Zhang, Moxuan Zheng

发表机构 * Yale University(耶鲁大学) Columbia University(哥伦比亚大学) Citigroup(摩根大通) Independent Researcher(独立研究者)

AI总结 提出Reflect-Guard方法,通过参数高效微调为大语言模型安全分类器注入链式思维自我反思能力,显著提升对对抗性越狱攻击的检测性能。

Comments 12 pages, 2 figures, and 4 tables

详情
AI中文摘要

大语言模型安全分类器(如Llama Guard)能有效检测明显有害的提示,但对通过角色扮演场景、虚构框架和间接请求伪装恶意意图的对抗性越狱攻击仍然脆弱。我们提出Reflect-Guard,一种通过参数高效微调为大语言模型安全分类器注入链式思维自我反思能力的方法。我们的方法从GPT-4o-mini中提炼分析推理能力,形成结构化反思注释,然后通过QLoRA训练Llama-Guard-3-8B,使其在发布安全判决前生成逻辑自我反思。仅使用1000个训练样本并更新0.5%的模型参数(约4200万),Reflect-Guard在两个具有挑战性的基准测试上取得了显著改进。在WildGuardTest上,F1分数从0.770提升至0.842(+7.2个百分点),对抗性提示的召回率从0.513提升至0.921(+40.8个百分点)。在JailbreakBench上,攻击成功率从10.3%降至1.8%,相对降低82.5%。这些增益在对抗性输入上尤为明显,显式的推理步骤使模型能够看穿击败标准模式匹配方法的混淆技术。我们的结果表明,教会安全分类器推理对抗性意图,而非简单分类表面模式,是实现鲁棒大语言模型安全性的有前景方向。

英文摘要

Large language model (LLM) safety classifiers such as Llama Guard are effective at detecting overtly harmful prompts but remain vulnerable to adversarial jailbreak attacks that disguise malicious intent through role-play scenarios, fictional framing, and indirect requests. We present Reflect-Guard, a method that augments LLM-based safety classifiers with chain-of-thought self-reflection capabilities through parameter-efficient fine-tuning. Our approach distills analytical reasoning from GPT-4o-mini into structured reflection annotations, then trains Llama-Guard-3-8B via QLoRA to generate logical self-reflections before issuing safety verdicts. Using only 1000 training examples and updating just 0.5% of model parameters (~42M), Reflect-Guard achieves substantial improvements on two challenging benchmarks. On WildGuardTest, F1 score improves from 0.770 to 0.842 (+7.2 pp), with recall on adversarial prompts increasing from 0.513 to 0.921 (+40.8 pp). On JailbreakBench, the attack success rate drops from 10.3% to 1.8%, representing an 82.5% relative reduction. These gains are especially pronounced on adversarial inputs, where the explicit reasoning step enables the model to see through obfuscation techniques that defeat standard pattern-matching approaches. Our results demonstrate that teaching safety classifiers to reason about adversarial intent, rather than simply classify surface patterns, is a promising direction for robust LLM safety.

2605.24825 2026-05-26 eess.SP cs.SD cs.SY eess.AS eess.SY math.OC

Time Segmented Beamforming via Dynamic Programming: Theory and Implementation

基于动态规划的时间分段波束形成:理论与实现

Manan Mittal, Ryan M. Corey, Diego Cuji, John R. Buck, Andrew C. Singer

发表机构 * Department of Electrical and Computer Engineering, Stony Brook University(石溪大学电气与计算机工程系) Department of Electrical and Computer Engineering, University of Illinois(伊利诺伊大学电气与计算机工程系) Department of Electrical and Computer Engineering, University of Massachusetts Dartmouth(马萨诸塞大学达特茅斯分校电气与计算机工程系) College of Applied Science and Engineering, Stony Brook University(石溪大学应用科学与工程学院)

AI总结 针对时变干扰环境,提出一种基于动态规划的时间分段无失真响应波束形成器,通过数据驱动的自适应分段估计协方差矩阵以跟踪非平稳干扰。

Comments 16 pages, 17 figures, Beamforming New Approach Regret Bounds

详情
AI中文摘要

在具有时变干扰的动态声学环境中,有效的波束形成需要识别随时间变化的平稳区域。Capon波束形成器是一种白化匹配滤波器,约束在期望方向上保持单位增益,理论上依赖于瞬时集合协方差矩阵。实际实现依赖于批量Capon(或样本矩阵求逆),通过对一批快照进行平均来估计样本协方差矩阵(SCM)。这种实用方法隐含假设批处理窗口内的数据是平稳的,可以相干组合。在非平稳环境中,对固定或过长窗口进行平均的批处理方法会失效,因为移动干扰会模糊SCM并降低波束形成器的零陷能力。为解决此问题,本文引入了一种时间分段无失真响应波束形成器。受分段最小二乘法(将分段多项式拟合到数据,同时惩罚过度分段以防止过拟合)的启发,该框架通过引入数据驱动的时间分段扩展了实用的Capon波束形成。该公式在最小化输出功率的同时,动态调整SCM估计窗口以适应局部平稳性,为跟踪时变干扰提供了一种原则性方法。

英文摘要

In dynamic acoustic environments with time-varying interferers, effective beamforming requires identifying stationary regions over time. The Capon beamformer, a whitened matched filter constrained to maintain unity gain in the desired direction, theoretically relies on the instantaneous ensemble covariance matrix. Practical implementations rely on the batch Capon (or Sample Matrix Inversion), which estimates the sample covariance matrix (SCM) by averaging over a block of snapshots. This practical approach implicitly assumes that the data within the batch window is stationary and can be coherently combined. In non-stationary settings, a batch approach that averages over fixed or excessively long windows fails, as moving interferers smear the SCM and degrade the beamformer's nulling capabilities. To address this, this paper introduces a temporally segmented distortionless response beamformer. Inspired by the segmented least squares method, which fits piecewise polynomials to data while penalizing excessive segmentation to prevent overfitting, the framework extends practical Capon beamforming by incorporating data-driven temporal segmentation. This formulation minimizes output power while dynamically adapting the SCM estimation windows to local stationarity, offering a principled approach to tracking time-varying interferers.

2605.24817 2026-05-26 cs.CR cs.AR cs.CL cs.LG

RouteScan: A Non-Intrusive Approach to Auditing MoE LLMs Safety via Expert Routing Telemetry

RouteScan: 通过专家路由遥测对MoE大语言模型安全性进行非侵入式审计

Bo Lv, Zhiheng Xu, KeDong Xiu, Ruyi Ding, Tianhang Zheng, Zhibo Wang, Kui Ren

发表机构 * Zhejiang University(浙江大学) Donghua University(东华大学) Louisiana State University(路易斯安那州立大学)

AI总结 提出RouteScan,一种利用MoE模型GPU级专家路由遥测(如预填充阶段活跃线程数)作为微架构指纹,通过轻量级检测流水线识别恶意提示的非侵入式审计框架,在未见过的有害领域AUROC超0.93,新越狱包装下超0.96,且相比基于内容的审计方法具有隐私优势。

Comments 20 pages. Under submission

详情
AI中文摘要

混合专家(MoE)架构已成为扩展大型语言模型(LLM)日益重要的范式。随着MoE模型越来越多地部署在实际服务中,安全性审计变得必要,以验证这些模型在运行过程中是否产生或助长有害行为。然而,现有的基于内容的审计方法通常需要访问用户提示、模型输入或生成输出,可能暴露敏感用户信息,并在LLM安全性和用户隐私之间造成根本性紧张。另一方面,我们观察到,在MoE模型中,稀疏专家路由将不同输入映射到激活不同的专家执行模式,在低级GPU执行遥测中产生可测量的足迹。受此观察启发,我们提出RouteScan,一种通过GPU级专家路由遥测检测有害行为的非侵入式审计框架。具体而言,RouteScan利用预填充阶段分配给专家模块的活跃GPU线程数作为判别性微架构指纹,并构建轻量级检测流水线,隔离跨领域不变风险指标以精确识别恶意提示。对具有不同路由设计的开源MoE LLM的全面评估表明,RouteScan实现了强泛化,在未见过的有害领域AUROC超过0.93,在新型越狱包装下超过0.96。此外,经验性反演测试表明,收集的专家路由遥测为提示重建提供的信息有限,表明相对于基于内容的审计方法具有实际隐私优势。

英文摘要

Mixture-of-Experts (MoE) architectures have become an increasingly important paradigm for scaling Large Language Models (LLMs). As MoE models are increasingly deployed in real-world services, safety auditing becomes necessary to verify whether these models produce or facilitate harmful behaviors during operation. However, existing content-based auditing methods typically require access to user prompts, model inputs, or generated outputs, potentially exposing sensitive user information and creating a fundamental tension between LLM safety and user privacy. On the other hand, we observe that, in MoE models, sparse expert routing maps different inputs to activate different expert-execution patterns, producing measurable footprints in low-level GPU execution telemetry. Inspired by this observation, we propose RouteScan, a non-intrusive auditing framework for detecting harmful behaviors through GPU-level expert routing telemetry. Specifically, RouteScan utilizes the number of active GPU threads allocated to expert modules during the prefilling phase as a discriminative micro-architectural fingerprint, and builds a lightweight detection pipeline that isolates cross-domain invariant risk indicators for the precise identification of malicious prompts. Comprehensive evaluations on open-source MoE LLMs with distinct routing designs demonstrate that RouteScan achieves strong generalization, with an AUROC exceeding 0.93 on unseen harmful domains and 0.96 under novel jailbreak wrappers. Moreover, empirical inversion tests show that the collected expert routing telemetry provides limited information for prompt reconstruction, suggesting a practical privacy advantage over content-based auditing methods.

2605.24765 2026-05-26 cs.CR cs.LG

CyberMaskQA: A Privacy-Aware Benchmark for Evaluating Large Language Models in Cybersecurity Question Answering

CyberMaskQA: 一个用于评估大语言模型在网络安全问答中隐私意识的基准

Matilda Gaddi, Jin Noh, Onat Gungor, Tajana Rosing

发表机构 * Department of Computer Science and Engineering(计算机科学与工程系) University of California, San Diego (UCSD)(加州大学圣地亚哥分校)

AI总结 针对现有基准缺乏隐私保护评估的问题,提出CyberMaskQA基准,通过结合人工场景与LLM语义扩展生成带隐私标签的数据集,以评估模型在网络安全问答中的推理与隐私保护能力。

详情
AI中文摘要

大型语言模型(LLM)越来越多地应用于网络安全问答(QA),用于事件响应和漏洞分析等关键任务。然而,现实世界的操作环境,包括系统日志和网络配置,本质上包含敏感标识符,例如IP地址、主机名和用户账户。在受监管的环境中,使用基于云的模型处理这些数据通常不安全或不可行。此外,隐私保护问答的进展因缺乏能够同时评估操作推理和隐私保护的带注释、上下文丰富的数据集而受阻。为解决这一差距,我们引入了CYBERMASKQA,一个涵盖关键安全领域的隐私感知问答基准。与主要测试事实知识的现有基准不同,CYBERMASKQA将问题置于现实的组织环境中,并具有资产和权限之间的显式因果依赖关系。通过系统化的流水线生成,该数据集结合了人工策划的基础场景与LLM驱动的语义扩展,为每个实例标注精确的私有实体标签,以实现可控的信息披露。对问答准确性和掩码性能的评估证明了该基准在开发可部署、上下文感知的网络安全模型以及促进隐私-效用权衡的细致研究方面的实用性。一经接受,我们将发布数据集和生成框架。

英文摘要

Large language models (LLMs) are increasingly applied to cybersecurity question answering (QA) for critical tasks such as incident response and vulnerability analysis. However, real-world operational contexts, including system logs and network configurations, inherently contain sensitive identifiers, e.g., IP addresses, host names, and user accounts. Processing this data with cloud-based models is often unsafe or infeasible in regulated environments. Furthermore, progress in privacy-preserving QA is hindered by the lack of annotated, context-rich datasets capable of jointly evaluating operational reasoning and privacy preservation. To address this gap, we introduce CYBERMASKQA, a privacy-aware QA benchmark covering key security domains. Unlike existing benchmarks that primarily test factual knowledge, CYBERMASKQA grounds questions in realistic organizational contexts with explicit causal dependencies among assets and privileges. Generated through a systematic pipeline, the dataset combines human-curated base scenarios with LLM-driven semantic expansion, annotating each instance with precise private entity labels to enable controlled information disclosure. Evaluations of QA accuracy and masking performance demonstrate the benchmark's utility for developing deployable, context-aware cybersecurity models and facilitating nuanced studies of privacy-utility trade-offs. Upon acceptance, we will release the dataset and the generation framework.

2605.24764 2026-05-26 cs.IR cs.AI cs.CL

Spectral Retrieval: Multi-Scale Sinc Convolution over Token Embeddings for Localized Retrieval in LLM Multi-Agent Systems

光谱检索:基于多尺度sinc卷积的令牌嵌入局部化检索在LLM多智能体系统中的应用

Andrea Morandi

发表机构 * Cisco(思科)

AI总结 提出光谱检索方法,通过多尺度sinc卷积对令牌嵌入进行重排序,在无需重新训练的情况下显著提升局部化检索性能,并自然适配于LLM多智能体系统。

详情
AI中文摘要

[删节版] - 光谱检索是一种插件式重排序阶段,通过在令牌嵌入上进行多尺度sinc卷积,在逐令牌MaxSim和均值池化检索之间进行插值。在标准稠密检索中,每个文档是一个均值池化向量;当相关性局限于一个短子跨度时,信号会平均为噪声。光谱检索重用来自晚期交互索引的逐令牌嵌入,并将其与归一化的sinc核在多个尺度上进行卷积。在L=1时,核作为恒等映射,恢复逐令牌MaxSim;随着L增大,它趋近于均匀滤波器,恢复均值池化。跨位置和尺度的最大余弦产生一个得分,其信息量不低于任一端点。在一个包含1000个文档和植入单位置尖峰的可控合成基准上,无论尖峰强度如何,均值池化检索处于随机水平(Recall@10 ~ 0.02),而光谱检索在植入余弦超过语料级令牌噪声基底时达到Recall@10 = 1.0。在冻结的all-mpnet-base-v2编码器上的LIMIT-small数据集中,光谱检索无需重新训练即可将Recall@10从0.33提升至0.90,MRR从0.22提升至0.79,严格Success@10从0.12提升至0.84。该方法自然适用于多智能体LLM系统,其中每个智能体受益于共享语料库上更紧密、特定角色的检索窗口。

英文摘要

[Abridged] - Spectral Retrieval is a plug-in re-ranking stage that interpolates between per-token MaxSim and mean-pool retrieval through a multi-scale sinc convolution over token embeddings. In standard dense retrieval each document is one mean-pooled vector; when relevance localises into a short subspan, the signal averages into noise. Spectral Retrieval reuses per-token embeddings from a late-interaction index and convolves them with a normalised sinc kernel at multiple scales. At L=1 the kernel acts as the identity, recovering per-token MaxSim; as L grows it approaches a uniform filter, recovering mean pooling. The maximum cosine over positions and scales yields a score provably no less informative than either endpoint. On a controlled synthetic benchmark with 1,000 documents and planted single-position spikes, mean-pool retrieval sits at chance (Recall@10 ~ 0.02) regardless of spike strength, while Spectral Retrieval reaches Recall@10 = 1.0 once the planted cosine exceeds the corpus-level token noise floor. On LIMIT-small with a frozen all-mpnet-base-v2 encoder, Spectral Retrieval lifts Recall@10 from 0.33 to 0.90, MRR from 0.22 to 0.79, and strict Success@10 from 0.12 to 0.84, without retraining. The method fits naturally into multi-agent LLM systems, where each agent benefits from a tighter, role-specific retrieval window over a shared corpus.

2605.24749 2026-05-26 stat.ML cs.LG

How Neural Reward Models Learn Features for Policy Optimization: A Single-Index Analysis

神经奖励模型如何学习策略优化的特征:单指标分析

Rei Higuchi, Ryotaro Kawata, Akifumi Wachi, Shokichi Takakura, Kohei Miyaguchi, Taiji Suzuki

发表机构 * The University of Tokyo(东京大学) RIKEN AIP(理化学研究所AIP) LY Corporation(LY公司)

AI总结 本文通过高斯单指标模型分析两阶段神经奖励模型,研究指数奖励加权对特征学习的影响,并推导出倾斜策略价值差距的界限,给出可接受的部署温度范围。

Comments 35 pages

详情
AI中文摘要

奖励建模不仅是一个预测问题:在KL正则化策略优化中,学习到的奖励被指数化以定义部署策略,因此下游价值取决于奖励倾斜区域中的误差。我们在高斯单指标模型 $r^*(x) = σ^*(\langle θ^*, x angle)$ 且 $x \sim N(0, I_d)$ 下研究这种反馈。我们分析了一个两阶段神经奖励模型,该模型首先从奖励加权样本中学习隐藏方向 $θ^*$,然后通过加权岭回归拟合读出层。指数奖励加权改变了第一层可用的Hermite信号;对于任何高于无维度 $O(1)$ 阈值的特征学习温度 $β_1$,恒定比例的神经元恢复隐藏方向,弱恢复复杂度由生成指数控制。在特征恢复后,我们推导了理想化标签加权拟合(权重 $e^{y/β_2}$)和更实用的代理加权拟合(权重 $e^{r_{a_0}(x)/β_2}$)的倾斜策略价值差距界限。保持 $β_2$ 依赖性显式,得到一组可接受的部署温度,平衡降低 $β_2$ 带来的收益与指数加权放大的学习成本;在代理加权情况下,代理相关因子缩小了该可接受集。

英文摘要

Reward modeling is not only a prediction problem: in KL-regularized policy optimization, the learned reward is exponentiated to define the deployed policy, so downstream value depends on errors in reward-tilted regions. We study this feedback in a Gaussian single-index model with $r^*(x) = σ^*(\langle θ^*, x\rangle)$ and $x \sim N(0, I_d)$. We analyze a two-stage neural reward model that first learns the hidden direction $θ^*$ from reward-weighted samples and then fits the readout layer by weighted ridge regression. Exponential reward weighting changes the Hermite signal available to the first layer; for any feature-learning temperature $β_1$ above a dimension-free $O(1)$ threshold, a constant fraction of neurons recover the hidden direction, with weak-recovery complexity governed by the generative exponent. After feature recovery, we derive tilted-policy value-gap bounds for an idealized label-weighted fit with weights $e^{y/β_2}$ and a more practical surrogate-weighted fit with weights $e^{r_{a_0}(x)/β_2}$. Keeping the $β_2$-dependence explicit yields an admissible set of deployment temperatures, balancing the gain from lowering $β_2$ against the learning cost amplified by exponential weighting; in the surrogate-weighted case, proxy-dependent factors shrink this admissible set.

2605.24748 2026-05-26 astro-ph.SR cs.LG

Deep Learning-Enabled Prediction of Geoeffective CMEs Using SOHO and SDO Observations

基于深度学习的日冕物质抛射地效性预测:利用SOHO和SDO观测数据

Zhaoxin Yan, Jason T. L. Wang, Haimin Wang, Harim Lee, Ju Jing, Yan Xu, Chunhui Xu, Vasyl Yurchyshyn

发表机构 * Institute for Space Weather Sciences(空间天气科学研究所) Department of Computer Science(计算机科学系) Center for Solar-Terrestrial Research(太阳-地球研究中心) Big Bear Solar Observatory(大熊太阳观测站)

AI总结 提出一种融合卷积神经网络和预测网络的模型,利用SOHO和SDO观测数据预测日冕物质抛射是否引发地磁暴及其概率,在五折交叉验证中TSS达0.703,Brier分数0.095。

Comments 23 pages, 12 figures, 4 tables

详情
AI中文摘要

理解和预测日冕物质抛射(CME)的地效性对于保护近地空间环境和地球上的基础设施至关重要。在本研究中,我们提出了一种新颖的融合模型来预测CME事件的地效性。我们的模型结合了用于特征学习的卷积神经网络和用于特征融合及事件分类的预测网络。该模型利用来自太阳和日球层天文台(SOHO)上的大角度光谱日冕仪(LASCO)以及太阳动力学天文台(SDO)上的大气成像组件(AIA)和日震与磁成像仪(HMI)的观测数据进行训练。然后,训练好的模型用于预测一个到达地球的CME是否会引起地磁暴,以及/或者该CME引起此类暴的概率。基于五折交叉验证方案的实验结果表明,我们的融合模型表现出良好的性能:当模型用作确定性预测工具时,平均真实技能统计(TSS)得分为0.703;当模型用作概率预测工具时,平均Brier得分为0.095,其中TSS得分为1或Brier得分为0表示完美性能。这项工作有助于预测太阳-地球相互作用中指向地球的CME与地磁暴之间的因果关系。

英文摘要

Understanding and forecasting the geoeffectiveness of a coronal mass ejection (CME) is crucial for protecting infrastructure in the near-Earth space environment and on Earth. In this study, we present a novel fusion model to forecast the geoeffectiveness of CME events. Our model combines convolutional neural networks for feature learning and a prediction network for feature fusion and event classification. The model is trained by observations from instruments including the Large Angle Spectroscopic Coronagraph (LASCO) on board the Solar and Heliospheric Observatory (SOHO) and the Atmospheric Imaging Assembly (AIA) and Helioseismic and Magnetic Imager (HMI) on board the Solar Dynamics Observatory (SDO). The trained model is then used to predict whether an Earth-reaching CME will cause a geomagnetic storm and/or the probability that the CME will cause such a storm. Experimental results based on a five-fold cross validation scheme demonstrate the good performance of our fusion model, achieving a mean true skill statistic (TSS) score of 0.703 when the model is used as a deterministic prediction tool, and a mean Brier score of 0.095 when the model is used as a probabilistic forecasting tool, where a TSS score of 1 or a Brier score of 0 indicates perfect performance. This work contributes to forecasting the causal relationship between Earth-directed CMEs and geomagnetic storms in solar-terrestrial interactions.

2605.24741 2026-05-26 math.ST cs.IT cs.LG math.IT stat.ML stat.TH

On the Sample Complexity of Robust Binary Hypothesis Testing

关于鲁棒二元假设检验的样本复杂度

Shankar Vallinayagam, Ankit Pensia, Varun Jog

发表机构 * Department of Pure Mathematics and Mathematical Statistics, University of Cambridge(剑桥大学纯数学与数学统计系) Department of Statistics, Carnegie Mellon University(卡内基梅隆大学统计系)

AI总结 研究在三种污染模型下鲁棒二元假设检验的样本复杂度,证明最不利分布的存在性并给出显式公式,揭示样本复杂度对污染参数的不稳定性,并建立不同模型间样本复杂度的可比性。

Comments Comments welcome

详情
AI中文摘要

我们研究了在三种标准污染模型下鲁棒二元假设检验的样本复杂度:$\varepsilon$-加性(Huber)、$\varepsilon$-减性和$\varepsilon$-全变差(TV),分别记为$n^*_{\mathrm{Hub}}(\varepsilon)$、$n^*_{\mathrm{Sub}}(\varepsilon)$和$n^*_{\mathrm{TV}}(\varepsilon)$。对于减性污染,我们证明最不利分布存在并给出显式公式,使该模型与经典的Huber和TV模型一致。接下来我们表明,在所有三种模型中,样本复杂度可能在污染参数$\varepsilon$上高度不稳定,即使对于$o(\varepsilon)$的扰动也会增加多项式因子。类似地,当$\varepsilon$精确已知与仅知道$o(\varepsilon)$误差时,样本复杂度之间可能存在多项式因子差距。尽管所有模型中样本复杂度不稳定,但我们表明,在$\varepsilon$的常数因子重新缩放下,各模型的样本复杂度是可比较的。具体地,对于任意固定的$\delta_0>0$,以下对所有分布$p$和$q$成立:(i) $n^*_{\mathrm{Hub}}(\varepsilon) \lesssim n^*_{\mathrm{TV}}(\varepsilon) \lesssim n^*_{\mathrm{Hub}}(2\varepsilon)$,(ii) $n^*_{\mathrm{Sub}}(\varepsilon) \lesssim n^*_{\mathrm{TV}}(\varepsilon) \lesssim n^*_{\mathrm{Sub}}((2+\delta_0)\varepsilon)$,(iii) $n^*_{\mathrm{Sub}}(\varepsilon) \lesssim n^*_{\mathrm{Hub}}(\varepsilon) \lesssim n^*_{\mathrm{Sub}}((1+\delta_0)\varepsilon)$,且缩放常数是紧的。最后,我们将结果扩展到污染模型的自适应版本。

英文摘要

We study the sample complexity of robust binary hypothesis testing under three standard contamination models: $\varepsilon$-additive (Huber), $\varepsilon$-subtractive, and $\varepsilon$-total variation (TV), denoted by $n^*_{\mathrm{Hub}}(\varepsilon)$, $n^*_{\mathrm{Sub}}(\varepsilon)$, and $n^*_{\mathrm{TV}}(\varepsilon)$, respectively. For subtractive contamination, we show that least favourable distributions exist and provide explicit formulas for the same, bringing this model in line with the classical Huber and TV models. Next we show that in all three models, sample complexity may be highly unstable in the contamination parameter $\varepsilon$, increasing by polynomial factors even for $o(\varepsilon)$ perturbations. Similarly, there may be polynomial factor gaps between the sample complexities when $\varepsilon$ is known exactly versus when it is known up to $o(\varepsilon)$ error. Despite the instability of the sample complexity in all models, we show that the sample complexities across models are comparable up to constant-factor rescaling of $\varepsilon$. Specifically, for any fixed $δ_0>0$, the following hold for all distributions $p$ and $q$: (i) $n^*_{\mathrm{Hub}}(\varepsilon) \lesssim n^*_{\mathrm{TV}}(\varepsilon) \lesssim n^*_{\mathrm{Hub}}(2\varepsilon)$, (ii) $n^*_{\mathrm{Sub}}(\varepsilon) \lesssim n^*_{\mathrm{TV}}(\varepsilon) \lesssim n^*_{\mathrm{Sub}}((2+δ_0)\varepsilon)$, and (iii) $n^*_{\mathrm{Sub}}(\varepsilon) \lesssim n^*_{\mathrm{Hub}}(\varepsilon) \lesssim n^*_{\mathrm{Sub}}((1+δ_0)\varepsilon)$, and the scaling constants are tight. Finally, we extend our results to adaptive versions of the contamination models.

2605.24731 2026-05-26 eess.SY cs.RO cs.SY

Passivity-based Semi-autonomous Rotational Motion Navigation for Rigid-body Networks: Stability and Human Passivity Analysis

基于无源性的刚体网络半自主旋转运动导航:稳定性与人体无源性分析

Reiji Terunuma, Yuta Nakamura, Takeshi Hatanaka

发表机构 * Institute of Science Tokyo(东京科学研究所)

AI总结 提出一种基于无源性的半自主姿态控制框架,通过虚拟领导者和隐身控制实现多机器人系统在SO(3)上的人机交互稳定性,并证明在人体无源性假设下的闭环稳定性。

Comments This work is to be submitted to the 6th Workshop on Cyber-Physical Human Systems (CPHS2026) for possible publication

详情
AI中文摘要

本文提出了一种新颖的基于无源性的半自主姿态控制框架,特别关注定义在特殊正交群$SO(3)$上的姿态运动学。虽然人机交互有助于成功执行复杂任务,但确保$SO(3)$流形上人在回路系统的稳定性仍然是一个尚未解决的挑战。我们首先提出了一种新的控制架构,其中多机器人系统通过所谓的隐身控制保持反馈给人类操作员的平均信息的不变性,并且人类干预通过虚拟领导者进行调解,该虚拟领导者通过基于无源性的姿态同步律与机器人耦合。然后,我们在假设人类表现为无源系统的条件下,严格证明了所提出的在回路系统的闭环稳定性。为支持这一分析,进行了仿真研究,将人类操作员识别为动态系统,并检查了所识别模型的无源性特性。

英文摘要

This paper presents a novel passivity-based semi-autonomous attitude control framework, with a particular focus on attitude kinematics defined on the special orthogonal group $SO(3)$. While human-robot interaction facilitates the successful execution of complex tasks, ensuring stability of human-in-the-loop systems on the $SO(3)$ manifold remains a largely unsolved challenge. We first propose a new control architecture in which a multi-robot system preserves invariance of the average information fed back to the human operator through so-called stealthy control, and the human intervention is mediated through a virtual leader, which is coupled with the robots via a passivity-based attitude synchronization law. We then rigorously prove closed-loop stability of the proposed human-in-the-loop system under the assumption that the human behaves as a passive system. To support this analysis, simulation studies are conducted to identify the human operator as a dynamical system, and to examine passivity properties of the identified model.

2605.24696 2026-05-26 cs.CR cs.LG

CALIBURN: A Regime-Sensitivity Study of Operationally Calibrated Streaming Intrusion Detection

CALIBURN: 操作校准流式入侵检测的机制敏感性研究

Michel A. Youssef

发表机构 * Independent Researcher(独立研究者)

AI总结 本文提出CALIBURN流式告警流水线,通过贝叶斯变化点检测、等渗校准、成本敏感阈值、共形风险控制和多窗口烧毁率告警五个组件,在不同攻击率场景下评估其性能,在罕见攻击场景下AUC-PR达到0.943。

Comments 55 pages, 5 figures, 14 tables. Under review at Cyber Security and Applications. Code: https://github.com/MichelYsf/rcbsid-paper. Archived release: https://doi.org/10.5281/zenodo.20074590

详情
AI中文摘要

流式网络入侵检测系统必须持续处理流并保持内存有界,但大多数现有方法将告警阈值选择视为事后调优问题,不适合生产环境。操作员需要在部署前使用误报成本、漏报成本和告警预算等输入来指定告警行为。本文提出CALIBURN,一个由五个组件组成的流式告警流水线:截断贝叶斯在线变化点检测器、将变化点后验映射到经验条件攻击概率的等渗校准层、从操作员指定的误分类成本导出的成本敏感决策阈值、将告警预算规范转换为可交换性下窗口内有效阈值的共形风险控制包装器,以及从站点可靠性工程实践改编的多窗口烧毁率告警层。我们不声称统一优势,而是将CALIBURN作为机制敏感性研究,在三个攻击率场景下评估流水线:LITNET-2020(5.2%)、CICIDS2017(22.06%)和UNSW-NB15(64%)。在罕见攻击场景下,CALIBURN在LITNET-2020上达到AUC-PR 0.943,比最佳流式基线高出2.21倍,比最佳批处理参考高出4.12倍;等渗校准将Brier分数降低30%。在中等攻击率场景下,CALIBURN在CICIDS2017上仍是最强的流式方法,但被批处理密度方法超越。在高攻击率场景下,所有流式方法都接近攻击率下限。我们进一步识别了两种不同的CRC崩溃机制,导致在小的alpha下告警规则退化,并将两者作为操作指南提供给实践者。

英文摘要

Streaming network intrusion detection systems must process flows continuously while keeping memory bounded, but most current methods leave alerting threshold selection as a post-hoc tuning problem poorly suited to production. Operators need alerting behaviour specifiable before deployment using inputs such as false-negative cost, false-positive cost, and alerting budget. This paper presents CALIBURN, a five-component streaming alerting pipeline composed of a truncated Bayesian online change-point detector, an isotonic calibration layer mapping the change-point posterior to an empirical conditional attack probability, a cost-sensitive decision threshold derived from operator-specified misclassification costs, a Conformal Risk Control wrapper that converts an alert-budget specification into a within-window valid threshold under exchangeability, and a multi-window burn-rate alerting layer adapted from Site Reliability Engineering practice. Rather than claiming uniform dominance, we present CALIBURN as a regime-sensitivity study, evaluating the pipeline across three attack-prevalence regimes: LITNET-2020 at 5.2 percent, CICIDS2017 at 22.06 percent, and UNSW-NB15 at 64 percent. In the rare-attack regime, CALIBURN achieves AUC-PR 0.943 on LITNET-2020, outperforming the best streaming baseline by 2.21x and the best batch reference by 4.12x; isotonic calibration reduces Brier score by 30 percent. In the moderate-prevalence regime, CALIBURN remains the strongest streaming method on CICIDS2017 but is exceeded by batch density methods. In the high-prevalence regime, all streaming methods approach the prevalence floor. We further identify two distinct CRC-collapse mechanisms driving the alert rule to degeneracy at small alpha, treating both as operational guidance for practitioners.

2605.24673 2026-05-26 stat.ML cs.LG

Affinity Graph Connectivity in Convex Clustering

凸聚类中的亲和图连通性

Sam Rosen, Jason Xu

发表机构 * Department of Statistical Science, Duke University(杜克大学统计科学系) Department of Biostatistics, University of California Los Angeles(加州大学洛杉矶分校生物统计学系)

AI总结 研究凸聚类中亲和权重对应一般连通图时的有限样本界,通过随机游走理论分析聚类性能与图结构连通性的关系,并提出超参数调优应包括亲和权重的调整。

Comments 28 pages, 6 figures

详情
AI中文摘要

我们将凸聚类的有限样本界推广到目标函数中的亲和权重对应一般连通图的情形。这些界及其分析有助于更好地理解数据背后各种隐含连通结构下的聚类行为,并为质心恢复提供新的收敛速率。新的理论框架基于随机游走,这使得可以应用与随机图模型相关的集中不等式,并形式化了聚类性能与图结构连通性之间的关系。通过界的形式和实证结果,我们认为凸聚类问题的超参数调优还应包括输入亲和权重的调优。

英文摘要

We generalize finite-sample bounds for convex clustering to the setting where affinity weights appearing in the objective correspond to a general connected graph. These bounds and their analysis lead to a better understanding of clustering behavior under various implied connectivity structures behind the data and to new rates of convergence for centroid recovery. The new theoretical framework is based on random walks, which allow application of concentration inequalities related to random graph models, and formalizes the relationship between the clustering performance and the connectivity of the graph structures. Through the form of the bound and empirical results, we argue proper tuning of hyperparameters to convex clustering problems should also include tuning of input affinity weights.

2605.24663 2026-05-26 cs.CR cs.AI

CyBOKClaw: Human-in-the-Loop CyBOK Mapping for Cybersecurity Curriculum

CyBOKClaw:用于网络安全课程的人机协同CyBOK映射框架

Yan Lin Aung, Kevin Togbe

发表机构 * University of Derby, Derby, UK(德比大学)

AI总结 提出CyBOKClaw,一种可解释的人机协同检索框架,通过查询归一化、术语扩展、概念提升、主题描述丰富和领域敏感排序规则,将网络安全关键词/短语映射到CyBOK,并采用专家引导的top-5有用性指标ECA-5评估,在开发集和验证集上分别达到91.88%和98.00%的ECA-5。

详情
AI中文摘要

本文提出了CyBOKClaw,一个可解释的人机协同检索框架,用于将网络安全关键词或短语(KWoPs)映射到网络安全知识体系(CyBOK)。该框架并非将任务视为严格的精确分类,而是设计为供专家审查的top-k候选生成器。它结合了查询归一化、策划的术语扩展、概念级提升、主题描述丰富以及领域敏感的排序规则。由于教育领域的KWoPs通常宽泛、模糊且仅与CyBOK术语大致对齐,严格的精确匹配只能提供部分实际效用。因此,我们使用结构检索指标和专家引导的top-5有用性指标ECA-5(前5名中精确或最接近可接受匹配)来评估该框架,该指标记录返回的候选是否包含至少一个专家判断为精确或可接受为最接近实际CyBOK位置的映射。在开发数据集上,CyBOKClaw达到了64.73%的EXA-5(前5名精确匹配)、84.18%的结构语义对齐和91.88%的ECA-5;在验证数据集上,达到了81.19%的EXA-5、93.32%的结构语义对齐和98.00%的ECA-5。这些结果表明,专家引导的top-k有用性比单纯的精确结构匹配更能忠实地反映实际CyBOK映射效用,并且CyBOKClaw作为一种针对CyBOK的专家支持检索系统是有效的。

英文摘要

This paper presents CyBOKClaw, an interpretable human-in-the-loop retrieval framework for mapping cybersecurity keywords or phrases (KWoPs) to the Cyber Security Body of Knowledge (CyBOK). Rather than treating the task as strict exact classification, the framework is designed as a top-k candidate generator for expert review. It combines query normalization, curated term expansion, concept-level boosts, topic-description enrichment, and domain-sensitive ranking rules. Because educational KWoPs are often broad, ambiguous, and only approximately aligned with CyBOK terminology, strict exact matching provides only a partial account of practical utility. We therefore evaluate the framework using both structural retrieval metrics and an expert-guided top-5 usefulness metric, ECA-5 (Exact or Closest Acceptable Match at top-5), which records whether the returned candidates contain at least one mapping that an expert would judge exact or accept as the nearest practical CyBOK placement. On the development dataset, CyBOKClaw achieves 64.73% EXA-5 (Exact Match at top-5), 84.18% structural semantic alignment, and 91.88% ECA-5; on the validation dataset, it achieves 81.19% EXA-5, 93.32% structural semantic alignment, and 98.00% ECA-5. These results show that expert-guided top-k usefulness provides a more faithful account of practical CyBOK mapping utility than exact structural matching alone, and that CyBOKClaw is effective as a CyBOK-specific expert-support retrieval system.

2605.24651 2026-05-26 math.NA cs.LG cs.NA

WINO: A Weak-Form Physics Informed Neural Operator for Hyperelasticity on Variable Domains

WINO: 一种用于变域超弹性问题的弱形式物理信息神经算子

Bokai Zhu, Qinghui Zhang, Timon Rabczuk

发表机构 * School of Science, Harbin Institute of Technology, Shenzhen, P. R. China(哈尔滨工业大学深圳校区) School of Science, Harbin Institute of Technology, Shenzhen, Guangdong(哈尔滨工业大学深圳校区) Institute of Structural Mechanics, Bauhaus-Universität Weimar(魏玛 Bauhaus 大学结构力学研究所)

AI总结 提出一种无数据框架WINO,结合神经算子的效率与φ-有限元法的几何灵活性,通过最小化弱形式残差和惩罚项训练,实现高精度且计算时间减少50-80%。

详情
AI中文摘要

我们提出了一种弱形式物理信息神经算子(WINO),这是一个无数据框架,结合了神经算子的效率与φ-有限元法(φ-FEM)的几何灵活性。φ-FEM是一种非拟合方法,无需体拟合网格即可适应几何变化,其中域几何由水平集函数φ表示。为了施加边界条件,Dirichlet问题采用φ-FEM提升,因此仅学习齐次位移贡献,而牵引驱动的Neumann问题额外预测非拟合弱形式所需的辅助场。参数通过最小化与φ-FEM对齐的弱形式残差平方以及切割单元辅助方程的平方惩罚来训练,从而消除了对大型配对数据集的依赖。训练后,WINO输出可作为神经算子热启动(NOWS)为非线性φ-FEM求解器提供初始值,相比传统冷启动求解器减少了迭代次数。数值基准测试表明,WINO在所有基准测试中实现了低于0.04的高精度,同时与纯数据驱动方法相比,总计算时间减少了50-80%。

英文摘要

We propose a Weak-form Physics-Informed Neural Operator (WINO), a data-free framework that combines the efficiency of neural operators with the geometric flexibility of the $φ$-finite element method ($φ$-FEM). $φ$-FEM is an unfitted method that accommodates geometric variations without body-fitted meshes, where the domain geometry is represented by the level-set function $φ$. To impose the boundary conditions, Dirichlet problems adopt the $φ$-FEM lifting so only the homogeneous displacement contribution is learned, whereas traction-driven Neumann problems additionally predict the auxiliary fields necessary for the unfitted weak formulation. Parameters are trained by minimizing squared weak-form residuals aligned with $φ$-FEM together with squared penalties on the cut-cell auxiliary equations, which removes the need for large paired datasets of converged reference solutions. After training, WINO outputs can seed the nonlinear $φ$-FEM solvers as neural operator warm starts (NOWS), which reduce iteration counts relative to traditional cold-started solvers. Numerical benchmarks show that WINO achieves high accuracy below 0.04 across all benchmarks, while reducing total computational time by 50--80\% compared with purely data-driven methods.

2605.24632 2026-05-26 cs.CR cs.AI cs.LG

Demystifying the Mythos or Disrupting Bugonomics? From Zero-Day Asymmetry to Defender Remediation Throughput

揭秘神话或颠覆漏洞经济学?从零日不对称到防御者修复吞吐量

Alfredo Pesoli, Herman Errico, Lorenzo Cavallaro

发表机构 * University College London(伦敦大学学院) Bynario

AI总结 本文通过漏洞经济学视角分析LLM驱动的漏洞发现,指出其核心影响并非增加零日漏洞,而是提升防御者修复吞吐量,并利用Anthropic Mythos预览和Mozilla Firefox合作数据论证这一转变。

详情
AI中文摘要

最近,大型语言模型在生产软件中生成候选和确认漏洞的演示,重新引发了AI将重塑攻防安全的叙事。头条新闻强调能力,却很少审视成本和激励。本文通过漏洞经济学视角审视LLM驱动的漏洞发现:即生产、证明、优先级排序和修复安全相关缺陷的操作经济学。历史上,最引人注目的高端漏洞经济学是攻击方定价的,因为生产级零日漏洞和利用链是面向政府、经纪人和攻击方供应商的昂贵专家输出。防御方漏洞经济学早已存在于漏洞研究、奖励计划和供应商修复工作中;LLM辅助系统改变了其规模和分布。它们使得候选生成、代码理解、测试工具构建、影响证明草拟和报告准备在代码库规模上更便宜。利用和概念验证仍然重要,但在防御方工作流中,它们主要用于证明影响、指导优先级排序和证明修复的合理性。由此产生的瓶颈不仅仅是发现更多漏洞,而是吸收、验证、分类、修补和发布更大规模的报告流。利用Anthropic的Mythos预览和Mozilla Firefox合作的公开数据,以及公开的利用市场价格锚点和漏洞奖励计划,我们认为近期的转变不仅仅是更多的零日漏洞。而是向更广泛的防御者修复吞吐量迈进:低信号候选变得更便宜,证据丰富的修复变得更加重要,稀缺的能力转向维护者审查和发布工作。这种影响在开源领域尤为严重,因为LLM辅助发现可以增加报告量,而维护者侧的验证、分类、资金和发布能力可能无法扩展。

英文摘要

Recent demonstrations of large language models producing candidate and confirmed vulnerabilities in production software have renewed the narrative that AI will reshape offensive and defensive security. Headlines emphasize capability; they rarely interrogate costs and incentives. This paper examines LLM-driven vulnerability discovery through a bugonomics lens: the operational economics of producing, proving, prioritizing, and fixing security-relevant defects. Historically, the most visible high-end bugonomics was offense-priced because production-grade zero-days and exploit chains were expensive specialist outputs for governments, brokers, and offensive vendors. Defender-side bugonomics already existed in vulnerability research, reward programs, and vendor remediation work; LLM-assisted systems change its scale and distribution. They make candidate generation, code comprehension, harness construction, proof-of-impact drafting, and report preparation cheaper at codebase scale. Exploits and proofs of concept remain important, but in defender workflows they primarily prove impact, guide prioritization, and justify remediation. The resulting bottleneck is not only finding more bugs; it is absorbing, validating, triaging, patching, and shipping a larger stream of reports. Using public data from Anthropic's Mythos Preview and Mozilla Firefox collaborations, along with public exploit-market price anchors and vulnerability reward programs, we argue that the near-term shift is not simply more zero-days. It is a move toward broader defender remediation throughput: low-signal candidates become cheaper, evidence-rich remediation become more important, and scarce capacity shifts toward maintainer review and release work. The effect is acute in open source, where LLM-assisted discovery can increase report volume while maintainer-side validation, triage, funding, and release capacity may not scale.