arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2410.19153 2026-06-01 cs.LG

Learning Coupled Subspaces for Multi-Condition Spike Data

学习多条件尖峰数据的耦合子空间

Yididiya Y. Nadew, Xuhui Fan, Christopher J. Quinn

AI总结提出耦合子空间高斯过程因子分析（CS-GPFA）模型，联合学习神经活动在条件空间中的潜在表示，并开发主动学习算法自适应选择条件，在合成和真实神经数据集上优于现有方法。

Comments 46 pages, 7 figures

详情

AI中文摘要

在神经科学中，许多研究在多种条件下进行感觉或行为实验，以获取高维尖峰序列数据集形式的神经反应。分析高维尖峰数据是一个具有挑战性的统计问题。为此，高斯过程因子分析（GPFA）作为一种流行的潜变量模型类别，被提出用于单一实验条件下收集的数据。GPFA提取平滑、低维的潜在轨迹，总结高维尖峰数据集。然而，标准GPFA独立推断每个实验条件下的这些轨迹，未考虑底层活动如何在条件空间上变化。这对准确性和潜在表示的可解释性都造成了限制。为解决这些限制，我们提出耦合子空间GPFA（CS-GPFA），一种联合学习潜在表示的贝叶斯模型，表征神经活动如何在条件空间上变化。在此基础上，我们进一步开发了一种主动学习算法，用于自适应选择条件。在合成和真实神经数据集上的实验表明，CS-GPFA相比现有方法实现了更优的性能。此外，我们的主动学习结果显示，CS-GPFA能在实际设置中有效指导实验设计。

英文摘要

In neuroscience, numerous studies conduct sensory or behavioral experiments under multiple conditions to acquire neural responses in the form of high-dimensional spike train datasets. Analyzing high-dimensional spike data is a challenging statistical problem. To this end, Gaussian process factor analysis (GPFA), a popular class of latent variable models, has been proposed for data collected under a single experimental condition. GPFA extracts smooth, low-dimensional latent trajectories that summarize highdimensional spike datasets. However, standard GPFA infers these trajectories independently for each experimental condition, not accounting for how the underlying activity varies across the condition space. This poses limitations on both accuracy and the interpretability of the latent representation. To address these limitations, we propose Coupled Subspaces GPFA (CS-GPFA), a Bayesian model that jointly learns latent representations, characterizing how the neural activity varies over the condition space. Building on this, we further develop an active-learning algorithm for adaptively selecting conditions. Experiments on both synthetic and real neural datasets demonstrate that CS-GPFA achieves superior performance compared to existing approaches. Moreover, our active learning results show that CS-GPFA can efficiently guide experiment design in practical settings.

URL PDF HTML ☆

赞 0 踩 0

2410.15475 2026-06-01 cs.CV

Multimodal Fusion via Self-Consistent Task-Gradient Fields

通过自洽任务梯度场的多模态融合

Jiayu Xiong, Jing Wang, Jun Xue, Wanlong Wang, Jianlong Kwan, Xiaosen Lyu, Zhouqiang Jiang

AI总结提出自洽场自编码器（SCFAE），利用自洽场原理平衡任务学习与特征组织，通过任务损失和重构损失在互补子空间中分离特征，从而鲁棒处理缺失数据和不均匀输入。

Comments ICML 2026 accepted paper

详情

AI中文摘要

多模态学习旨在从不同输入中保留尽可能多的任务相关信息。然而，当前的融合设计常常扭曲对特征提取器的反馈循环。激进地合并模态会纠缠它们的表示，使得特征提取器对不完整输入变得脆弱。同时，试图通过辅助损失分离特征常常引入优化冲突，分散对主要任务的注意力。我们提出自洽场自编码器（SCFAE）为任务梯度提供更好的路径。我们的方法遵循自洽场原理来平衡任务学习与特征组织，从而最小化互信息。我们为每个模态使用小型自编码器以保持信息完整。任务损失作为驱动力选择预测性特征。重构损失作为约束将这些特征分离到独立子空间中。这两个目标通过互补的特征子空间运作，从而减轻优化干扰。我们在音频-视觉-文本、音频-视觉和图像-视频基准上评估SCFAE。结果表明，SCFAE通过简单结构更鲁棒地处理缺失数据和不均匀输入尺寸。梯度分析确认SCFAE避免了冲突并保持了稳定的训练动态。

英文摘要

Multimodal learning aims to preserve as much task-related information as possible from different inputs. However, current fusion designs often distort the feedback loop to feature extractors. Aggressively merging modalities entangles their representations, making the feature extractors fragile to incomplete inputs. Meanwhile, attempting to separate features via auxiliary losses frequently introduces optimization conflicts that distract from the primary task. We propose the Self-Consistent Field Autoencoder (SCFAE) to provide a better path for task gradients. Our method follows the self-consistent field principle to balance task learning with feature organization, thereby minimizing mutual information. We use small autoencoders for each modality to keep information intact. The task loss acts as a driving force to select predictive features. The reconstruction loss acts as a constraint to separate these features into independent subspaces. These dual objectives operate through complementary feature subspaces, thereby mitigating optimization interference. We evaluate SCFAE on audio-visual-text, audio-visual, and image-video benchmarks. Results show that SCFAE handles missing data and unequal input sizes more robustly via a simple structure. Gradient analysis confirms that SCFAE avoids conflicts and maintains stable training dynamics.

URL PDF HTML ☆

赞 0 踩 0

2404.14928 2026-06-01 cs.LG cs.AI cs.CL cs.SI

Graph Machine Learning in the Era of Large Language Models (LLMs)

大语言模型时代的图机器学习

Shijie Wang, Jiani Huang, Zhikai Chen, Yu Song, Wenzhuo Tang, Haitao Mao, Wenqi Fan, Hui Liu, Xiaorui Liu, Dawei Yin, Qing Li

AI总结本文综述了大语言模型如何增强图机器学习的泛化、迁移和少样本学习能力，以及图如何提升大语言模型的推理和可解释性。

Comments Accepted by TIST

详情

AI中文摘要

图在表示社交网络、知识图谱和分子发现等各个领域的复杂关系中扮演着重要角色。随着深度学习的出现，图神经网络（GNN）已成为图机器学习（Graph ML）的基石，促进了图的表示和处理。最近，大语言模型（LLM）在语言任务中展现出前所未有的能力，并被广泛应用于计算机视觉和推荐系统等各种应用中。这一显著成功也引起了将LLM应用于图领域的兴趣。越来越多的努力致力于探索LLM在提升图机器学习的泛化性、迁移性和少样本学习能力方面的潜力。同时，图，尤其是知识图谱，富含可靠的事实知识，可用于增强LLM的推理能力，并可能缓解其局限性，如幻觉和缺乏可解释性。鉴于这一研究方向的快速进展，有必要对LLM时代图机器学习的最新进展进行系统综述，为研究人员和从业者提供深入理解。因此，在本综述中，我们首先回顾了图机器学习的最新发展。然后，我们探讨了如何利用LLM来增强图特征的质量，减轻对标注数据的依赖，并解决图异质性和分布外（OOD）泛化等挑战。之后，我们深入探讨了图如何增强LLM，突出了它们增强LLM预训练和推理的能力。此外，我们调查了各种应用，并讨论了这一有前景领域的潜在未来方向。

英文摘要

Graphs play an important role in representing complex relationships in various domains like social networks, knowledge graphs, and molecular discovery. With the advent of deep learning, Graph Neural Networks (GNNs) have emerged as a cornerstone in Graph Machine Learning (Graph ML), facilitating the representation and processing of graphs. Recently, LLMs have demonstrated unprecedented capabilities in language tasks and are widely adopted in a variety of applications such as computer vision and recommender systems. This remarkable success has also attracted interest in applying LLMs to the graph domain. Increasing efforts have been made to explore the potential of LLMs in advancing Graph ML's generalization, transferability, and few-shot learning ability. Meanwhile, graphs, especially knowledge graphs, are rich in reliable factual knowledge, which can be utilized to enhance the reasoning capabilities of LLMs and potentially alleviate their limitations such as hallucinations and the lack of explainability. Given the rapid progress of this research direction, a systematic review summarizing the latest advancements for Graph ML in the era of LLMs is necessary to provide an in-depth understanding to researchers and practitioners. Therefore, in this survey, we first review the recent developments in Graph ML. We then explore how LLMs can be utilized to enhance the quality of graph features, alleviate the reliance on labeled data, and address challenges such as graph Heterophily and out-of-distribution (OOD) generalization. Afterward, we delve into how graphs can enhance LLMs, highlighting their abilities to enhance LLM pre-training and inference. Furthermore, we investigate various applications and discuss the potential future directions in this promising field.

URL PDF HTML ☆

赞 0 踩 0

2307.04722 2026-06-01 cs.LG

Advances and Challenges in Meta-Learning: A Technical Review

元学习的进展与挑战：技术综述

Anna Vettoruzzo, Mohamed-Rafik Bouguelia, Joaquin Vanschoren, Thorsteinn Rögnvaldsson, KC Santosh

AI总结本文全面综述元学习技术，探讨其与多任务学习、迁移学习等领域的关联，并指出未来研究方向。

详情

AI中文摘要

元学习使学习系统能够从多个任务中获取知识，从而更快地适应和泛化到新任务。本综述对元学习进行了全面的技术概述，强调了其在数据稀缺或获取成本高的实际应用中的重要性。本文涵盖了最先进的元学习方法，并探讨了元学习与多任务学习、迁移学习、领域适应与泛化、自监督学习、个性化联邦学习和持续学习之间的关系。通过突出这些主题与元学习领域之间的协同作用，本文展示了某一领域的进展如何惠及整个领域，同时避免不必要的重复工作。此外，本文深入探讨了高级元学习主题，例如从复杂的多模态任务分布中学习、无监督元学习、学习有效适应数据分布变化以及持续元学习。最后，本文指出了该领域未来研究的开放问题和挑战。通过综合最新的研究进展，本文提供了对元学习及其对各种机器学习应用潜在影响的深入理解。我们相信，这篇技术综述将有助于元学习的进步及其在解决实际问题中的实际应用。

英文摘要

Meta-learning empowers learning systems with the ability to acquire knowledge from multiple tasks, enabling faster adaptation and generalization to new tasks. This review provides a comprehensive technical overview of meta-learning, emphasizing its importance in real-world applications where data may be scarce or expensive to obtain. The paper covers the state-of-the-art meta-learning approaches and explores the relationship between meta-learning and multi-task learning, transfer learning, domain adaptation and generalization, self-supervised learning, personalized federated learning, and continual learning. By highlighting the synergies between these topics and the field of meta-learning, the paper demonstrates how advancements in one area can benefit the field as a whole, while avoiding unnecessary duplication of efforts. Additionally, the paper delves into advanced meta-learning topics such as learning from complex multi-modal task distributions, unsupervised meta-learning, learning to efficiently adapt to data distribution shifts, and continual meta-learning. Lastly, the paper highlights open problems and challenges for future research in the field. By synthesizing the latest research developments, this paper provides a thorough understanding of meta-learning and its potential impact on various machine learning applications. We believe that this technical overview will contribute to the advancement of meta-learning and its practical implications in addressing real-world problems.

URL PDF HTML ☆

赞 0 踩 0

2605.31593 2026-06-01 cs.CR cs.AI

Stateful Online Monitoring Catches Distributed Agent Attacks

有状态在线监控捕获分布式智能体攻击

Davis Brown, Samarth Bhargav, Arav Santhanam, Kasper Hong, Ivan Zhang, Matan Shtepel, Steffi Chern, Alexander Robey, Eric Wong, Hamed Hassani

AI总结针对分布式智能体攻击中跨账户聚合的恶意行为难以被单上下文监控检测的问题，提出一种基于实时聚类的有状态在线监控方法，能够更早、更有效地捕获分布式攻击，同时保持低延迟。

详情

AI中文摘要

语言模型可以发现数千个严重的软件漏洞，并且智能体越来越多地被滥用于网络攻击。为了避免检测，攻击者经常分布他们的滥用行为，将有害任务分割到多个用户账户中，使得每个单独的记录看起来无害。由于安全监控器一次只评估一个智能体上下文，它们在结构上无法检测到仅在跨多个账户的聚合中才可见的滥用行为。我们通过构建据我们所知第一个分布式智能体攻击来证明这一漏洞是真实存在的，该攻击是一个多智能体框架，能够在完成困难的网络安全任务的同时，将有害目标隐藏在具有有限上下文的子智能体中，从而规避标准监控器，后者捕获它的频率仅为先前智能体攻击的五分之一。为了防御，我们开发了一种在线有状态监控器，它使用实时聚类来收集跨多个智能体记录的微弱可疑信号，并且仅在极少数情况下升级到语言模型以标记跨用户账户的滥用行为。在模拟数据中心流量的大规模评估中，我们的监控器帕累托优于标准监控器，提前30%捕获分布式攻击，并在网络滥用达到最有害阶段之前标记出来。至关重要的是，这对于约99%的用户流量带来的额外延迟可以忽略不计。这种检测优势在良性背景流量非常大时仍然存在但会缩小。经过广泛的红队演练，我们改进了防御，并且令人惊讶地发现它也能捕获标准越狱，因为自适应攻击者会跨账户重复使用攻击变体。我们的结果指向了一类新的安全监控器，它们对用户群体而非孤立记录进行推理。

英文摘要

Language models can find thousands of severe software vulnerabilities, and agents are increasingly being misused for cyberattacks. To avoid detection, attackers frequently distribute their misuse, splitting a harmful task across many user accounts so each individual transcript looks benign. Because safety monitors score only one agent context at a time, they are structurally blind to misuse that is only visible in aggregate, across many accounts. We show this gap is real by building, to our knowledge, the first distributed agent attack, a multi-agent scaffold that completes hard cybersecurity tasks while hiding the harmful objective across subagents with limited contexts, evading a standard monitor that catches it only a fifth as often as prior agent attacks. Towards a defense, we develop an online stateful monitor that uses real-time clustering to collect weak suspiciousness signals across many agent transcripts, and escalates only rarely to a language model that flags misuse across user accounts. In evaluations with large-scale simulated datacenter traffic, our monitor Pareto dominates standard monitors, catching distributed attacks 30% earlier and flagging cyber misuse before it reaches the most harmful stages. Crucially, this comes at negligible additional latency for ~99% of user traffic. This detection advantage persists but narrows as the benign background traffic grows very large. After an extensive red-teaming exercise, we improve the defense and surprisingly also find that it catches standard jailbreaks, since adaptive attackers reuse attack variants across accounts. Our results point toward a new class of safety monitors which reason over groups of users rather than isolated transcripts.

URL PDF HTML ☆

赞 0 踩 0

2605.31575 2026-06-01 cs.IR cs.AI

SPECTRA: Synthetic IR Test Collections with Relevance Oracles and Controlled Distractor Diagnostics

SPECTRA: 具有相关性真值表和受控干扰物诊断的合成信息检索测试集

Eric Liang

AI总结提出SPECTRA框架，通过分离潜在主题结构、文本实现、元数据控制、查询意图生成和确定性相关性真值表，生成合成文本语料库和检索测试集，以诊断检索系统的扩展性和故障模式。

详情

AI中文摘要

可扩展的信息检索测试需要足够大的语料库来测试索引构建、排序延迟、查询路由和评估工具，但人工判断的测试集仍然昂贵，并且在文档私有或仍在设计时可能不可用。本文介绍了SPECTRA，一个可复现的框架，通过分离潜在主题结构、表面文本实现、元数据控制、查询意图生成和确定性相关性真值表，生成合成文本语料库和检索测试集。该框架旨在作为Cranfield风格和TREC风格评估的诊断补充，而非替代人工评估。一个单进程Python原型生成了多达60,000个文档和961万个标记的语料库，同时保持了可控的长尾词汇增长，并为96个查询生成了分级相关性标签。在本地模拟研究中，生成速度接近线性，约为每秒12,000到14,000个文档，估计的Zipf斜率绝对值保持在0.86附近，增加跨主题干扰文本使BM25 nDCG@10从2%干扰物时的1.00下降到36%干扰物时的0.43。这些结果表明，轻量级合成语料库可以在昂贵的集合构建开始之前暴露检索系统的扩展性和故障模式。

英文摘要

Scalable information retrieval testing needs corpora that are large enough to stress index construction, ranking latency, query routing, and evaluation tooling, yet human-judged test collections remain expensive and may be unavailable when documents are private or still under design. This paper introduces SPECTRA, a reproducible framework for generating synthetic text corpora and retrieval test collections through a separation of latent topical structure, surface text realization, metadata controls, query intent generation, and deterministic relevance oracles. The framework is intended as a diagnostic complement to Cranfield-style and TREC-style evaluation, not as a replacement for human assessment. A single-process Python prototype generated corpora up to 60,000 documents and 9.61 million tokens while preserving controllable long-tail vocabulary growth and producing graded relevance labels for 96 queries. In the local simulation study, generation remained close to linear at roughly 12K to 14K documents per second, estimated Zipf slopes stayed near 0.86 in absolute value, and increasing cross-topic distractor text reduced BM25 nDCG@10 from 1.00 at 2% distractors to 0.43 at 36% distractors. These results show that lightweight synthetic corpora can expose retrieval-system scaling and failure modes before costly collection construction begins.

URL PDF HTML ☆

赞 0 踩 0

2605.31532 2026-06-01 cond-mat.soft cs.LG

Discovering Thermodynamically Admissible Dissipation Potentials via Grammar-Based Symbolic Regression

通过基于语法的符号回归发现热力学允许的耗散势

Federico Califano, Jacopo Ciambella

AI总结提出一种基于语法的符号回归框架，在广义标准材料形式下自动发现满足热力学约束（凸性和非负性）的耗散势，并在合成数据和实验数据上验证其有效性。

详情

AI中文摘要

非弹性材料的本构定律必须满足严格的热力学允许性要求，然而当前的数据驱动方法即使通过物理编码架构提供了形式保证，也牺牲了可解释性。我们提出了一种符号回归框架，用于在广义标准材料（GSM）形式下数据驱动地发现控制内变量演化的耗散势。从Clausius-Duhem不等式出发，我们强制执行对偶耗散势必须满足的热力学要求——凸性和非负性，以保证非负的机械耗散。这些要求在一般的次微分设置中表述，在一个统一框架内涵盖了率相关（粘弹性）和粘塑性耗散机制，包括具有真正弹性区域的势。候选势由一种复合扩展的保凸语法生成，该语法通过构造保证热力学允许性。该框架在包含牛顿、幂律和Bingham粘塑性真实过程的合成数据集（含过程和测量噪声）上进行了验证，并在合成弹性体的实验振荡剪切测量（多个应变幅度和频率）上进行了验证，其中发现的势再现了动态模量的幅度依赖性软化，并优于校准的线性Zener基线。

英文摘要

Constitutive laws for inelastic materials must satisfy strict thermodynamic admissibility requirements, yet current data-driven approaches sacrifice interpretability, even when formal guarantees are provided by physics-encoded architectures. We propose a symbolic regression framework for the data-driven discovery of dissipation potentials governing the evolution of internal variables within the Generalized Standard Materials (GSM) formalism. Starting from the Clausius--Duhem inequality, we enforce the thermodynamic requirements, convexity and non-negativity, that the dual dissipation potential must satisfy to guarantee non-negative mechanical dissipation. These requirements are formulated in the general subdifferential setting, encompassing rate-dependent (viscoelastic) and viscoplastic dissipative mechanisms, including potentials with genuine elastic domains, within a unified framework. Candidate potentials are generated by a composition-extended convexity-preserving grammar that guarantees thermodynamic admissibility \emph{by construction}. The framework is validated on synthetic datasets spanning Newtonian, power-law, and Bingham viscoplastic ground truths under process and measurement noise, and on experimental oscillatory shear measurements of a synthetic elastomer across multiple strain amplitudes and frequencies, where the discovered potentials reproduce the amplitude-dependent softening of the dynamic moduli and outperform a calibrated linear Zener baseline.

URL PDF HTML ☆

赞 0 踩 0

2605.31520 2026-06-01 cs.SE cs.AI cs.CR

Separating Secrets from Placeholders: A Hybrid CNN-CodeBERT Framework for Three-Class Credential Leakage Detection

区分秘密与占位符：一种用于三类凭证泄露检测的混合CNN-CodeBERT框架

Maksuda Bilkis Baby, Khushika Shah, Naiyue Liang, Lei Zhang

AI总结针对现有凭证泄露检测工具高误报率的问题，提出一种基于CodeBERT语义理解与字符级模式识别的三分类框架，将占位符/弱凭证作为独立类别建模，在新构建的9426样本数据集上达到0.86的MCC和0.90的宏F1分数，将高严重性警报减少33%而不牺牲安全覆盖。

Comments Accepted at ICSME 2026 (International Conference on Software Maintenance and Evolution)

详情

AI中文摘要

公共源代码仓库中的凭证泄露构成严重安全威胁，仅2024年就有超过2380万个秘密被暴露。现有检测工具由于刚性模式匹配和二元分类方案无法区分真实凭证与占位符或弱凭证，导致高误报率。我们提出一个三分类框架，明确将占位符或弱凭证建模为一个独立类别，利用基于CodeBERT的语义理解结合字符级模式识别。我们在一个新构建的包含10种编程语言、9426个样本的数据集上评估了我们的方法。我们的模型实现了0.86的马修斯相关系数和0.90的宏F1分数，对真实凭证泄露达到93%的召回率和89%的精确率，同时将高严重性警报减少了33.0%（从373降至250），且未牺牲安全覆盖。与先前的字符级方法相比，我们的方法将占位符或弱凭证检测的F1分数从54%提升至81%，同时保持了强大的跨语言泛化能力，在留一语言评估中，10种语言中有9种语言的F1分数超过0.80。

英文摘要

Credential leakage in public source code repositories poses a critical security threat, with over 23.8 million secrets exposed in 2024 alone. Existing detection tools suffer from high false-positive rates because rigid pattern matching and binary classification schemes fail to distinguish genuine credentials from placeholder or weak credentials. We propose a three-class classification framework that explicitly models placeholder or weak credentials as a distinct class, leveraging CodeBERT-based semantic understanding combined with character-level pattern recognition. We evaluate our approach on a newly constructed dataset of 9,426 samples spanning 10 programming languages. Our model achieves a Matthews Correlation Coefficient of 0.86 and a macro F1-score of 0.90, achieving 93% recall and 89% precision for genuine credential leaks while reducing high severity alerts by 33.0% (from 373 to 250) without sacrificing security coverage. Compared to prior character-level approaches, our method improves placeholder or weak credential detection from 54% to 81% F1-score while maintaining strong cross language generalization, with 9 of 10 languages achieving F1 above 0.80 under leave-one-language-out evaluation.

URL PDF HTML ☆

赞 0 踩 0

2605.31478 2026-06-01 cs.SE cs.CL cs.SY eess.SY

Knowledge Boundary Probing and Demand-Guided Intervention for LLM-Based Power System Code Generation

知识边界探测与需求引导干预：面向基于LLM的电力系统代码生成

Hui Wu, Xiaoyang Wang, Zhong Fan

AI总结针对LLM在电力系统代码生成中因API知识边界错误导致失败的问题，提出PowerCodeBench基准、L0-L3文档驱动探测和边界感知干预方法，显著提升模型准确率。

Comments 43 pages, 12 figures, includes supplementary material

详情

AI中文摘要

大型语言模型（LLMs）越来越多地被用于自动化电力系统分析，但许多公用事业和能源研究实验室出于保密、监管、可重复性和成本原因，要求本地部署。这使得开源模型的可靠性成为一个部署问题。我们表明，电力系统代码生成中的首次失败并非仅由推理主导，而是由结构化的API知识边界错误主导：在版本化的仿真库中出现虚构的函数名、误用的参数以及处理不当的结果表。我们引入了PowerCodeBench，一个经过执行验证的基准生成器，它将自然语言操作员查询与pandapower代码和数值真值配对；一个L0-L3文档驱动的探测程序，用于测量每个模型的API知识概况；以及一种边界感知干预，将查询侧API需求估计与目标主动文档注入和路由被动修正相结合。在一个包含2000个任务的冻结版本上，我们评估了十个开源LLM（1.5B-480B参数）和四个商业中端API。该干预措施使每个评估的至少7B参数的开源模型和每个商业API提升了32到56个准确率点。70B-120B范围内的开源模型匹配了商业中端准确率范围，而Llama-3.1-405B和Qwen3-Coder-480B领先。目标提示在保持全上下文准确率上限的同时，使用了41%的提示令牌成本。结果是在不进行微调或云端推理的情况下，为电网分析工作流提供可靠的本地LLM辅助的准确率侧、部署时路径。

英文摘要

Large language models (LLMs) are increasingly used to automate power-system analysis, but many utilities and energy-research labs require on-premise serving for confidentiality, regulatory, reproducibility, and cost reasons. This makes the reliability of open-weight models a deployment issue. We show that first-pass failures in power-system code generation are dominated not by reasoning alone, but by structured API-knowledge boundary errors: hallucinated function names, misused parameters, and mishandled result tables in versioned simulation libraries. We introduce PowerCodeBench, an execution-validated benchmark generator that pairs natural-language operator queries with pandapower code and numerical ground truth; an L0-L3 documentation-driven probing procedure that measures per-model API knowledge profiles; and a boundary-aware intervention that combines query-side API demand estimation with targeted proactive documentation injection and routed reactive correction. On a 2,000-task frozen release, we evaluate ten open-weight LLMs (1.5B-480B parameters) and four commercial mid-tier APIs. The intervention improves every evaluated open-weight model of at least 7B parameters and every commercial API by 32 to 56 accuracy points. Open-weight models in the 70B-120B range match the commercial mid-tier accuracy range, while Llama-3.1-405B and Qwen3-Coder-480B lead the panel. The targeted prompts preserve the full-context accuracy ceiling while using 41% of the prompt-token cost. The result is an accuracy-side, deployment-time path toward reliable on-premise LLM assistance for grid-analysis workflows without fine-tuning or cloud inference.

URL PDF HTML ☆

赞 0 踩 0

2605.31445 2026-06-01 cs.GT cs.AI cs.CL cs.LG

Used Car Salesbots? Honesty and Credulity of LLMs as Bargaining Agents under Partial Information

二手车销售机器人？作为讨价还价代理的LLM在部分信息下的诚实与轻信

Antonio Valerio Miceli-Barone, Vaishak Belle, Shay B. Cohen

AI总结研究LLM代理在模拟讨价还价场景中的表现，发现它们偏离博弈论均衡，尝试撒谎但无法有效利用信息不对称，且优化财务效用会增强谈判能力但增加不诚实行为。

Comments 18 pages, 14 figures

详情

AI中文摘要

在这项工作中，我们研究了模拟讨价还价场景中的代理，其中买方和卖方通过文本渠道进行通信，并试图在不同信息制度（完全信息、信息不对称或相互不确定性）下谈判互利交易。我们评估了它们相对于博弈论解决方案的表现，并进一步调查了它们的诚实性（披露或隐瞒信息、误导或欺骗的倾向）以及轻信性（信任或不信任对方提供信息的倾向）。我们研究了零样本LLM代理（使用简单的提示脚手架）以及微调代理，以探讨优化代理以最大化财务利润是否使它们成为更强的谈判者，但也更不诚实和更不信任。我们发现，现成的LLM都显著偏离博弈论均衡，它们试图对自己的私人信息撒谎，但无法有效利用信息不对称。对财务效用的微调使代理在达成更好交易方面更强，但也更不诚实，这突显了优化代理任务对其安全性可能带来的风险。我们发布了我们的代码和一个讨价还价场景数据集。

英文摘要

In this work we study agents in simulated bargaining scenarios, where a buyer and a seller communicate through a text channel and attempt to negotiate mutually beneficial trades, under different information regimes (complete information, information asymmetry or mutual uncertainty). We evaluate their performance w.r.t. game-theoretical solutions and further investigate their honesty (their tendency to disclose or withhold information or to mislead and deceive) as well as their credulity (their tendency to trust or distrust information provided by the other agent). We study zero-shot LLM agents with simple prompting scaffolding as well as fine-tuned agents, in order to investigate whether optimising the agents to maximise financial profits makes them stronger negotiators but also more dishonest and less trusting. We find that off-the-shelf LLMs all substantially deviate from game-theoretical equilibria, they attempt to lie about their private information but cannot efficiently exploit information asymmetries. Fine-tuning on financial utility makes the agents stronger at achieving better deals but also more dishonest, highlighting the risks that optimising agents for a task can have on their safety. We release our code and a dataset of bargaining scenarios.

URL PDF HTML ☆

赞 0 踩 0

2605.31443 2026-06-01 stat.ME cs.LG econ.EM math.ST stat.TH

Modeling Covariate Transition for Efficient Estimation of Longitudinal Treatment Effects in Randomized Experiments

建模协变量转移以高效估计随机实验中的纵向处理效应

Naoki Chihara, Tatsushi Oka, Yasuko Matsubara, Yasushi Sakurai, Shota Yasui

AI总结提出一种回归调整框架，通过建模协变量转移来估计随机实验中的纵向处理效应，并实现渐近正态性和半参数有效性。

Comments Accepted by ICML'26

详情

Journal ref: The 43rd International Conference on Machine Learning, 2026

AI中文摘要

我们提出一个回归调整框架，用于在静态制度下估计随机实验中的纵向处理效应。虽然回归调整方法通过使用预处理协变量有助于随机实验中的方差减少，但它们通常只关注平均效应，从中我们无法获得关于效应何时出现以及持续多久的有价值见解。为了解决这个问题，我们考虑随时间变化的中间结果和事后协变量，并使用转移核表示这些动态轨迹。此外，我们建立了估计量的渐近正态性和半参数效率界，从而实现更强大的统计推断。使用日本某流媒体平台的A/B测试数据进行的模拟研究和实证分析显示了我们的方法的实际优势。

英文摘要

We present a regression-adjustment framework designed for the estimation of longitudinal treatment effects in randomized experiments under static regimes. While regression-adjustment methods are useful for variance reduction in randomized experiments by using pre-treatment covariates, they usually focus only on average effects, from which we cannot obtain valuable insights into when the effects appear and how long they continue. To address this issue, we consider intermediate outcomes and evolving post-treatment covariates over time, and we represent such dynamic trajectories using transition kernels. Furthermore, we establish the asymptotic normality and the semiparametric efficiency bound for our estimator, enabling more powerful statistical inference. Simulation studies and empirical analysis using A/B test data from a streaming platform in Japan show the practical advantages of our method.

URL PDF HTML ☆

赞 0 踩 0

2605.31426 2026-06-01 eess.IV cs.CV math.OC

Self-Tuning Regularization for Image Scanning Microscopy

图像扫描显微镜的自调谐正则化

Sofia Agostoni, Lisa Cuneo, Christian Daniele, Giacomo Garré, Laurent Le, Alessandro Zunino, Giuseppe Vicidomini, Luca Calatroni

AI总结针对图像扫描显微镜（ISM）的多图像反卷积（MID）和超分辨率切片ISM（s²ISM）重建，提出一种自调谐显式正则化框架，通过贝叶斯最大后验公式结合多帧泊松数据保真项与ℓ1或平滑全变分惩罚，并基于残差白化原则自适应选择正则化参数，无需经验停止准则，在低光子条件下实现稳定超分辨和光学切片。

详情

AI中文摘要

图像扫描显微镜（ISM）是一种荧光成像技术，它结合探测器阵列采集和计算重建，实现理想共聚焦显微镜（即使用无穷小针孔）的理论分辨率，同时保持高信噪比。在获得超分辨图像的重建方法中，多图像反卷积（MID）及其旨在保持共聚焦显微镜光学切片能力的扩展（称为超分辨率切片ISM，s²ISM）是最广泛使用的方法之一。这两种方法都依赖于Richardson-Lucy型迭代方案，其半收敛行为需要提前停止，并且常常导致噪声放大和重建伪影。在这项工作中，我们为MID和s²ISM重建引入了一个自调谐显式正则化框架。在贝叶斯最大后验公式中，我们将多帧泊松数据保真项与显式正则化相结合，考虑ℓ1和平滑全变差惩罚作为代表性例子。我们进一步通过将残差白化原则适应于多帧泊松设置，并引入针对s²ISM定制的频谱高通扩展，开发了一种自动且无需真实值的正则化参数选择策略。由此产生的框架无需经验停止规则即可实现稳定重建。为了演示所提出的框架，我们考虑了基于近端梯度和镜像下降方法的一阶优化方案，并采用自适应回溯策略。在模拟和真实荧光ISM数据集上的实验表明，与无正则化方法相比，重建稳定性和图像质量得到改善，同时在低光子条件下实现了鲁棒的超分辨率和光学切片。

英文摘要

Image Scanning Microscopy (ISM) is a fluorescence imaging technique that combines detector-array acquisition and computational reconstruction to achieve the theoretical resolution of an ideal confocal microscope, i.e., one operating with an infinitesimally small pinhole, while maintaining high signal-to-noise ratio. Among the reconstruction methods for obtaining the super-resolved image, multi-image deconvolution (MID) and its extension aimed at preserving the optical sectioning capability of confocal microscopy, known as super-resolution sectioning ISM (s$^2$ISM), are among the most widely used approaches. Both methods rely on Richardson--Lucy-type iterative schemes, whose semi-convergent behavior requires early stopping and often leads to noise amplification and reconstruction artifacts. In this work, we introduce a self-tuning explicit regularization framework for both MID and s$^2$ISM reconstruction. Within a Bayesian maximum a posteriori formulation, we combine a multi-frame Poisson data fidelity term with explicit regularization, considering $\ell_1$ and smoothed total variation penalties as representative examples. We further develop an automatic and ground-truth-free strategy for regularization parameter selection by adapting the residual whiteness principle to the multi-frame Poisson setting and introducing a spectral high-pass extension tailored to s$^2$ISM. The resulting framework enables stable reconstructions without empirical stopping rules. To demonstrate the proposed framework, we consider first-order optimization schemes based on proximal gradient and mirror descent methods with adaptive backtracking strategies. Experiments on simulated and real fluorescence ISM datasets demonstrate improved reconstruction stability and image quality with respect to unregularized approaches, while enabling robust super-resolution and optical sectioning in low-photon conditions.

URL PDF HTML ☆

赞 0 踩 0

2605.31413 2026-06-01 math.ST cs.LG stat.TH

Improved Guarantees for Langevin Monte Carlo with Average Smoothness

Langevin Monte Carlo 的平均光滑性改进保证

Arnak S. Dalalyan, Avetik Karagulyan

AI总结针对强对数凹情形下的 Langevin Monte Carlo，利用平均坐标光滑常数而非全局光滑常数，改进了 Wasserstein 距离下的非渐近误差界，并推广至变步长、Laplacian-Lipschitz 势及有限和问题。

详情

AI中文摘要

我们在强对数凹背景下，当误差由 Wasserstein 距离度量时，建立了 Langevin Monte Carlo 的改进非渐近界。主要结果表明，离散化误差由平均坐标光滑常数控制，而非通常的全局光滑常数。证明简短且概率化，依赖于同步耦合的精细使用。我们进一步表明，相同的想法导致了变步长、Laplacian 是 Lipschitz 连续的势以及通过具有固定点控制变量的随机梯度 Langevin 动力学采样的有限和问题的改进界。在 Laplacian 光滑情形下，通常的 Hessian-Lipschitz 贡献被一个更弱的迹型三阶光滑量所取代。在有限和设定中，得到的 SGLD 界改进了对分量函数均方根光滑性的依赖。应用于具有高斯设计的广义线性模型表明，这些改进可以产生比先前已知界显著且依赖于维度的改进，特别是对于相关协变量。

英文摘要

We establish improved nonasymptotic bounds for Langevin Monte Carlo in the strongly log-concave setting, when the error is measured by the Wasserstein distance. The main result shows that the discretization error is governed by an average coordinate-wise smoothness constant, rather than by the usual global smoothness constant. The proof is short and probabilistic, and relies on a refined use of the synchronous coupling. We further show that the same ideas lead to improved bounds for variable step sizes, for potentials whose Laplacian is Lipschitz-continuous, and for finite-sum problems sampled by stochastic-gradient Langevin dynamics with fixed point control variates. In the Laplacian-smooth case, the usual Hessian-Lipschitz contribution is replaced by a weaker trace-type third-order smoothness quantity. In the finite-sum setting, the resulting SGLD bound improves the dependence on the root mean square smoothness of the component functions. Applications to generalized linear models with Gaussian design show that these refinements can yield substantial, dimension-dependent improvements over previously known bounds, especially for correlated covariates.

URL PDF HTML ☆

赞 0 踩 0

2605.31391 2026-06-01 physics.ins-det cs.LG hep-ex

Deep-learning-based low-energy trigger algorithms for the Hyper-Kamiokande experiment

基于深度学习的Hyper-Kamiokande实验低能量触发算法

Katharina Lachner, Saúl Alonso-Monsalve, Benjamin Richards, Davide Sgalaberna

AI总结本文针对Hyper-Kamiokande实验的低能中微子事件（<7 MeV），提出并比较了监督式神经网络和基于异常检测（自编码器与MPDR）的触发算法，在3 MeV单电子信号上效率分别达76.7%和31.8%，远超传统命中计数触发的26.4%，且GPU推理延迟低于毫秒级，满足实时运行需求。

Comments 16 pages, 6 figures

详情

AI中文摘要

现代机器学习技术因其强大的模式识别能力在粒子物理学中变得越来越重要，包括在具有严格运行时间约束的实时数据采集中。本文详细介绍了针对大型水切伦科夫探测器（如Hyper-Kamiokande）的低能中微子事件（低于7 MeV）的基于深度学习的触发算法的性能。展示了自定义神经网络监督分类器的性能，以及两种仅基于探测器噪声训练的异常检测方法：纯自编码器和基于流形投影-扩散恢复（MPDR）的能量模型。监督模型对动能为3 MeV的单电子信号识别效率为76.7%，显著超过了传统基于命中计数触发的26.4%的信号效率，MPDR方法也达到了31.8%。在GPU上的运行时间评估显示，每窗口推理延迟远低于毫秒量级，表明实时操作是可行的。

英文摘要

Modern machine learning techniques have become increasingly important in particle physics because of their powerful pattern-recognition capabilities, including in real-time data acquisition where stringent runtime constraints apply. This paper details the performance of deep-learning-based trigger algorithms for a large water Cherenkov detector such as Hyper-Kamiokande aimed at low-energy neutrino events (below 7 MeV). The performance of custom neural-network supervised classifiers is shown alongside two anomaly-detection approaches trained solely on detector noise: a pure autoencoder and an energy-based model based on Manifold Projection--Diffusion Recovery (MPDR). The supervised model shows signal identification efficiencies of 76.7% for single electrons of 3 MeV kinetic energy, significantly exceeding signal efficiencies obtained from a traditional hit-count-based trigger of 26.4%, as does the MPDR approach with 31.8%. Runtime evaluations on GPU yield per-window inference latencies well below the millisecond scale, indicating that real-time operation is feasible.

URL PDF HTML ☆

赞 0 踩 0

2605.31377 2026-06-01 cs.IR cs.AI

DynaTree: Dynamic Agentic Retrieval Tree for Time-Sensitive News Retrieval

DynaTree: 面向时效性新闻检索的动态智能检索树

Siyuan Qi, Xinyuan Wang, Yingxuan Yang, Haochuan Guo, Jianghao Lin, Weiwen Liu, Yong Yu, Weinan Zhang

AI总结提出DynaTree两阶段框架，通过离线构建可复用检索树和在线轻量子树选择，实现高效、自适应的时效性新闻检索，在Syft新闻基准和BEIR数据集上优于标准RAG和现有智能体方法。

详情

AI中文摘要

智能体检索增强生成通过集成规划、工具使用和迭代推理改进了检索，但现有的智能体RAG方法通常将语义扩展与检索决策耦合在短视推理循环中，导致推理成本高且不适用于时效性新闻检索。我们提出DynaTree，一个高效自适应新闻检索的两阶段框架。在离线阶段，DynaTree使用协调的智能体构建一个可复用的检索树，具体化查询主题的语义空间。在在线阶段，DynaTree在时间局部评估代理上执行轻量级日常子树选择，无需进一步的智能体推理、树修改或重新训练。在多日Syft新闻基准和多个BEIR数据集上的实验表明，DynaTree实现了强大的召回和排序性能，始终优于标准RAG和先前的智能体基线。我们进一步在Syft生产系统中部署DynaTree，并通过2026年1月28日至2月6日的在线A/B测试进行评估。动态适应变体将固定离线选择子树的生存率从0.32-0.53提高到0.59-0.73，并且在每个评估日都优于现有的生产召回器。这些结果表明，持久的、结构感知的语义扩展可以将离线智能体推理转化为实际改进，覆盖范围、新鲜度和相关性在真实世界新闻检索中均得到提升。

英文摘要

Agentic Retrieval-Augmented Generation improves retrieval by integrating planning, tool use, and iterative reasoning, but existing agentic RAG methods often couple semantic expansion with retrieval decisions in short-horizon inference loops, leading to high inference cost and limited suitability for time-sensitive news retrieval. We propose DynaTree, a two-stage framework for efficient and adaptive news retrieval. In the offline stage, DynaTree uses coordinated agents to construct a reusable retrieval tree that materializes the semantic space of a query topic. In the online stage, DynaTree performs lightweight daily subtree selection over a time-localized evaluation proxy, without further agentic reasoning, tree modification, or retraining. Experiments on a multi-day Syft news benchmark and multiple BEIR datasets show that DynaTree achieves strong recall and ranking performance, consistently outperforming standard RAG and prior agentic baselines. We further deploy DynaTree in the Syft production system and evaluate it through online A/B testing from Jan. 28 to Feb. 6, 2026. The dynamically adapted variant improves survival rate from 0.32-0.53 to 0.59-0.73 over a fixed offline-selected subtree and outperforms existing production recallers on every evaluation day. These results show that persistent, structure-aware semantic expansion can translate offline agentic reasoning into practical improvements in coverage, freshness, and relevance for real-world news retrieval.

URL PDF HTML ☆

赞 0 踩 0

2605.31361 2026-06-01 cs.MA cs.AI cs.LG

Dreaming Of Others: Latent Teammate Modeling In World Models For Multi-Agent Reinforcement Learning

梦见他人：多智能体强化学习中世界模型内的潜在队友建模

Tomas Leroy-Stone

AI总结提出一种将队友建模为世界模型中可学习组件的方法，通过分解潜在状态并引入心智理论头来推断队友行为，实现零样本和少样本协调。

Comments 5 pages, 2 figures. Accepted as a poster at the 2026 World Modeling Workshop. Conceptual workshop paper

详情

AI中文摘要

在合作多智能体强化学习（MARL）中，智能体必须与内部策略和意图不可直接观察的伙伴协调。虽然像Dreamer这样的世界模型在单智能体设置中表现出强大的泛化能力和样本效率，但它们由于无法处理队友引起的不确定性而在MARL中的应用受到限制。我们提出一个新的视角：将队友视为智能体世界模型中的结构化、可学习组件。我们引入一种架构，将Dreamer风格的循环状态空间模型（RSSM）的潜在状态分解为环境和队友组件，并学习一个辅助的心智理论（ToM）头，从部分轨迹中推断队友行为的潜在嵌入，如角色、意图和预测动作。这些队友潜在变量影响演员和评论家，使智能体能够想象并适应多样化的合作者。我们概述了这种方法如何在部分可观察设置中支持零样本和少样本协调，并提出了一套基准测试和评估协议来评估其影响。这项工作将世界模型定位为不仅是环境动态的预测器，而且是社会行为的模拟器，为可泛化、与人类兼容的AI开辟了新方向。

英文摘要

In cooperative multi-agent reinforcement learning (MARL), agents must coordinate with partners whose internal policies and intentions are not directly observable. While world models such as Dreamer have demonstrated strong generalization and sample efficiency in single-agent settings, their application to MARL remains limited by an inability to handle teammate-induced uncertainty. We propose a new perspective: treat teammates as structured, learnable components within the agent's world model. We introduce an architecture that factorizes the latent state of a Dreamer-style recurrent state-space model (RSSM) into environment and teammate components, and learns an auxiliary Theory-of-Mind (ToM) head to infer latent embeddings of partner behavior such as character, intent, and predicted actions from partial trajectories. These teammate latents condition the actor and critic, enabling the agent to imagine and adapt to diverse collaborators. We outline how this approach can support zero-shot and few-shot coordination in partially observable settings and propose a set of benchmarks and evaluation protocols to assess its impact. This work positions world models as not only predictors of environmental dynamics, but as simulators of social behavior, opening new directions for generalizable, human-compatible AI.

URL PDF HTML ☆

赞 0 踩 0

2605.31346 2026-06-01 math.OC cs.LG

Wall-Clock Complexity for Zeroth-Order Optimization with Tunable Oracle Fidelity

可调 oracle 保真度的零阶优化的挂钟复杂度

Alexandra Suvorikova, Igor Pavlov, Artem Vasin, Georgii Bychkov, Anastasia Antsiferova, Darina Dvinskikh, Alexander Gasnikov

AI总结针对零阶优化中 oracle 保真度可调的场景，提出挂钟复杂度模型并分析参数选择对总时间的影响，揭示加速方法可能劣于非加速方法，并刻画恒定保真度策略最优的条件。

详情

AI中文摘要

零阶（黑箱）优化应用于梯度不可用且目标评估依赖昂贵模拟的情况。在许多此类应用中，oracle 保真度是可调的：更高精度的查询降低噪声但增加计算成本。为捕捉这一权衡，我们研究一个精度感知的挂钟模型，其中每次保真度为 $\delta$ 的查询具有成本 $c(\delta)$，并在目标精度约束下最小化总时间 $T_{\mathrm{total}} = \sum_{k=1}^{N} c(\delta_k)$。我们展示了 oracle 类型、噪声模型和优化方案的选择如何导致算法参数的显式挂钟最优选择。例如，我们证明加速方法在挂钟时间上可能劣于非加速方案。此外，我们刻画了恒定保真度策略在 Big-O 意义上最优的条件。我们的框架提供了一种统一的方法，将收敛保证转化为实际的保真度和批处理建议。

英文摘要

Zeroth-order (black-box) optimization is applied when gradients are unavailable and objective evaluations rely on expensive simulations. In many such applications, the oracle fidelity is tunable: higher-accuracy queries reduce noise but incur higher computational costs. To capture this trade-off, we study an accuracy-aware wall-clock model where each query with fidelity $δ$ has a cost $c(δ)$, and we minimize the total time $T_{\mathrm{total}} = \sum_{k=1}^{N} c(δ_k)$, subject to a target accuracy constraint. We show how the choice of oracle type, noise model, and optimization scheme induces explicit wall-clock-optimal choices for the algorithmic parameters. For instance, we demonstrate that accelerated methods can be wall-clock inferior to non-accelerated schemes. Furthermore, we characterize the conditions under which a constant fidelity strategy is optimal in the Big-O sense. Our framework provides a unified methodology to translate convergence guarantees into practical fidelity and batching recommendations.

URL PDF HTML ☆

赞 0 踩 0

2605.31345 2026-06-01 stat.ML cs.LG stat.ME

Log-Ratio Propagation on the Simplex: A Theory of Cellwise Contamination for Compositional Data

单纯形上的对数比传播：成分数据细胞污染的理论

Matthias Templ

AI总结本文提出单纯形上细胞污染的理论，通过乘法扰动和传播定理证明单个成分污染导致对数比向量秩一偏移，并揭示欧几里得细胞方法在单纯形上的失效与降维现象。

Comments 50 pages, no figures; 11-page supplement included as an ancillary file. A companion methods paper (cellPcaCoDa: cellwise-robust PCA for compositional data) is forthcoming

详情

AI中文摘要

成分数据必须通过对数比进行分析：尺度不变性，该领域的定义公理，别无选择。中心化对数比除以每个部分的几何平均值，因此单个受污染成分会同时移动所有中心化对数比坐标，将对数比向量位移一个固定量，任何坐标选择都无法减少。我们围绕这一观察发展了单纯形上细胞污染的理论。基于乘法扰动的尺度不变污染模型与传播定理相结合，表明单个原始部分的腐败会导致对数比向量的秩一偏移，方向由对比矩阵决定。由此产生的扰动模式不等同于对数比坐标中的任何独立细胞污染模型——因此，应用于对数比的标准欧几里得细胞方法在单纯形污染机制下是不适定的。对于其欧几里得细胞崩溃由列集中配置见证的估计量——包括MCD、$S$-、$τ$-和坐标$M$-估计量的位置和散度——单纯形上的细胞崩溃值相对于其欧几里得对应值减少了因子$(D-1)/D$，这种减少是紧的，并且纯粹源于$nD$个原始细胞与$n(D-1)$个ilr细胞之间的归一化不匹配。变异矩阵的细胞影响函数携带诊断指纹：单个部分的污染恰好膨胀一行和一列，从而识别出责任成分。这些结果为单纯形上的细胞鲁棒方法奠定了理论基础；一篇配套论文开发了一种利用传播几何的细胞鲁棒PCA估计器，并在模拟和地球化学数据上进行了演示。

英文摘要

Compositional data must be analysed through log-ratios: scale invariance, the defining axiom of the field, leaves no alternative. The centred log-ratio divides by the geometric mean of every part, so a single contaminated component shifts every centred-log-ratio coordinate at once, displacing the log-ratio vector by a fixed amount that no choice of coordinates can reduce. We develop a theory of cellwise contamination on the simplex around this observation. A scale-invariant contamination model built from multiplicative perturbation combines with a propagation theorem showing that corruption of a single raw part induces a rank-one shift of the log-ratio vector, with direction determined by the contrast matrix. The resulting perturbation pattern is not equivalent to any independent cellwise contamination model in log-ratio coordinates -- so standard Euclidean cellwise methods applied to log-ratios are ill-posed under the simplex contamination mechanism. For estimators whose Euclidean cellwise breakdown is witnessed by a column-concentrated configuration -- a class including MCD, $S$-, $τ$-, and coordinate-wise $M$-estimators of location and scatter -- the cellwise breakdown value on the simplex is reduced by the factor $(D-1)/D$ relative to its Euclidean counterpart, a reduction that is tight and arises purely from the normalisation mismatch between $nD$ raw cells and $n(D-1)$ ilr cells. The cellwise influence function for the variation matrix carries a diagnostic fingerprint: contamination of a single part inflates exactly one row and column, identifying the responsible component. These results form the theoretical foundation for cellwise-robust methods on the simplex; a companion paper develops a cellwise-robust PCA estimator that exploits the propagation geometry and demonstrates it on simulated and geochemical data.

URL PDF HTML ☆

赞 0 踩 0

2605.31340 2026-06-01 cs.HC cs.AI

Appropriateness of Empathy in AI: A Signal-Cost Perspective

AI中同理心的适当性：信号-成本视角

Chi-Ching Juan, Tao Wang, Harold Lee

AI总结本文从信号-成本视角出发，运用信号理论提出信号成本代理（情感丰富性、观点采择和情境定制）来评估AI同理心的适当性，建立多维度框架以系统评价同理心是否适应用户需求。

Comments Accepted by IEEE CASCON 2025

2605.31330 2026-06-01 cs.GT cs.AI cs.MA math.OC nlin.AO

Social welfare optimisation under institutional reward and punishment

制度奖惩下的社会福利优化

Van An Nguyen, Vuong Khang Huynh, Huu Loi Bui, Hai Anh Ha, Quang Dung Le, Tan Dat Nguyen, Ngoc Ngu Nguyen, Zhao Song, Manh Hong Duong, Le Hong Trang, The Anh Han

AI总结研究在有限混合群体中，通过奖励合作者或惩罚背叛者来最大化社会福利的激励机制，推导出最优激励的显式表达式和相变条件，并比较奖励与惩罚的福利效果。

详情

AI中文摘要

制度激励被广泛用于促进从人类社会到多智能体和AI系统中自主、自利代理人的合作。现有工作通常将激励设计视为双目标问题：在实现高长期合作频率的同时最小化制度成本。此类方案是否也能最大化社会福利——即扣除制度支出后的总人口收益——在很大程度上尚未被探索。我们针对有限、充分混合的群体中参与社会困境（捐赠博弈和公共品博弈）的情况，开发了一个以福利为中心的激励框架，同时考虑对合作者的奖励和对背叛者的惩罚。对于每种机制，我们推导出预期社会福利的显式表达式，并刻画其如何依赖于激励效率和选择强度。在解析上，我们识别出社会福利具有单一最优激励水平的参数区间，以及出现定性相变、福利非单调且具有多个局部最优的区间。我们证明任何最大化福利的激励要么为零，要么集中在简单的闭式目标附近，并提供了一种高效算法来计算这些最优值。通过比较奖励和惩罚，我们进一步推导出在给定预算下奖励在福利方面优于惩罚的闭式条件。总体而言，我们的结果揭示了针对成本或合作频率优化的激励与最大化福利的激励之间存在系统性差距。

英文摘要

Institutional incentives are widely used to promote cooperation among autonomous, self-regarding agents, from human societies to multi-agent and AI systems. Existing work typically treats incentive design as a bi-objective problem: minimise institutional cost while achieving a high long-run frequency of cooperation. Whether such schemes also maximise social welfare - total population payoff net of institutional expenditure - has remained largely unexplored. We develop a welfare-centric framework for institutional incentives in finite, well-mixed populations playing a social dilemma (Donation Game and Public Goods Game), considering both rewards for cooperators and punishments for defectors. For each mechanism, we derive explicit expressions for expected social welfare and characterise how it depends on incentive efficiency and selection intensity. Analytically, we identify parameter regimes where social welfare has a single optimal incentive level and regimes with qualitative phase transitions, in which welfare becomes non-monotonic with multiple local optima. We prove that any welfare-maximising incentive is either zero or concentrated around a simple closed-form target, and we provide an efficient algorithm to compute these optima. Comparing reward and punishment, we further derive close-formed conditions under which reward outperform punishment in terms of social welfare for any given budget. Overall, our results reveal a systematic gap between incentives optimised for cost or cooperation frequency and those that maximise welfare.

URL PDF HTML ☆

赞 0 踩 0

2605.31311 2026-06-01 math.OC cs.DC cs.LG

S$^3$LDBO: A Snapshot Single-Loop Algorithm for Decentralized Bilevel Optimization

S$^3$LDBO: 一种用于去中心化双层优化的快照单循环算法

Chao Yin, Youran Dong, Shiqian Ma, Bofan Wang, Junfeng Yang

AI总结提出S$^3$LDBO算法，通过快照机制间歇跳过昂贵导数计算，实现去中心化双层优化的高效单循环求解，并理论证明其复杂度，实验验证计算效率与学习性能的平衡。

详情

AI中文摘要

网络化AI系统日益依赖多个智能体通过通信网络协作学习和适应模型。在此类系统中，双层公式自然出现在超参数优化、数据清洗和元学习中，但梯度、雅可比矩阵和海森矩阵的重复评估可能给单个智能体带来巨大计算负担。为应对这一挑战，我们提出Snapshot-SLDBO（S$^3$LDBO），一种高效的单循环去中心化双层优化算法，通过快照机制使智能体能够间歇性地跳过昂贵的导数计算。该机制可解释为网络化AI的自主计算-适应策略，其中智能体选择性执行昂贵的局部更新，同时保持全局协作学习。我们在确定性设定下建立了所提出算法的遍历迭代复杂度和高概率非遍历迭代复杂度。在合成数据集和MNIST数据集上的超参数优化、Fashion-MNIST上的数据超清洗以及miniImageNet上的去中心化元学习实验结果表明，所提出算法在保持竞争性学习性能的同时提高了计算效率。

英文摘要

Networked AI systems increasingly rely on multiple agents that collaboratively learn and adapt models over communication networks. In such systems, bilevel formulations naturally arise in hyperparameter optimization, data cleaning, and meta-learning, but the repeated evaluation of gradients, Jacobians, and Hessians can impose a substantial computational burden on individual agents. To address this challenge, we propose Snapshot-SLDBO (S$^3$LDBO), an efficient single-loop decentralized bilevel optimization algorithm that enables agents to intermittently skip expensive derivative evaluations through a snapshot mechanism. This mechanism can be interpreted as an autonomous computation-adaptation strategy for networked AI, where agents selectively perform costly local updates while maintaining global collaborative learning. We establish the ergodic iteration complexity and the high probability nonergodic iteration complexity of the proposed algorithm within a deterministic setting. Experimental results on hyperparameter optimization with synthetic and MNIST datasets, data hyper-cleaning on Fashion-MNIST, and decentralized meta-learning on miniImageNet demonstrate that the proposed algorithm improves computational efficiency while maintaining competitive learning performance.

URL PDF HTML ☆

赞 0 踩 0

2605.31302 2026-06-01 eess.IV cs.CV eess.SP

MoE-dqINR: A Unified Mixture-of-Experts Implicit Neural Representation Framework for Scan-Specific Dynamic and Quantitative MRI Reconstruction

MoE-dqINR：用于特定扫描动态和定量MRI重建的统一混合专家隐式神经表示框架

Yinzhe Wu, Fanwen Wang, Zhenxuan Zhang, Zi Wang, Chengyan Wang, Guang Yang

AI总结提出MoE-dqINR框架，通过共享空间专家和状态条件路由路径，实现高效、统一的特定扫描多线圈动态和定量MRI重建，优化时间约30秒。

详情

AI中文摘要

欠采样磁共振成像（MRI）重建旨在从不完整的多线圈k空间数据中恢复时间或对比度变化的图像序列，同时为动态和定量MRI（qMRI）保留状态相关的保真度。现有的特定扫描隐式神经表示（INR）通常使用单一的时空坐标场、显式子空间、运动或变形模型、校准变量或序列特定的定量信号模型。这些设计选择在跨采集状态适应图像合成的同时，限制了共享空间信息的灵活性。此外，许多基于INR的基线方法计算量大，通常需要每个扫描数百到数千秒的优化时间。我们提出MoE-dqINR，一种特定扫描的多线圈MRI重建框架，将图像域表示分解为共享空间专家和状态条件路由路径。空间专家编码可重用的坐标相关图像内容，而路由权重（以有序采集状态为条件）从公共专家库合成每个动态帧或对比状态。该表示与多线圈MRI前向模型耦合，使用归一化状态索引驱动动态和定量MRI中的路由。通过将共享空间表示与状态相关合成分离，该框架为动态和定量MRI提供了一种以图像为先的架构，同时在我们的实验中将特定扫描INR优化减少到每扫描约30秒。所提出的公式建立了状态条件混合专家INR作为特定扫描多线圈MRI重建先验，统一了共享空间表示、动态和qMRI特定合成以及实际每扫描效率。

英文摘要

Undersampled magnetic resonance imaging (MRI) reconstruction seeks to recover temporally or contrast-varying image series from incomplete multicoil k-space data while preserving state-dependent fidelity for dynamic and quantitative MRI (qMRI). Existing scan-specific implicit neural representations (INRs) often use monolithic spatiotemporal coordinate fields, explicit subspaces, motion or deformation models, calibration variables, or sequence-specific quantitative signal models. These design choices can limit flexibility in sharing spatial information while adapting image synthesis across acquisition states. Moreover, many INR-based baselines remain computationally demanding, typically requiring per-scan optimization times on the order of hundreds to thousands of seconds. We propose MoE-dqINR, a scan-specific multicoil MRI reconstruction framework that factorizes the image-domain representation into shared spatial experts and a state-conditioned routing pathway. Spatial experts encode reusable coordinate-dependent image content, whereas routing weights, conditioned on ordered acquisition states, synthesize each dynamic frame or contrast state from a common expert bank. The representation is coupled to a multicoil MRI forward model, uses the normalized state index to drive routing in both dynamic and quantitative MRI. By separating shared spatial representation from state-dependent synthesis, the framework provides an image-first architecture for dynamic and quantitative MRI while reducing scan-specific INR optimization to approximately 30 s per scan in our experiments. The proposed formulation establishes state-conditioned mixture-of-experts INR as a scan-specific multicoil MRI reconstruction prior that unifies shared spatial representation, dynamic- and qMRI-specific synthesis, and practical per-scan efficiency.

URL PDF HTML ☆

赞 0 踩 0

2605.31296 2026-06-01 q-bio.BM cs.LG

mRNAutilus: Multi-Objective-Guided Discrete Generation of mRNA with Optimized Therapeutic Properties

mRNAutilus：多目标引导的mRNA离散生成与优化治疗特性

Sawan Patel, Sophia Tang, Yesol Kim, Yinuo Zhang, Divya Srijay, Ping-Jung Lin, Shambhavi Shubham, Fengmei Pi, Cedric Wu, Sherwood Yao, Pranam Chatterjee

AI总结提出mRNAutilus框架，结合掩码离散扩散模型和蒙特卡洛树引导，实现同时优化密码子和从头设计UTR，生成多目标帕累托最优的完整mRNA序列，在多个靶标上显著提升表达和稳定性。

详情

AI中文摘要

治疗性mRNA设计需要协调整个转录本中多个相互作用的序列特征，其中密码子使用、非翻译区（UTR）及其耦合共同决定稳定性、翻译效率和蛋白质表达。在这里，我们提出通过展开轨迹和信息潜在更新生成mRNA（mRNAutilus），这是一个直接从序列进行同时密码子优化和从头UTR设计的框架。mRNAutilus结合了在数百万全长mRNA上训练的掩码离散扩散模型与蒙特卡洛树引导，在多个功能目标下生成帕累托高效序列，使用模型嵌入上的轻量级回归器预测半衰期、翻译效率和蛋白质丰度。与最近分别设计编码序列和UTR或依赖事后组装和筛选的方法不同，mRNAutilus在单个过程中生成完整转录本，并跨属性优化。在多种靶标上，编码P. pyralis荧光素酶的零样本mRNA表达量比野生型高400倍以上，并优于商业和机器学习设计的基线，包括零样本生成方法。零样本SARS-CoV-2 Spike mRNA超过临床使用和商业构建体，并匹配或超越实验室优化设计，同时具有更好的耐久性。我们进一步展示了在治疗环境中的通用性，包括先导编辑（PEMax）和可编程蛋白质组调节，其中mRNAutilus设计的构建体增强了用于β-连环蛋白降解的肽引导E3连接酶（uAbs）的表达。这些结果建立了一个基于序列的多目标框架，用于生成适用于多种生物应用的功能性mRNA。

英文摘要

Therapeutic mRNA design requires coordinating multiple interacting sequence features across the full transcript, where codon usage, untranslated regions (UTRs), and their coupling jointly determine stability, translation efficiency, and protein expression. Here, we present mRNA generation via unrolled trajectories and informed latent updates (mRNAutilus), a framework for simultaneous codon optimization and de novo UTR design directly from sequence. mRNAutilus combines a masked discrete diffusion model trained on millions of full-length mRNAs with Monte Carlo Tree Guidance to generate Pareto-efficient sequences under multiple functional objectives, using lightweight regressors over model embeddings to predict half-life, translation efficiency, and protein abundance. Unlike recent methods that design coding sequences and UTRs separately or rely on post hoc assembly and screening, mRNAutilus generates complete transcripts in a single process optimized across properties. Across diverse targets, zero-shot mRNAs encoding P. pyralis luciferase achieve over 400-fold higher expression than wild-type and outperform commercial and machine learning-designed baselines, including zero-shot generative approaches. Zero-shot SARS-CoV-2 Spike mRNAs exceed clinically used and commercial constructs and match or surpass lab-optimized designs with improved durability. We further demonstrate generality in therapeutic settings, including prime editing (PEMax) and programmable proteome modulation, where mRNAutilus-designed constructs enhance expression of peptide-guided E3 ligases (uAbs) for beta-catenin degradation. These results establish a sequence-based, multi-objective framework for generating functional mRNAs tailored to diverse biological applications.

URL PDF HTML ☆

赞 0 踩 0

2605.31291 2026-06-01 cs.IR cs.LG

Contextual Scalarisation Thompson Sampling for multi-objective decisions in public media

面向公共媒体多目标决策的上下文标量化汤普森采样

Théo Maëtz, Luc Guillet, Andrea Cavallaro

AI总结提出上下文标量化汤普森采样（CSTS）方法，通过学习上下文相关的目标权重，在公共媒体推荐中平衡多个竞争目标，实验表明其优于固定权重和标准上下文赌博机方法。

Comments 15 pages, 3 figures, 3 tables. Submitted-manuscript version of a paper accepted at ICPR 2026. The Version of Record will be published in the Springer Lecture Notes in Computer Science series; DOI will be added when available

2605.31287 2026-06-01 cs.CY cs.AI cs.HC

Neither Replacement nor Panacea: Comparing LLM-Based Conversational and Graphical Decision Support in Industrial Tasks

既非替代品也非万能药：比较基于LLM的对话式与图形化决策支持在工业任务中的应用

Roberto Figliè, Simone Caputo, Alan Serrano, Daria Mikhaylova, Tommaso Turchi, Daniele Mazzei

AI总结通过混合因子实验，比较基于LLM的对话式界面与仪表盘在工业决策支持中的效果，发现对话界面在低复杂度任务中降低认知负荷和加快完成时间，但优势随任务复杂度增加而消失，且未提高决策准确性。

详情

AI中文摘要

制造业环境中的管理者依赖数字界面解读运营数据以进行决策，但不断增长的数据量和复杂性使得高效识别相关洞察变得困难。虽然仪表盘在工业环境中仍占主导地位，但通过对话式用户界面（CUI）访问的基于大型语言模型（LLM）的对话代理（CA）可能提供更直接的数据访问。然而，其有效性可能取决于任务的信息处理需求。本研究在制造决策支持场景中比较了通过CUI提供的基于LLM的CA与仪表盘。在一个2x3设计的混合因子实验中，134名工业决策者被分配到一种界面条件，并完成三个复杂度递增的任务。我们考察了感知心理负荷（MWL）、决策准确性、完成时间和预期依赖，并测试了自我报告的数据素养作为调节变量。结果显示，CUI总体上降低了感知MWL，并在低要求任务中支持更快的完成，但随着任务复杂度增加，这两个优势均减弱。两种界面在决策准确性上均未产生一致的整体优势，且CUI不被偏好作为后续决策的唯一基础。此外，数据素养并未可靠地调节界面效应。这些发现表明，对话式交互为工业决策支持提供的是有条件而非普遍的好处。基于LLM的CA可能减少信息访问努力，而复杂决策仍然受益于持久、可检查的视觉表示。

英文摘要

Managers in manufacturing settings rely on digital interfaces to interpret operational data for decision-making, but growing data volume and complexity can make relevant insights difficult to identify efficiently. While dashboards remain dominant in industrial contexts, Large Language Model (LLM)-based conversational agents (CAs), accessed through conversational user interfaces (CUIs), may provide more direct access to such data. However, their effectiveness may depend on the information-processing demands of the task. This study compares an LLM-based CA delivered through a CUI with a dashboard in a manufacturing decision-support scenario. In a mixed factorial experiment with a 2x3 design, 134 industrial decision-makers were assigned to one interface condition and completed three tasks of increasing complexity. We examined perceived Mental Workload (MWL), decision accuracy, completion time, and intended reliance, and tested self-reported data literacy as a moderator. Results showed that the CUI reduced perceived MWL overall and supported faster completion in less demanding tasks, but both advantages diminished as task complexity increased. Neither interface produced a consistent overall advantage in decision accuracy, and the CUI was not preferred as a sole basis for subsequent decisions. Furthermore, data literacy did not reliably moderate interface effects. These findings indicate that conversational interaction offers conditional rather than universal benefits for industrial decision support. LLM-based CAs may reduce information-access effort, whereas complex decisions continue to benefit from persistent, inspectable visual representations.

URL PDF HTML ☆

赞 0 踩 0

2605.31279 2026-06-01 eess.SP cs.AI cs.NI

Practical Cross-Band Channel Prediction for AI-RAN via Physics-Guided Deep Unfolding

面向AI-RAN的实用跨频段信道预测：基于物理引导的深度展开

Ruiqi Kong, He Chen, Xiaojun Lin

AI总结提出GUIDE框架，通过将无线信道物理嵌入可微层，实现跨频段信道预测的泛化与实时推理，在未见环境中波束赋形增益比深度学习基线FIRE高2.75倍，比模型基线R2F2高1.39倍且速度快1610倍以上。

Comments 2 pages

2605.31277 2026-06-01 cs.CR cs.LG

GETA: Generalized Encrypted Traffic Analysis

GETA: 通用加密流量分析

Ransika Gunasekara, Rahat Masood, Salil Kanhere

AI总结提出GETA框架，通过元学习、嵌入优化和自注意力机制，仅利用流量元数据建模多变量时间序列，实现跨协议、少样本的加密流量分析，在应用识别、VPN分类、IoT设备指纹和攻击检测等任务上优于现有方法。

详情

AI中文摘要

传统流量分析正受到加密、隧道和隐私保护协议的快速采用的根本性挑战，这些协议日益模糊数据包载荷并限制深度包检测（DPI）的实用性。尽管机器学习推进了加密流量分析，但现有方法通常仍依赖于特定协议的头部特征，依赖大量标注数据集，并在跨异构网络环境部署时性能下降。我们提出GETA，一个协议无关的加密流量分析框架，它仅使用流量元数据将网络流建模为多变量时间序列，从而避免了对数据包载荷或头部语义的依赖。GETA结合了元学习、嵌入优化和自注意力机制，以支持对未见过的领域进行少样本适应，仅需极少的标注数据。在涵盖应用识别、VPN流量分类、IoT设备指纹识别和攻击检测的九个公开数据集上，GETA始终优于最先进的基线方法。这些结果表明，GETA为现代加密网络中的鲁棒流量分析提供了一个实用且可泛化的基础。

英文摘要

Traditional traffic analysis is being fundamentally challenged by the rapid adoption of encryption, tunnelling, and privacy-preserving protocols, which increasingly obscure packet payloads and limit the usefulness of Deep Packet Inspection (DPI). Although machine learning has advanced encrypted traffic analysis, existing approaches often remain tied to protocol-specific header features, depend on large labelled datasets, and degrade when deployed across heterogeneous network environments. We present GETA, a protocol-agnostic framework for encrypted traffic analysis that models network flows as multivariate time series using only traffic metadata, thereby avoiding reliance on packet payloads or header semantics. GETA combines meta-learning, embedding refinement, and self-attention to support few-shot adaptation to previously unseen domains with minimal labelled data. Across nine public datasets spanning application identification, VPN traffic classification, IoT device fingerprinting, and attack detection, GETA consistently outperforms state-of-the-art baselines. These results show that GETA offers a practical and generalisable foundation for robust traffic analysis in modern encrypted networks.

URL PDF HTML ☆

赞 0 踩 0

2605.31275 2026-06-01 cs.HC cs.AI

Personalized to Persuade: The Effects of Contextualization and Warmth on Trust and Reliance in Conversational AI

个性化以说服：情境化和温暖对对话式AI中信任与依赖的影响

Mert Yazan, Suzan Verberne, Frederik Bungaran Ishak Situmeang

AI总结通过2x2被试间实验（N=380），研究情境化与对话温暖如何交互影响AI助手在反驳专家建议时的说服力与用户依赖，发现情境化降低说服力但与温暖结合通过交叉交互恢复，且AI素养解耦信任与行为。

详情

AI中文摘要

人工智能（AI）代理通过根据用户的背景、兴趣和先前交互来定制解释，即情境化，从而个性化其响应。个性化已被视为政治或营销中的说服策略。然而，在用户通常缺乏先验知识的日常任务中，情境化的说服效果仍不明确。我们进行了一项2×2被试间实验（N=380），研究情境化与对话温暖相结合如何影响AI助手在反驳专家建议时的依赖性和说服力。我们的发现表明，情境化降低了AI的说服力，但其与温暖的结合通过交叉交互恢复了说服力。对AI的依赖在所有条件下都存在，并且不受对话设计的影响。信任强烈预测说服力和依赖，但情境化和温暖都不通过信任起作用。AI素养解耦了信任与行为：素养更高的用户对助手报告的信任较低，但更易被说服且更依赖其建议。这些结果表明，用户倾向于依赖AI代理而非人类专家判断；然而，界面级别的对话设计选择在塑造行为方面的作用有限。

英文摘要

Artificial Intelligence (AI) agents personalize their responses by tailoring explanations to users' backgrounds, interests, and prior interactions, referred to as contextualization. Personalization has been identified as a persuasive strategy in politics or in marketing. However, the persuasive effect of contextualization in everyday tasks, where users often lack prior knowledge, remains unclear. We conducted a $2\times2$ between-subjects experiment ($N = 380$) examining how contextualization, combined with conversational warmth, shapes reliance and persuasiveness of an AI assistant arguing against expert recommendations. Our findings reveal that contextualization reduces the persuasive power of AI, but its combination with warmth restores persuasiveness through a crossover interaction. Reliance on AI is present across conditions and is invariant to the conversational design. Trust strongly predicts both persuasion and reliance, yet neither contextualization nor warmth operates through trust. AI literacy decouples trust from behavior: more literate users report lower trust in the assistant, yet are more persuaded and more reliant on its advice. These results suggest that users are prone to deferring to AI agents over human expert judgment; however, interface-level conversational design choices have a limited role in shaping the behavior.

URL PDF HTML ☆

赞 0 踩 0

2605.31250 2026-06-01 stat.ML cs.AI cs.LG

Entropic Projection Alignment: Estimating, Explaining, and Improving Model Performance Under Distribution Shift

熵投影对齐：估计、解释和改进分布偏移下的模型性能

Salim I. Amoukou, Emanuele Albini, Tom Bewley, Saumitra Mishra, Manuela Veloso

AI总结提出熵投影对齐（EPA）方法，通过匹配选定矩并最小化KL散度来对齐源分布与目标分布，从而统一解决分布偏移下的性能估计、解释和改进问题。

Comments Accepted at the 29th International Conference on Artificial Intelligence and Statistics (AISTATS 2026)

2605.31246 2026-06-01 cs.CR cs.CV

BadBone: Backdoor Attacks Against Backbone Models in Visual Prompt Learning

BadBone：视觉提示学习中针对骨干模型的后门攻击

Ziqing Yang, Rui Wen, Xinlei He, Yun Shen, Michael Backes, Yang Zhang

AI总结提出BadBone，一种利用双层优化的隐蔽自适应后门攻击方法，通过破坏骨干模型使下游提示学习任务继承后门漏洞，实验表明现有防御措施基本无效。

Comments Accepted by IEEE Transactions on Information Forensics & Security

详情

DOI: 10.1109/TIFS.2026.3698596

AI中文摘要

提示学习是一种新的机器学习范式，因其简单性和有效性而受到广泛关注。尽管其应用日益增多，但该范式的安全漏洞仍未被充分探索。在这项工作中，我们率先提出BadBone，一种利用双层优化的隐蔽自适应后门攻击，针对提示学习。我们的目标不是对提示学习过程植入后门，而是破坏骨干模型，使得只有采用提示学习的目标下游任务继承后门漏洞。在三个不同模型和来自不同领域的三个数据集上的大量实验表明，我们的定向/非定向后门模型在保持预训练和下游任务实用性的同时，实现了高攻击性能。此外，我们针对六种最先进的模型级防御（包括Neural Cleanse、ABS、MNTD、NAD、CLP和D-BR）评估了我们的方法。结果表明，这些防御对我们的后门模型基本无效，因此有效的防御仍是未来工作的重要方向。

英文摘要

Prompt learning is a new machine learning paradigm that has attracted ample attention due to its simplicity and proven efficacy. Despite its growing adoption, the security vulnerabilities associated with this paradigm remain underexplored. In this work, we take the first step to propose BadBone, a stealthy and adaptive backdoor attack against prompt learning using bi-level optimization. Instead of backdooring the prompt learning process, we aim to compromise a backbone model such that only target downstream tasks employing prompt learning inherit the backdoor vulnerability. Extensive experiments on three different models and three datasets from various domains show that our targeted/untargeted backdoored models achieve high attack performance while maintaining utility on both pre-training and downstream tasks. Moreover, we evaluate our approach against six state-of-the-art model-level defenses, including Neural Cleanse, ABS, MNTD, NAD, CLP, and D-BR. The results demonstrate that these defenses are largely ineffective against our backdoored models and thus leave the effective defense as an important direction for future work.

URL PDF HTML ☆

赞 0 踩 0