arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 3405
2605.24556 2026-05-26 cs.IR cs.CL cs.LG

The Multilingual Curse at the Retrieval Layer: Evidence from Amharic

多语言诅咒在检索层:来自阿姆哈拉语的证据

Yosef Worku Alemneh, Kidist Amde Mekonnen, Maarten de Rijke

发表机构 * Independent Researcher(独立研究者) University of Amsterdam(阿姆斯特丹大学)

AI总结 针对零样本多语言检索在低资源形态丰富语言(如阿姆哈拉语)上表现不佳的问题,通过对比实验发现单语检索器显著优于多语言检索器,并揭示了多语言基准测试的局限性。

Comments 10 pages, 4 tables. Accepted to the 1st Workshop on Multilinguality in the Era of Large Language Models (MeLLM) at ACL 2026

详情
AI中文摘要

多语言检索日益支撑着跨语言问答和检索增强生成。在多语言基准测试上的强零样本分数常被视为当前编码器能可靠跨语言迁移的证据。我们认为,对于代表性不足、形态丰富的语言,这一假设不成立,并以阿姆哈拉语作为诊断案例。在涵盖密集、延迟交互、学习稀疏和交叉编码器范式的共享段落检索协议下,我们比较了零样本多语言检索器、阿姆哈拉语微调的多语言检索器以及单语阿姆哈拉语检索器。最强的零样本多语言检索器在MRR@10上比最强的单语阿姆哈拉语第一阶段检索器低23%。在相同的阿姆哈拉语监督下微调两个最新的多语言嵌入模型,相比零样本获得了32-60%的相对MRR@10提升,但最佳阿姆哈拉语微调多语言模型仍低于最强的单语阿姆哈拉语检索器。这些发现表明,零样本多语言检索并不能充分代表LLM时代公平的信息访问:对于代表性不足的语言,检索必须在语言内部进行评估和适应,而不是从聚合的多语言基准测试中推断。为促进未来研究,我们在https://github.com/rasyosef/amharic-neural-ir 公开发布了数据集、代码库和训练模型。

英文摘要

Multilingual retrieval increasingly underpins cross-lingual question answering and retrieval-augmented generation. Strong zero-shot scores on multilingual benchmarks are often taken as evidence that current encoders transfer reliably across many languages. We argue that this assumption breaks down for underrepresented, morphologically rich languages, and use Amharic as a diagnostic case. Under a shared passage retrieval protocol covering dense, late-interaction, learned sparse, and cross-encoder paradigms, we compare zero-shot multilingual retrievers, Amharic-fine-tuned multilingual retrievers, and monolingual Amharic retrievers. The strongest zero-shot multilingual retriever underperforms the strongest monolingual Amharic first-stage retriever by 23% relative MRR@10. Fine-tuning two recent multilingual embedding models on the same Amharic supervision yields 32-60% relative MRR@10 gains over zero-shot, but the best Amharic-fine-tuned multilingual model remains below the strongest monolingual Amharic retriever. These findings indicate that zero-shot multilingual retrieval is not a sufficient proxy for equitable information access in the LLM era: for underrepresented languages, retrieval must be evaluated and adapted in-language rather than inferred from aggregate multilingual benchmarks. To foster future research, we publicly release the dataset, codebase, and trained models at https://github.com/rasyosef/amharic-neural-ir.

2605.24542 2026-05-26 cs.CR cs.AI cs.LG cs.MA cs.SE

AI-Driven Adaptive Adversaries and the Erosion of Cryptographic Trust in Public Key Systems

AI驱动的自适应对手与公钥系统中密码学信任的侵蚀

Petar Radanliev

发表机构 * Department of Computer Sciences, University of Oxford(牛津大学计算机科学系) The Alan Turing Institute(艾伦·图灵研究所) British Library(大英图书馆)

AI总结 本文研究人工智能驱动的自适应对手如何利用实现层面的可观测性侵蚀公钥密码学的安全性,提出了一种新的安全评估框架。

详情
Journal ref
J Anal Sci Technol 17, 26 (2026)
AI中文摘要

本文研究了在人工智能驱动的自适应对手优化下,公钥密码学(PKC)安全性的侵蚀问题。所解决的问题是以算法为中心的密码安全模型与操作攻击现实之间日益增长的错配,其中对手利用实现层面的可观测性,而不是破解密码原语。

英文摘要

This paper examines the erosion of Public Key Cryptography (PKC) security under adaptive adversarial optimisation driven by artificial intelligence. The problem addressed is the growing mismatch between algorithm-centric cryptographic security models and operational attack realities, where adversaries exploit implementation-level observability rather than breaking cryptographic primitives.

2605.24538 2026-05-26 cs.CY cs.AI cs.MA

Is Decentralized AI Governable? From Regulative Policy to Constitutive Protocol

去中心化AI是否可治理?从规制政策到构成性协议

Botao Amber Hu, Helena Rong

发表机构 * University of Oxford(牛津大学) New York University Shanghai(纽约大学上海校区)

AI总结 本文分析去中心化AI的六层堆栈,揭示其导致的治理真空(责任缺口和无力化缺口),并提出从基于政策的规范性治理转向基于协议的构成性治理,同时确立合法性、可争议性、透明性和非支配性四个伦理条件。

Comments Submitted for Ethics and Information Technology

详情
AI中文摘要

每个主要的AI治理框架都预设了一个可识别的实体——开发者、部署者或操作者——该实体可以被追究责任并被强制遵守。去中心化AI(DeAI)瓦解了这一预设。我们将DeAI分析为一个六层去中心化堆栈——模型、训练、计算、驾驭、身份和所有权——并展示各层部分去中心化如何叠加成我们所谓的“治理真空”:一种AI系统足够重要以至于需要治理,但缺乏现有框架在其目标中所预设属性的状态。这种真空有两种分析上不同的形式:一是“责任缺口”,即无法识别出可问责的主体;二是“无力化缺口”,即使识别出主体也无法改变正在运行的系统。我们证明这些失败不仅是管辖权上的,而且通过规范性地址——向一个理解并响应的主体传达规则——挫败了治理的所有预设。借鉴Lessig的规制模式和Searle关于规制性规则与构成性规则的区分,我们主张将治理的焦点从政策转向协议,从规范性地址转向架构约束。基于协议的构成性治理并不针对系统内运作的主体,而是塑造决定系统内何种行动成为可能的基质。我们确定了这种治理必须满足的四个伦理条件——合法性、可争议性、透明性和非支配性——以避免退化为不负责任的专家统治权力,并认为在去中心化世界中治理AI的核心政治挑战是重建对架构选择的民主授权形式,这些选择在常规政策链条断裂后依然存在。

英文摘要

Every major framework for governing artificial intelligence presupposes an identifiable entity -- a developer, deployer, or operator -- who can be held responsible and compelled to comply. Decentralized AI (DeAI) dissolves this presupposition. We analyze DeAI as a six-layer decentralizing stack -- model, training, compute, harness, identity, and ownership -- and show how partial decentralization across layers compounds into what we call the \emph{governance vacuum}: a condition in which AI systems are consequential enough to require governance but lack the properties that existing frameworks presuppose in their targets. This vacuum takes two analytically distinct forms: an \emph{accountability gap}, where no addressable principal can be identified, and an \emph{incapacitation gap}, where even an identified principal cannot alter the running system. We demonstrate that these failures are not merely jurisdictional but defeat every presupposition of governance through normative address -- the communication of rules to a comprehending, responsive agent. Drawing on Lessig's modalities of regulation and Searle's distinction between regulative and constitutive rules, we argue for a shift in the locus of governance from policy to protocol, from normative address to architectural constraint. Protocol-based constitutive governance does not address the agents operating within a system but shapes the substrate that determines what kinds of actions are possible within it. We identify four ethical conditions -- legitimacy, contestability, transparency, and non-domination -- that such governance must satisfy to avoid degenerating into unaccountable technocratic power, and we argue that the central political challenge of governing AI in a decentralized world is reconstructing forms of democratic authorization for architectural choices that persist after the ordinary chain of policy has broken down.

2605.22634 2026-05-26 cs.SE cs.AI

Contractual Skills: A GovernSpec Design Framework for Enterprise AI Agents

合同技能:面向企业AI代理的GovernSpec设计框架

Ting Liu

发表机构 * SymbolicLight Research(SymbolicLight研究院)

AI总结 提出一种基于GovernSpec的合同技能设计框架,通过组织SKILL.md文件为可读任务合同,明确任务意图、边界和验收标准,实验表明该框架能提升生成质量并降低关键错误率。

Comments 15 pages, 5 figures, 4 tables. v2 adds a public-skill A/B study, updates experimental results, and adds a public replication package link: AGI/contractual-skill" target="_blank" rel="noopener">https://github.com/SymbolicLight-AGI/contractual-skill

详情
AI中文摘要

技能已成为代理指令、工作流、脚本和参考材料的实用封装机制。然而,在企业环境中,技能通常需要表达比任务指导更多的内容:目标、输入边界、权限、人工审批点、证据要求、输出合同、质量标准、验证步骤和交接规则。本文提出合同技能,一种受GovernSpec启发的设计框架,用于将SKILL.md文件组织为可读的任务合同,同时保持轻量级技能发现和渐进加载。该框架明确了合同技能、GovernSpec YAML合同、模型上下文协议(MCP)接口、工具适配器、运行时护栏、追踪和评估系统之间的界限。我们通过三个离线实证研究评估该框架。第一个文本生成实验涵盖三个企业技能、十五个合成任务、四种指令条件和八个生成模型,产生960个输出和1680个交叉评判分数记录。第二个研究是公共技能A/B扩展:将八个公共技能与合同重写在四十八个合成任务、六个生成模型、两次重复、1152个输出和两个完整评判文件上进行比较。在此设置中,合同技能将平均质量从4.692提高到4.914,并将关键错误率从0.083降低到0.013。第三个研究是离线工具调用挑战,涉及八个模型和192个模拟工具调用记录。结果表明,合同技能最好被理解为一种治理层,使任务意图、边界和验收标准显式化,而不是独立的安全机制。

英文摘要

Skills have become a practical packaging mechanism for agent instructions, workflows, scripts, and reference materials. In enterprise settings, however, a skill often needs to express more than task guidance: goals, input boundaries, permissions, human approval points, evidence requirements, output contracts, quality criteria, verification steps, and handoff rules. This paper proposes contractual skills, a GovernSpec-inspired design framework for organizing SKILL.md files as readable task contracts while preserving lightweight skill discovery and progressive loading. The framework clarifies the boundary between contractual skills, GovernSpec YAML contracts, Model Context Protocol (MCP) surfaces, tool adapters, runtime guardrails, tracing, and evaluation systems. We evaluate the framework with three offline empirical studies. The first text-generation experiment covers three enterprise skills, fifteen synthetic tasks, four instruction conditions, and eight generation models, producing 960 outputs and 1680 cross-judge score records. The second study is a public-skill A/B expansion: eight public skills are compared with contractual rewrites across forty-eight synthetic tasks, six generation models, two repeats, 1152 outputs, and two complete judge files. In this setting, contractual skills raise mean quality from 4.692 to 4.914 and reduce critical-error rate from 0.083 to 0.013. The third study is an offline tool-calling challenge with eight models and 192 simulated tool-call records. The results suggest that contractual skills are best understood as a governance layer that makes task intent, boundaries, and acceptance criteria explicit, not as a standalone safety mechanism.

2605.20747 2026-05-26 q-bio.GN cs.LG

Multi-Modal Machine Learning for Population- and Subject-Specific lncRNA-Type 2 Diabetes Association Analysis

多模态机器学习用于群体和个体特异性lncRNA-2型糖尿病关联分析

Ashwani Siwach, Sanjeev Narayan Sharma, Sunil Datt Sharma

发表机构 * Department of Electronics and Communication Engineering, IIITDM Jabalpur(IIITDM Jabalpur电子与通信工程系) Department of Electronics and Communication Engineering, Central University of Jammu(Jammu中央大学电子与通信工程系)

AI总结 本研究通过整合表达、二级结构和序列特征的多模态机器学习框架,在独立队列中识别与2型糖尿病相关的lncRNA,并利用SHAP分析实现群体和个体水平的关联解释。

Comments This work has been submitted to the IEEE for possible publication

详情
AI中文摘要

长链非编码RNA(lncRNA)是参与慢性疾病(包括2型糖尿病)发病机制的新兴调控分子。我们研究了文献中报道的与2型糖尿病相关的十种lncRNA:MALAT1、MEG3、MIAT、ANRIL、GAS5、KCNQ1OT1、H19、BCYRN1、XIST和HOTAIR,在两个独立的人群RNA-seq队列中进行了分析。单组学方法提供了疾病生物学的不完整视图,因此开发了一个整合多特征框架,提取每种lncRNA的表达、二级结构和序列特征。在分层k折交叉验证、留一法交叉验证和重复留出法方案下评估了八种机器学习分类器,以确保稳健的性能估计。应用SHAP分析进行个体水平的关联解释。在一个队列中,发现GAS5和XIST的表达特征以及GAS5、MEG3和ANRIL的序列特征与2型糖尿病相关,而在第二个队列中,发现MALAT1的表达特征以及KCNQ1OT1、ANRIL和MEG3的序列特征与2型糖尿病相关。SHAP将MEG3识别为两个队列中的主要lncRNA。机器学习结果与已建立的统计方法一致,同时额外提供了与特定分子特征类型相关的群体和个体水平疾病关联谱。所提出的框架增进了对2型糖尿病机制的理解,并支持基于lncRNA的精准医学。

英文摘要

Long non-coding RNAs (lncRNAs) are emerging regulatory molecules implicated in chronic disease pathogenesis, including Type 2 Diabetes Mellitus (T2D). We investigated ten literature reported lncRNAs associated with T2D: MALAT1, MEG3, MIAT, ANRIL, GAS5, KCNQ1OT1, H19, BCYRN1, XIST, and HOTAIR across two independent population-based RNA-seq cohorts. Single-omics approaches provide an incomplete view of disease biology, therefore, an integrative multi-feature framework was developed, extracting expression, secondary-structure, and sequence features for each lncRNA. Eight machine learning (ML) classifiers were evaluated under stratified k-fold, leave-one-out cross-validation (LOOCV), and repeated hold-out schemes to ensure robust performance estimation. SHAP analysis was applied for subject-level association interpretation. In one cohort, GAS5 and XIST expression features, along with GAS5, MEG3, and ANRIL sequence features, were found to be associated with T2D, while MALAT1 expression and KCNQ1OT1, ANRIL, and MEG3 sequence features were found to be associated in the second cohort. MEG3 was identified by SHAP as the dominant lncRNA in both cohorts. ML results were consistent with established statistical methods while additionally providing population- and subject-level disease association profiles linked to specific molecular feature types. The proposed framework advances mechanistic understanding of T2D and supports lncRNA-based precision medicine.

2605.19938 2026-05-26 stat.ME cs.LG stat.ML

Variance-Reduced Manifold Sampling via Polynomial-Maximization Density Estimation

通过多项式最大化密度估计的方差缩减流形采样

Serhii Zabolotnii

发表机构 * Department of Information, Multimedia Technologies and Design, Cherkasy State Business College(切爾卡西州商業學院信息、多媒體技術與設計系) State Scientific Research Institute of Armament and Military Equipment Testing and Certification(武器和軍事設備測試和認證國家科學研究 institutes) Department of Cybernetics and Applied Mathematics, Uzhhorod National University(烏茲霍羅德國家大學自動控制與應用數學系)

AI总结 针对隐式定义流形上的均匀采样问题,提出一种基于多项式最大化矩估计的密度估计模块PMM-MASEM,通过门控机制在非平坦间距分布下替代传统插件估计,降低密度均方误差22-36%。

Comments 16 pages, 5 figures, 3 tables. Code supplement: https://github.com/SZabolotnii/Ku-PMM-MASEM-code-supplement

详情
AI中文摘要

在隐式定义流形上的均匀采样是运动规划、约束模拟和概率机器学习中的核心原语。MASEM通过熵最大化重采样解决该问题,但其重采样权重依赖于局部k近邻密度估计,而激进的重采样温度可能放大其误差。我们探究是否可以用多项式最大化矩估计器替代插件密度规则,而不改变周围的MASEM架构。所提出的PMM-MASEM模块从嵌套的k近邻半径计算壳间距,估计其标准化累积量,并仅在间距分布偏离平坦的Exp(1)分布时使用门控的PMM2/PMM3估计器;否则回退到插件/MLE规则。这种回退至关重要:在平坦齐次流形上,插件估计器已经是MLE,因此PMM不应优于它。局部已知DGP蒙特卡洛实验证实了该门控:选择器在平坦Exp(1)间距下返回MLE,并在非对称伽马和边界间距情况下将密度MSE降低22-36%。证据并非一致积极:PMM3在尖峰均匀间距法则下表现更差,而轻量级重采样代理实验改善了七瓣覆盖但降低了正弦和瑞士卷代理的性能。因此,当前证据支持的是适用边界结果,而非一般的MASEM改进主张。

英文摘要

Uniform sampling on implicitly defined manifolds is a core primitive in motion planning, constrained simulation, and probabilistic machine learning. MASEM addresses this problem by entropy-maximizing resampling, but its resampling weights depend on a local k-nearest-neighbour density estimate whose errors can be amplified by aggressive resampling temperatures. We ask whether a polynomial-maximization moment estimator can replace the plug-in density rule without changing the surrounding MASEM architecture. The proposed PMM-MASEM module computes shell spacings from nested k-nearest-neighbour radii, estimates their standardized cumulants, and uses a gated PMM2/PMM3 estimator only when the spacing distribution departs from the flat Exp(1) regime; otherwise it falls back to the plug-in/MLE rule. This fallback is essential: on a flat homogeneous manifold the plug-in estimator is already the MLE, so PMM should not outperform it. A local Known-DGP Monte Carlo experiment confirms this gate: the selector returns MLE on flat Exp(1) spacings and reduces density MSE by 22--36% on asymmetric gamma and boundary-spacing regimes. The evidence is not uniformly positive: PMM3 worsens a platykurtic uniform spacing law, and a lightweight resampling-proxy experiment improves seven-lobes coverage but degrades the sine and swiss-roll proxies. The current evidence therefore supports an applicability-boundary result rather than a general MASEM improvement claim.

2605.19170 2026-05-26 stat.ML cs.LG

Reducing Diffusion Model Memorization with Higher Order Langevin Dynamics

使用高阶朗之万动力学减少扩散模型记忆化

Benjamin Sterling, Mónica F. Bugallo, Tom Tirer

发表机构 * Department of Applied Math & Statistics(应用数学与统计学系) Stony Brook University(石溪大学) Department of Electrical and Computer Engineering(电气与计算机工程系) Faculty of Engineering(工程学院) Bar-Ilan University(巴伊兰大学)

AI总结 本文研究高阶朗之万动力学(HOLD)对扩散模型记忆化的影响,通过理论分析表明HOLD通过低通滤波学习得分函数并随阶数增加平滑度,从而缓解记忆化,并在真实数据上验证了理论。

详情
AI中文摘要

扩散/基于分数的模型已成为强大的生成模型,能够生成模仿训练数据分布的高质量样本。然而,观察到它们容易重现训练样本——称为“记忆化”——可能违反版权和隐私。在本文中,我们研究了高阶朗之万动力学(HOLD)对这一现象的影响。HOLD扩散过程引入了辅助变量;如果数据变量被解释为“位置”,那么辅助变量可以解释为“速度”和“加速度”,具体取决于所选模型的阶数。它们最初是基于这样的直觉提出的:通过隐式施加额外的动力学约束来正则化数据变量的轨迹。据我们所知,我们的工作首次提供了HOLD正则化效应的理论刻画。具体来说,我们表明在HOLD中,数据变量的动力学由学习得分函数的低通滤波版本控制,其平滑度随HOLD阶数增加而增加。然后我们分析了最优经验得分和分布崩溃的可能性。总之,我们的结果解释了随着模型阶数增加记忆化的缓解。最后,我们在真实世界数据上进行了实证研究,支持了我们的理论,并突出了HOLD在实践中相对于标准扩散的这一独特优势。

英文摘要

Diffusion/score-based models have emerged as powerful generative models, capable of generating high-quality samples that mimic the training data distribution. However, it has been observed that they are prone to reproducing training samples-known as "memorization"-potentially violating copyright and privacy. In this paper, we study the effect of Higher-Order Langevin Dynamics (HOLD) on this phenomenon. HOLD diffusion processes introduce auxiliary variables; if the data variable is interpreted as "position," then the auxiliary variables can be interpreted as "velocity" and "acceleration," depending on the chosen order of the model. They were originally proposed based on the intuition that they regularize the trajectories of the data variable by implicitly imposing additional dynamical constraints. Our work provides, to our knowledge, the first theoretical characterization of the regularization effect of HOLD. Specifically, we show that in HOLD, the dynamics of the data variable are governed by a low-pass-filtered version of the learned score function, with smoothness increasing with the order of HOLD. We then analyze the optimal empirical score and the possibility of distribution collapse. Together, our results explain the mitigation of memorization as the model order increases. Finally, we present an empirical study on real-world data that supports our theory and highlights this distinct advantage of HOLD over standard diffusion in practice.

2605.14605 2026-05-26 cs.CR cs.AI cs.LG

One Step to the Side: Why Defenses Against Malicious Finetuning Fail Under Adaptive Adversaries

一步之遥:为什么针对恶意微调的防御在自适应对手面前失败

Itay Zloczower, Eyal Lenga, Gilad Gressel, Yisroel Mirsky

发表机构 * Ben-Gurion University of the Negev(贝纳-约瑟夫大学) Amrita Vishwa Vidyapeetham(阿米塔维莎瓦迪耶佩塔)

AI总结 本文通过分析15种近期防御机制,发现它们共享一个弱点:仅掩盖或误导有害行为路径而未消除行为本身,并开发了一种统一的自适应攻击,成功突破了所有防御机制。

Comments Under review

详情
AI中文摘要

模型提供商越来越多地发布开放权重或允许用户通过API微调基础模型。尽管这些模型在发布前经过安全对齐,但其防护措施通常可以通过对有害数据的微调来移除。最近的防御旨在使模型对此类恶意微调具有鲁棒性,但它们主要仅针对不考虑防御的固定攻击进行评估。我们表明这些鲁棒性声明是不完整的。通过调查15种近期防御,我们识别了几种防御机制,并表明它们共享一个单一弱点:它们掩盖或误导通往有害行为的路径,而不移除行为本身。然后,我们开发了一种统一的自适应攻击,突破了所有防御机制。我们的结果表明,当前方法并未提供稳健的安全性;它们主要阻止了它们所设计的攻击。我们希望我们针对这一领域的统一自适应对手将帮助未来的研究人员和实践者在部署前对新防御进行压力测试。

英文摘要

Model providers increasingly release open weights or allow users to fine-tune foundation models through APIs. Although these models are safety-aligned before release, their safeguards can often be removed by fine-tuning on harmful data. Recent defenses aim to make models robust to such malicious fine-tuning, but they are largely evaluated only against fixed attacks that do not account for the defense. We show that these robustness claims are incomplete. Surveying 15 recent defenses, we identify several defense mechanisms and show that they share a single weakness: they obscure or misdirect the path to harmful behavior without removing the behavior itself. We then develop a unified adaptive attack that breaks defenses across all defense mechanisms. Our results show that current approaches do not provide robust security; they mainly stop the attacks they were designed against. We hope that our unified adaptive adversary for this domain will help future researchers and practitioners stress-test new defenses before deployment.

2605.12764 2026-05-26 q-fin.MF cs.LG stat.ML

Yield Curves Dynamics Using Variational Autoencoders Under No-arbitrage

无套利条件下使用变分自编码器的收益率曲线动力学

Fusheng Luo, H'elyette Geman

发表机构 * Department of Applied Mathematics and Statistics, Johns Hopkins University, USA(应用数学与统计学系,约翰霍普金斯大学,美国)

AI总结 提出一种物理信息生成框架,通过两阶段架构(学生t条件变分自编码器+动态水平注入和神经随机微分方程)解决深度学习统计灵活性与固定收益理论约束的冲突,在多个主权货币上显著降低预测误差并实现无套利。

Comments This is the full script (version 2) of our paper, which is awaiting submission to financial journals/conferences, after modifying and double-checking the reference lists

详情
AI中文摘要

本文引入了一个物理信息生成框架,解决了深度学习统计灵活性与固定收益建模严格理论约束之间的根本冲突。我们证明,标准生成模型和无约束统计外推在预测跨多种宏观经济体制的期限结构时,会遭受“流形崩溃”和严重的套利违规。为克服这一问题,我们提出了一种两阶段架构。首先,具有动态水平注入的学生t条件变分自编码器(CVAEsT+LS)提取了一个稳健、重尾的期限结构流形,有效解耦了宏观经济形状动态与绝对基准利率。其次,潜在动态演化由连续时间神经随机微分方程(SDE)控制,并受到无套利偏微分方程(PDE)的严格惩罚。跨多个主权货币(美元、英镑、日元)的实证结果证实,我们的协同方法大幅降低了样本外预测误差——实现了卓越的6.58个基点平均期限RMSE——并成功克服了经典HJM模型在极端环境中表现出的巨大平行漂移和零下限违规。此外,通过相空间向量场分析,我们展示了该模型在无监督宏观经济体制检测和高质量连续时间情景生成方面的卓越能力。最终,本研究为期限结构建模提供了一个高度可扩展、数学上合理的演化引擎。

英文摘要

This paper introduces a physics-informed generative framework that resolves the fundamental conflict between the statistical flexibility of deep learning and the rigorous theoretical constraints of fixed-income modeling. We demonstrate that standard generative models and unconstrained statistical extrapolations suffer from "manifold collapse" and severe arbitrage violations when forecasting term structures across diverse macroeconomic regimes. To overcome this, we propose a two-stage architecture. First, a Student-t Conditional Variational Autoencoder with Dynamic Level Injection (CVAEsT+LS) extracts a robust, heavy-tailed term structure manifold, effectively decoupling macroeconomic shape dynamics from absolute base rates. Second, the latent dynamic evolution is governed by a continuous-time Neural Stochastic Differential Equation (SDE) strictly penalized by a No-Arbitrage Partial Differential Equation (PDE). Empirical results across multiple sovereign currencies (USD, GBP, JPY) confirm that our synergistic approach drastically reduces out-of-sample forecasting errors -- achieving an exceptional 6.58 bps Mean Tenor RMSE -- and successfully overcomes the massive parallel drift and zero-lower-bound violations exhibited by the classical HJM model in extreme environments. Furthermore, through phase space vector field analysis, we demonstrate the model's superior capability in unsupervised macroeconomic regime detection and high-quality continuous-time scenario generation. Ultimately, this research provides a highly scalable, mathematically sound evolutionary engine for term structure modeling.

2605.12118 2026-05-26 stat.ML cs.LG

Keeping Score: Efficiency Improvements in Neural Likelihood Surrogate Training via Score-Augmented Loss Functions

保持分数:通过分数增强损失函数提高神经似然代理训练的效率

Alexander Shen, Mikael Kuusela

发表机构 * Department of Statistics and Data Science(统计与数据科学系)

AI总结 针对随机过程模型,提出通过分数增强损失函数和自适应加权改进神经似然代理训练,在显著降低计算成本的同时提升代理质量,实现与10倍训练数据相当的推理性能。

Comments 9 pages of main text, 9 pages of appendices, 13 figures

详情
AI中文摘要

对于随机过程模型,参数推断通常受限于计算昂贵的似然函数。基于模拟的推断(SBI)通过构建摊销代理似然绕过了这一限制,但大多数SBI方法假设黑箱数据生成过程。虽然这些代理在无限训练数据下是精确的,但实际场景迫使在模型质量和模拟成本之间进行严格权衡。在这项工作中,我们放宽了SBI的黑箱假设,以改善结构化随机过程模型的这种权衡。具体而言,对于通过概率分类训练的神经网络似然代理,我们提出用精确的分数信息 $\nabla_θ\log p(x \mid θ)$ 和基于损失梯度的自适应加权来增强标准二元交叉熵损失。我们在涉及网络动力学和空间过程的案例研究中评估了我们的方法,证明我们的方法以远低于生成更多训练数据的计算成本提高了代理质量。值得注意的是,在某些情况下,我们的方法实现了与训练数据增加10倍相当的下游推理性能,而训练时间增加不到1.1倍。

英文摘要

For stochastic process models, parameter inference is often severely bottlenecked by computationally expensive likelihood functions. Simulation-based inference (SBI) bypasses this restriction by constructing amortized surrogate likelihoods, but most SBI methods assume a black-box data generating process. While these surrogates are exact in the limit of infinite training data, practical scenarios force a strict tradeoff between model quality and simulation cost. In this work, we loosen the black-box assumption of SBI to improve this tradeoff for structured stochastic process models. Specifically, for neural network likelihood surrogates trained via probabilistic classification, we propose to augment the standard binary cross-entropy loss with exact score information $\nabla_θ\log p(x \mid θ)$ and adaptive weighting based on loss gradients. We evaluate our approach on case studies involving network dynamics and spatial processes, demonstrating that our method improves surrogate quality at a drastically lower computational cost than generating more training data. Notably, in some cases, our approach achieves downstream inference performance equivalent to a 10x increase in training data with less than a 1.1x increase in training time.

2605.10718 2026-05-26 cs.DC cs.AI cs.LG cs.PF cs.SY eess.SY

An Uncertainty-Aware Resilience Micro-Agent for Causal Observability in the Computing Continuum

一种面向计算连续体中因果可观测性的不确定性感知韧性微代理

Suvi De Silva, Alfreds Lapkovskis, Alaa Saleh, Sasu Tarkoma, Praveen Kumar Donta

发表机构 * Department of Computer Systems and Sciences(计算机系统与科学系) Department of Computer Science(计算机科学系)

AI总结 提出AURORA框架,通过集成自由能原理、因果do-calculus和局部因果状态图,在边缘层实现灰色故障的因果诊断与缓解,并采用双门控执行机制在不确定性高时避免破坏性干预。

详情
AI中文摘要

计算连续体中的灰色故障会产生模糊重叠的症状,现有方法由于缺乏因果意识或在高度认知不确定性下行动,无法可靠诊断,并可能导致破坏性干预。本文提出了一种面向因果可观测性的不确定性感知韧性微代理(AURORA),这是一个轻量级框架,用于诊断和缓解边缘层环境中的灰色故障。该框架采用并行微代理,集成自由能原理、因果do-calculus和局部因果状态图,支持每个故障马尔可夫毯内的反事实根因分析。将推理限制在因果相关变量上可降低计算开销,同时保持诊断保真度。AURORA进一步引入双门控执行机制,仅在因果置信度高且预测认知不确定性有界时授权修复;否则,放弃本地干预并将诊断有效载荷升级到雾层。我们的实验表明,AURORA优于基线,实现了0%的破坏性行动率,同时保持62.0%的修复准确率和3ms的平均修复时间。

英文摘要

Grey failures in the computing continuum produce ambiguous overlapping symptoms that existing approaches fail to diagnose reliably, either due to a lack of causal awareness or acting under high epistemic uncertainty, risking destructive interventions. This paper presents an uncertainty-aware resilience micro-agent for causal observability (AURORA), a lightweight framework for diagnosing and mitigating grey failures in edge-tier environments. The framework employs parallel micro-agents that integrate the free-energy principle, causal do-calculus, and localized causal state-graphs to support counterfactual root-cause analysis within each fault's Markov blanket. Restricting inference to causally relevant variables reduces computational overhead while preserving diagnostic fidelity. AURORA further introduces a dual-gated execution mechanism that authorizes remediation only when causal confidence is high and predicted epistemic uncertainty is bounded; otherwise, it abstains from local intervention and escalates the diagnostic payload to the fog tier. Our experiments demonstrate that AURORA outperforms baselines, achieving a 0% destructive action rate, while maintaining 62.0% repair accuracy and a 3ms mean time to repair.

2605.02900 2026-05-26 cs.CR cs.AI cs.CV cs.RO

Safety in Embodied AI: A Survey of Risks, Attacks, and Defenses

具身人工智能的安全性:风险、攻击与防御综述

Xiao Li, Xiang Zheng, Yifeng Gao, Xinyu Xia, Yixu Wang, Xin Wang, Ye Sun, Yunhan Zhao, Ming Wen, Jiayu Li, Zixing Chen, Xun Gong, Yi Liu, Yige Li, Yutao Wu, Cong Wang, Jun Sun, Yixin Cao, Zhineng Chen, Jingjing Chen, Tao Gui, Qi Zhang, Zuxuan Wu, Xipeng Qiu, Xuanjing Huang, Tiehua Zhang, Zhipeng Wei, Kun Wang, Xinfeng Li, Hanxun Huang, Sarah Erfani, James Bailey, Jianping Wang, Chaowei Xiao, Ran He, Bo Li, Xingjun Ma, Yu-Gang Jiang

发表机构 * Fudan University(复旦大学) Shanghai Innovation Institute(上海创新研究院) City University of Hong Kong(香港城市大学) Jilin University(吉林大学) Singapore Management University(新加坡管理大学) Deakin University(德肯大学) Tongji University(同济大学) Nanyang Technological University(南洋理工大学) Chinese Academy of Sciences(中国科学院) The University of Melbourne(墨尔本大学) Johns Hopkins University(约翰霍普金斯大学)

AI总结 本文综述了具身AI在感知、认知、规划、行动及交互全流程中的安全风险、攻击与防御方法,提出了多层次分类体系,并指出了多模态感知融合脆弱性、规划不稳定及人机交互可信度等关键挑战。

Comments Survey paper; 75 pages, 4 figures, 18 tables; v2 expands embodied-specific coverage of agentic threats, World Action Model threats, and contextual risk mitigation, with over 100 new references added. Project page: https://x-zheng16.github.io/Awesome-Embodied-AI-Safety/

详情
AI中文摘要

具身人工智能将感知、认知、规划与交互集成到在开放、安全关键环境中运行的智能体中。随着这些系统获得自主性并进入交通、医疗、工业或辅助机器人等领域,确保其安全性在技术上具有挑战性,在社会上也变得不可或缺。与数字AI系统不同,具身智能体必须在不确定的感知、不完整的知识和动态的人机交互下行动,故障可能直接导致物理伤害。本综述对具身AI中的安全研究进行了全面且结构化的回顾,考察了从感知、认知到规划、行动与交互以及智能体系统的完整具身流程中的攻击与防御。我们引入了一个多层次分类体系,统一了分散的研究工作,并将具身特定的安全发现与视觉、语言和多模态基础模型的更广泛进展联系起来。我们的综述综合了来自500多篇论文的见解,涵盖对抗性攻击、后门攻击、越狱攻击和硬件级攻击;攻击检测、安全训练和鲁棒推理;以及风险感知的人机交互。这一分析揭示了几个被忽视的挑战,包括多模态感知融合的脆弱性、越狱攻击下规划的不稳定性,以及开放场景中人机交互的可信度。通过将领域组织成连贯的框架并识别关键研究空白,本综述为构建不仅具备能力和自主性,而且在现实部署中安全、鲁棒和可靠的具身智能体提供了路线图。

英文摘要

Embodied Artificial Intelligence (Embodied AI) integrates perception, cognition, planning, and interaction into agents that operate in open-world, safety-critical environments. As these systems gain autonomy and enter domains such as transportation, healthcare, and industrial or assistive robotics, ensuring their safety becomes both technically challenging and socially indispensable. Unlike digital AI systems, embodied agents must act under uncertain sensing, incomplete knowledge, and dynamic human-robot interactions, where failures can directly lead to physical harm. This survey provides a comprehensive and structured review of safety research in embodied AI, examining attacks and defenses across the full embodied pipeline, from perception and cognition to planning, action and interaction, and agentic system. We introduce a multi-level taxonomy that unifies fragmented lines of work and connects embodied-specific safety findings with broader advances in vision, language, and multimodal foundation models. Our review synthesizes insights from over 500 papers spanning adversarial, backdoor, jailbreak, and hardware-level attacks; attack detection, safe training and robust inference; and risk-aware human-agent interaction. This analysis reveals several overlooked challenges, including the fragility of multimodal perception fusion, the instability of planning under jailbreak attacks, and the trustworthiness of human-agent interaction in open-ended scenarios. By organizing the field into a coherent framework and identifying critical research gaps, this survey provides a roadmap for building embodied agents that are not only capable and autonomous but also safe, robust, and reliable in real-world deployment.

2604.23396 2026-05-26 cs.IR cs.AI cs.CL cs.LG

Lost in Decoding? Reproducing and Stress-Testing the Look-Ahead Prior in Generative Retrieval

迷失在解码中?复现与压力测试生成式检索中的前瞻先验

Kidist Amde Mekonnen, Yongkang Li, Yubao Tang, Simon Lupart, Maarten de Rijke

发表机构 * University of Amsterdam(阿姆斯特丹大学)

AI总结 本文复现并压力测试了生成式检索中的前瞻先验方法PAG,发现其规划信号在词汇表面形式变化下脆弱,并评估了跨语言鲁棒性与查询端缓解策略。

Comments 12 pages, 5 figures, 9 tables; accepted to the 49th International ACM SIGIR Conference on Research and Development in Information Retrieval, July 20-24, 2026, Melbourne/Naarm, Australia

详情
Journal ref
Proceedings of the 49th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '26), pages XXX-XXX, 2026
AI中文摘要

生成式检索(GR)通过自回归生成文档标识符来对文档进行排序。由于许多GR方法依赖于trie约束的束搜索,它们在有限束解码下容易过早剪枝相关前缀。生成式检索中的前瞻规划(PAG)通过使用同时解码来计算文档级前瞻先验,指导后续顺序解码,从而缓解了这种失败模式。我们在推理时复现了PAG,并压力测试了其解码行为。使用作者发布的检查点和标识符/trie工件,在报告的解码设置下,我们在MS MARCO Dev和TREC-DL 2019/2020上复现了主要有效性结果,并在我们的硬件设置中证实了报告的束大小-延迟权衡。在复现之外,我们引入了规划漂移诊断,量化意图保持的查询变体如何改变规划器的top-n候选集和最高权重规划器令牌,以及这些变化如何影响引导解码。我们发现PAG的规划信号在词汇表面形式变化下是脆弱的:意图保持的拼写错误可能触发规划崩溃,其中规划的候选池变化足够大,使得前瞻奖励几乎无法提供有用的指导,实际上使解码退回到较弱的无引导搜索。我们进一步使用非英语mMARC O查询对英语索引评估了固定索引的跨语言鲁棒性,并评估了无需重新索引的查询端缓解策略;在我们的设置中,查询翻译提供了最强的恢复。总体而言,我们的结果证实了PAG报告的有效性以及在发布的推理设置下规划引导解码的优势,同时表明这些增益依赖于规划信号在现实查询变化和查询-文档不匹配下的稳定性。

英文摘要

Generative retrieval (GR) ranks documents by autoregressively generating document identifiers. Because many GR methods rely on trie-constrained beam search, they are vulnerable to early pruning of relevant prefixes under finite-beam decoding. Planning Ahead in Generative Retrieval (PAG) mitigates this failure mode by using simultaneous decoding to compute a document-level look-ahead prior that guides subsequent sequential decoding. We reproduce PAG at inference time and stress-test its decoding behavior. Using the authors' released checkpoint and identifier/trie artifacts under the reported decoding setup, we reproduce the main effectiveness results on MS MARCO Dev and TREC-DL 2019/2020, and corroborate the reported beam-size-latency trade-off in our hardware setting. Beyond reproduction, we introduce plan drift diagnostics that quantify how intent-preserving query variations alter the planner's top-n candidate set and highest-weight planner tokens, and how these changes affect guided decoding. We find that PAG's planning signal is brittle under lexical surface-form variation: intent-preserving typos can trigger plan collapse, where the planned candidate pool shifts enough that the look-ahead bonus provides little useful guidance, effectively reverting decoding toward weaker unguided search. We further evaluate fixed-index cross-lingual robustness using non-English mMARCO queries against an English index, and assess query-side mitigation strategies that require no re-indexing; query translation provides the strongest recovery in our setting. Overall, our results confirm PAG's reported effectiveness and the benefit of planning-guided decoding under the released inference setup, while showing that these gains depend on the stability of the planning signal under realistic query variation and query-document mismatch.

2604.18800 2026-05-26 cs.SI cs.GT cs.LG

Optimal Exploration of New Products under Assortment Decisions

基于分类决策的新产品最优探索

Jackie Baek, Atanas Dinev, Thodoris Lykouris

发表机构 * Stern School of Business, New York University(纽约大学斯特恩商学院) Massachusetts Institute of Technology(麻省理工学院)

AI总结 研究平台在容量约束下通过分类决策在线学习新产品质量,提出最优探索策略以最小化遗憾,并揭示新产品应与顶级现有产品搭配、同时探索数量由潜力决定等结构洞见。

详情
AI中文摘要

我们研究了一个平台在容量约束下对提供哪些产品进行分类决策时,对新产品的在线学习。对于新上架的产品,其质量最初未知,质量信息通过社会学习传播:当顾客购买新产品并留下评论时,其质量对平台和未来顾客都变得可见。由于评论需要购买,平台必须在分类中展示新产品(“探索”)以产生评论来了解新产品。这种探索成本高昂,因为顾客对新产品的需求低于现有产品。我们刻画了用于探索的最优分类以最小化遗憾,解决了两个问题。(1)平台应该单独提供新产品还是与现有产品一起提供?前者最大化新产品的购买概率,但产生较低的短期收入。尽管购买概率较低,我们证明将新产品与顶级现有产品配对总是最优的。(2)对于多个新产品,平台应该同时探索它们还是逐个探索?我们证明同时探索的新产品最优数量具有简单的阈值结构:它随着新产品的“潜力”增加而增加,并且令人惊讶的是,不依赖于它们的个体购买概率。我们还表明,两种经典的bandit算法,UCB和汤普森采样,在此设置中因相反的原因而失败:UCB过度探索而汤普森采样探索不足。我们的结果为平台应如何通过分类决策了解新产品提供了结构性洞见。

英文摘要

We study online learning for new products on a platform that makes capacity-constrained assortment decisions on which products to offer. For a newly listed product, its quality is initially unknown, and quality information propagates through social learning: when a customer purchases a new product and leaves a review, its quality is revealed to both the platform and future customers. Since reviews require purchases, the platform must feature new products in the assortment ("explore") to generate reviews to learn about new products. Such exploration is costly because customer demand for new products is lower than for incumbent products. We characterize the optimal assortments for exploration to minimize regret, addressing two questions. (1) Should the platform offer a new product alone or alongside incumbent products? The former maximizes the purchase probability of the new product but yields lower short-term revenue. Despite the lower purchase probability, we show it is always optimal to pair the new product with the top incumbent products. (2) With multiple new products, should the platform explore them simultaneously or one at a time? We show that the optimal number of new products to explore simultaneously has a simple threshold structure: it increases with the "potential" of the new products and, surprisingly, does not depend on their individual purchase probabilities. We also show that two canonical bandit algorithms, UCB and Thompson Sampling, both fail in this setting for opposite reasons: UCB over-explores while Thompson Sampling under-explores. Our results provide structural insights on how platforms should learn about new products through assortment decisions.

2604.08501 2026-05-26 cs.DL cs.CL cs.SE

sciwrite-lint: Verification Infrastructure for the Age of Science Vibe-Writing

sciwrite-lint:科学氛围写作时代的验证基础设施

Sergey V Samsonau

发表机构 * Authentic Research Partners(真实研究伙伴) Princeton, NJ(新泽西州普林斯顿)

AI总结 针对AI辅助写作导致的引用幻觉问题,提出基于软件工程lint范式的引用验证工具sciwrite-lint,在研究者本地运行,快速检查引用存在性、元数据准确性、撤回状态和主张支持,并评估引用链完整性。

Comments Code: https://github.com/authentic-research-partners/sciwrite-lint

详情
AI中文摘要

科学论文通过引用对先前工作提出主张。大规模验证这些引用(每篇被引论文是否存在、是否支持引用主张、本身是否可靠)在结构上超出了人类评审的能力:一篇典型论文有数十条引用,而仔细的评审者最多通读少数几篇。AI辅助写作使这一差距更加紧迫:LLM会幻觉化参考文献,并从它们从未读过的论文标题或摘要中填充看似合理的细节,对于隐私意识研究者必须使用的较小本地权重模型,情况更糟。 sciwrite-lint将软件工程中的lint范式应用于引用验证:它完全在研究者机器上运行(免费公共数据库、单个消费级GPU和开放权重模型),速度足够快,可在修订之间重新lint,使作者在起草时就能从源头发现问题,并为期刊和评审者提供自动化的第一遍检查。该流程检查引用存在性、元数据准确性、撤回状态和主张支持,遍历被引论文参考文献的一级深度,并生成每篇引用的可靠性评分。我们在30篇未见过的论文(arXiv和bioRxiv)上进行了评估,包括错误注入和LLM裁决的假阳性分析。 相同的lint工作流程扩展到内部一致性:文本与表格中的数字、摘要与正文、图注与内容、统计结果与其文字解释,以及结构交叉引用(悬空引用、孤立参考文献)。作为独立的实验贡献,我们还提出了SciLint评分:引用链完整性与一个贡献组件相结合,该组件操作了五个科学哲学框架(Popper、Lakatos、Kitcher、Laudan、Mayo)。

英文摘要

Scientific papers make claims about prior work backed by citations. Verifying those citations at scale (that each cited paper exists, says what the citation claims, and is itself reliable) is structurally beyond what human review can deliver: a typical paper has dozens of citations, and a careful reviewer reads at most a handful end-to-end. AI-assisted writing makes this gap even more urgent: LLMs hallucinate references and may fill in plausible details from titles or abstracts of papers they never read, worse for the smaller local-weights models that privacy-aware researchers must use. sciwrite-lint applies the linting paradigm from software engineering to citation verification: it runs entirely on the researcher's machine (free public databases, a single consumer GPU, and open-weights models), is fast enough to re-lint between revisions so authors catch problems at the source while drafting, and serves journals and reviewers as an automated first pass. The pipeline checks reference existence, metadata accuracy, retraction status, and claim support, traverses one level into cited papers' bibliographies, and produces per-reference reliability scores. We evaluate on 30 unseen papers (arXiv and bioRxiv) with error injection and LLM-adjudicated false-positive analysis. The same linting workflow extends to internal consistency: numbers in text vs. tables, abstract vs. body, figure captions vs. content, statistical results vs. their verbal interpretation, plus structural cross-references (dangling cites, orphan references). As a separate experimental contribution we also propose SciLint Score: citation-chain integrity combined with a contribution component operationalizing five philosophy-of-science frameworks (Popper, Lakatos, Kitcher, Laudan, Mayo).

2603.29897 2026-05-26 cs.IR cs.AI

UniRank: End-to-End Domain-Specific Reranking of Hybrid Text-Image Candidates

UniRank: 混合文本-图像候选的端到端领域特定重排序

Yupei Yang, Lin Yang, Wanxi Deng, Lin Qu, Shikui Tu, Lei Xu

发表机构 * Shanghai Jiao Tong University(上海交通大学) Alibaba Group(阿里巴巴集团)

AI总结 提出UniRank,一种基于视觉语言模型的重排序框架,通过无需模态转换的统一评分和端到端领域适应(包括指令微调和基于强化学习的偏好对齐),在科学文献检索和设计专利搜索中显著提升性能。

详情
AI中文摘要

重排序是许多信息检索流程中的关键组件。尽管在纯文本场景中取得了显著进展,多模态重排序仍然具有挑战性,尤其是当候选集包含混合文本和图像项时。一个关键难点是模态差距:文本重排序器本质上更接近文本候选而非图像候选,导致跨模态排序存在偏差且次优。视觉语言模型(VLM)通过强大的跨模态对齐缓解了这一差距,并已被用于构建多模态重排序器。然而,大多数基于VLM的重排序器将所有候选编码为图像,将文本视为图像会引入大量计算开销。同时,现有的开源多模态重排序器通常在通用领域数据上训练,在特定领域场景中往往表现不佳。为解决这些限制,我们提出UniRank,一种基于VLM的重排序框架,无需任何模态转换即可原生地对混合文本-图像候选进行评分和排序。基于这种混合评分接口,UniRank提供了端到端的领域适应流程,包括:(1)指令微调阶段,通过将标签令牌似然映射到统一标量分数来学习校准的跨模态相关性评分;(2)硬负样本驱动的偏好对齐阶段,构建领域内成对偏好,并通过基于人类反馈的强化学习(RLHF)进行查询级策略优化。在科学文献检索和设计专利搜索上的大量实验表明,UniRank一致优于最先进的基线,Recall@1分别提高了8.9%和7.3%。

英文摘要

Reranking is a critical component in many information retrieval pipelines. Despite remarkable progress in text-only settings, multimodal reranking remains challenging, particularly when the candidate set contains hybrid text and image items. A key difficulty is the modality gap: a text reranker is intrinsically closer to text candidates than to image candidates, leading to biased and suboptimal cross-modal ranking. Vision-language models (VLMs) mitigate this gap through strong cross-modal alignment and have recently been adopted to build multimodal rerankers. However, most VLM-based rerankers encode all candidates as images, and treating text as images introduces substantial computational overhead. Meanwhile, existing open-source multimodal rerankers are typically trained on general-domain data and often underperform in domain-specific scenarios. To address these limitations, we propose UniRank, a VLM-based reranking framework that natively scores and orders hybrid text-image candidates without any modality conversion. Building on this hybrid scoring interface, UniRank provides an end-to-end domain adaptation pipeline that includes: (1) an instruction-tuning stage that learns calibrated cross-modal relevance scoring by mapping label-token likelihoods to a unified scalar score; and (2) a hard-negative-driven preference alignment stage that constructs in-domain pairwise preferences and performs query-level policy optimization through reinforcement learning from human feedback (RLHF). Extensive experiments on scientific literature retrieval and design patent search demonstrate that UniRank consistently outperforms state-of-the-art baselines, improving Recall@1 by 8.9% and 7.3%, respectively.

2603.25288 2026-05-26 cs.IT cs.AI cs.ET cs.LG eess.SP math.IT

CSI-tuples-based 3D Channel Fingerprints Construction Assisted by MultiModal Learning

基于CSI元组的多模态学习辅助3D信道指纹构建

Chenjie Xie, Li You, Ruirong Chen, Gaoning He, Xiqi Gao

发表机构 * National Mobile Communications Research Laboratory, Southeast University(东南大学国家移动通信研究中心) Purple Mountain Laboratories(紫金山实验室) Huawei Technologies Co., Ltd.(华为技术有限公司)

AI总结 针对低空通信中的3D信道指纹构建问题,提出一种基于CSI元组的多模态回归框架,通过融合位置、通信测量和地理环境地图,实现高效高精度的信道状态信息估计。

Comments 14 pages, 9 figures

详情
Journal ref
IEEE Transactions on Wireless Communications, vol. 25, pp. 17369-17383, 2026
AI中文摘要

低空通信可以促进空中和地面无线资源的整合,扩大网络覆盖范围,提高传输质量,从而推动第六代(6G)移动通信的发展。作为低空传输的关键技术,3D信道指纹(3D-CF),也称为3D无线电地图或3D信道知识地图,有望增强对通信环境的理解,并辅助获取信道状态信息(CSI),从而避免重复估计并降低计算复杂度。本文提出了一种模块化的多模态框架来构建3D-CF。具体而言,我们首先基于莱斯衰落信道建立了3D-CF模型,将其表示为CSI元组的集合,每个元组包含低空飞行器(LAV)的位置及其对应的统计CSI。考虑到不同先验数据的异构结构,我们将3D-CF构建问题表述为一个多模态回归任务,其中CSI元组中的目标信道信息可以通过其对应的LAV位置、通信测量和地理环境地图直接估计。然后,相应地提出了一种高效的多模态框架,包括基于相关性的多模态融合(Corr-MMF)模块、多模态表示(MMR)模块和CSI回归(CSI-R)模块。数值结果表明,我们提出的框架能够高效地构建3D-CF,并在不同通信场景下比现有算法至少提高27.5%的精度,展示了其竞争性能和出色的泛化能力。我们还分析了计算复杂度,并说明了其在推理时间方面的优越性。

英文摘要

Low-altitude communications can promote the integration of aerial and terrestrial wireless resources, expand network coverage, and enhance transmission quality, thereby empowering the development of sixth-generation (6G) mobile communications. As an enabler for low-altitude transmission, 3D channel fingerprints (3D-CF), also referred to as the 3D radio map or 3D channel knowledge map, are expected to enhance the understanding of communication environments and assist in the acquisition of channel state information (CSI), thereby avoiding repeated estimations and reducing computational complexity. In this paper, we propose a modularized multimodal framework to construct 3D-CF. Specifically, we first establish the 3D-CF model as a collection of CSI-tuples based on Rician fading channels, with each tuple comprising the low-altitude vehicle's (LAV) positions and its corresponding statistical CSI. In consideration of the heterogeneous structures of different prior data, we formulate the 3D-CF construction problem as a multimodal regression task, where the target channel information in the CSI-tuple can be estimated directly by its corresponding LAV positions, together with communication measurements and geographic environment maps. Then, a high-efficiency multimodal framework is proposed accordingly, which includes a correlation-based multimodal fusion (Corr-MMF) module, a multimodal representation (MMR) module, and a CSI regression (CSI-R) module. Numerical results show that our proposed framework can efficiently construct 3D-CF and achieve at least 27.5% higher accuracy than the state-of-the-art algorithms under different communication scenarios, demonstrating its competitive performance and excellent generalization ability. We also analyze the computational complexity and illustrate its superiority in terms of the inference time.

2603.20479 2026-05-26 cs.CY cs.AI cs.CL

Profiling learners' affective engagement: Emotion AI, intercultural pragmatics, and language learning

学习者情感投入画像:情感AI、跨文化语用学与语言学习

Robert Godwin-Jones

发表机构 * Virginia Commonwealth University(弗吉尼亚大学)

AI总结 本文探讨了情感AI在语言学习中的应用,特别是自动情感识别和模拟人类响应如何影响语用能力和互动能力的发展,并讨论了其个性化学习优势与情感操纵风险。

详情
Journal ref
Language Learning & Technology, 30(2), 14-35 (2026)
AI中文摘要

学习另一种语言可能是一个高度情感化的过程,通常以无数大大小小的挫折和成功为特征。对大多数学习者而言,语言学习并非遵循线性、可预测的路径,其曲折进程受动机(或去动机)变量影响,如个人特征、师生关系、学习材料以及对未来第二语言自我的梦想。虽然语言学习的某些方面(阅读、语法)相对机械,但其他方面可能充满压力且不可预测,尤其是用目标语言交谈。这种体验不仅需要结构和词汇知识,还需要以适合社会和文化语境的方式使用语言的能力。AI聊天机器人的出现为练习会话能力提供了新机会,既有优势(响应迅速、无评判),也有缺点(缺乏情感、文化偏见)。本文探讨了技术使用中产生的情感方面,特别是自动情感识别和AI系统中模拟的人类响应如何与语言学习以及语用和互动能力的发展相互作用。情感AI,即算法驱动对用户情感信号的解读,被认为能够实现更个性化的学习,适应感知到的学习者认知和情感状态。其他人则警告情感操纵以及不恰当和无效的用户画像。

英文摘要

Learning another language can be a highly emotional process, typically characterized by numerous frustrations and triumphs, big and small. For most learners, language learning does not follow a linear, predictable path, its zigzag course shaped by motivational (or demotivating) variables such as personal characteristics, teacher/peer relationships, learning materials, and dreams of a future L2 (second language) self. While some aspects of language learning (reading, grammar) are relatively mechanical, others can be stressful and unpredictable, especially conversing in the target language. That experience necessitates not only knowledge of structure and lexis, but also the ability to use the language in ways that are appropriate to the social and cultural context. A new opportunity to practice conversational abilities has arrived through the availability of AI chatbots, with both advantages (responsive, non-judgmental) and drawbacks (emotionally void, culturally biased). This column explores aspects of emotion as they arise in technology use and in particular how automatic emotion recognition and simulated human responsiveness in AI systems interface with language learning and the development of pragmatic and interactional competence. Emotion AI, the algorithmically driven interpretation of users' affective signals, has been seen as enabling greater personalized learning, adapting to perceived learner cognitive and emotional states. Others warn of emotional manipulation and inappropriate and ineffective user profiling

2603.20334 2026-05-26 cs.SE cs.AI

Procedural Refinement by LLM-driven Algorithmic Debugging for ARC-AGI-2

基于LLM驱动的算法调试的程序化精炼用于ARC-AGI-2

Yu-Ning Qiu, Lin-Feng Zou, Jiong-Da Wang, Xue-Rong Yuan, Wang-Zhou Dai

发表机构 * Nanjing University(南京大学)

AI总结 提出一种神经符号精炼方法ABPR,结合LLM与Prolog元解释器,通过证明树推导进行语义重检,在ARC-AGI-2上实现高通过率,并扩展到RAVEN风格推理任务。

详情
AI中文摘要

在高复杂度的抽象推理中,系统必须从少量示例或结构化观察中推断出潜在规则,并将其应用于未见实例。LLM可以将此类规则表达为程序,但基于对话的常规精炼主要停留在结果层面:它观察到答案或输出是错误的,而没有正式重新检查是哪个抽象、关系或变换导致了该结果。我们提出基于溯因的程序化精炼(ABPR),一种神经符号精炼方法,它将LLM与Prolog元解释器相结合。ABPR将每个候选程序视为潜在规则的可执行声明性假设,并将其SLD目标-子目标解析具体化为紧凑的证明树风格推导,遵循Shapiro的算法程序调试(APD)。在此视角下,精炼不仅仅是代码级调试,而是对模型假设规则进行语义重检。我们主要在ARC-AGI-2上评估ABPR,这是一个具有挑战性的少样本抽象规则归纳基准,涉及网格变换。使用Gemini-3-Flash的ABPR在公共评估集上达到56.67%的Pass@2,而使用GPT-5.5 xHigh的ABPR达到98.33%的Pass@2。在填空式I-RAVEN-X和A-I-RAVEN改编上的补充实验表明,相同的轨迹引导框架可以扩展到RAVEN风格的关系和类比抽象,而不仅限于ARC特定的网格任务。重复运行和敏感性分析表明,随着搜索广度和总搜索深度的增加,并行轨迹引导搜索减少了随机方差。

英文摘要

In high-complexity abstract reasoning, a system must infer a latent rule from a few examples or structured observations and apply it to unseen instances. LLMs can express such rules as programs, but ordinary conversation-based refinement is largely outcome-level: it observes that an answer or output is wrong without formally re-checking which abstraction, relation, or transformation justified that outcome. We propose \emph{Abduction-Based Procedural Refinement} (ABPR), a neuro-symbolic refinement approach that couples an LLM with a Prolog meta-interpreter. ABPR treats each candidate program as an executable declarative hypothesis of the latent rule and reifies its SLD goal--subgoal resolution into compact proof-tree-style derivations, following Shapiro's algorithmic program debugging (APD). In this view, refinement is not merely code-level debugging, but semantic re-checking of the model's hypothesised rule. We evaluate ABPR primarily on ARC-AGI-2, a challenging few-shot abstract rule induction benchmark over grid transformations. ABPR with Gemini-3-Flash achieves 56.67\% Pass@2, while GPT-5.5 xHigh with ABPR reaches 98.33\% Pass@2 on the public evaluation set. Supplementary experiments on fill-in-the-blank I-RAVEN-X and A-I-RAVEN adaptations provide evidence that the same trace-guided framework extends beyond ARC-specific grid tasks to RAVEN-style relational and analogical abstraction. Repeated-run and sensitivity analyses show that parallel trace-guided search reduces stochastic variance as search breadth and total search depth increase.

2603.00177 2026-05-26 cs.CR cs.HC cs.LG

Detecting Cognitive Signatures in Typing Behavior for Non-Intrusive Authorship Verification

通过打字行为检测认知特征以实现非侵入式作者验证

David Condrey

发表机构 * Writerslogic, Inc.(Writerslogic公司)

AI总结 利用大规模击键数据集中的认知负荷相关性(CLC)区分真实创作与机械转录,提出一种仅收集时间元数据的非侵入式验证框架,在保护隐私的同时实现85-95%的判别准确率,并证明认知特征对时序伪造攻击具有鲁棒性。

Comments 7 pages

详情
AI中文摘要

AI生成文本的激增加剧了对可靠作者验证的需求,然而当前基于输出的方法越来越不可靠。我们观察到,普通的打字界面捕获了丰富的认知特征,即击键时序中可测量的模式,反映了真实创作过程中的规划、翻译和修改阶段。基于包含超过1.36亿事件的大规模击键数据集,我们定义了认知负荷相关性(CLC),并表明它能区分真实创作与机械转录。我们提出了一种非侵入式验证框架,该框架在现有写作界面内运行,仅收集时间元数据以保护隐私。我们的分析评估估计,在所述假设下,判别准确率为85%至95%,同时通过证据量化限制生物特征泄露。我们分析了认知特征的对抗鲁棒性,表明它们能够抵抗击败运动级身份验证的时序伪造攻击,因为认知通道与语义内容纠缠在一起。我们得出结论,将作者验证重新定义为人机交互问题,为侵入式监控提供了一种保护隐私的替代方案。

英文摘要

The proliferation of AI-generated text has intensified the need for reliable authorship verification, yet current output-based methods are increasingly unreliable. We observe that the ordinary typing interface captures rich cognitive signatures, measurable patterns in keystroke timing that reflect the planning, translating, and revising stages of genuine composition. Drawing on large-scale keystroke datasets comprising over 136 million events, we define the Cognitive Load Correlation (CLC) and show it distinguishes genuine composition from mechanical transcription. We present a non-intrusive verification framework that operates within existing writing interfaces, collecting only timing metadata to preserve privacy. Our analytical evaluation estimates 85 to 95 percent discrimination accuracy under stated assumptions, while limiting biometric leakage via evidence quantization. We analyze the adversarial robustness of cognitive signatures, showing they resist timing-forgery attacks that defeat motor-level authentication because the cognitive channel is entangled with semantic content. We conclude that reframing authorship verification as a human-computer interaction problem provides a privacy-preserving alternative to invasive surveillance.

2602.22631 2026-05-26 cs.MS cs.LG cs.LO cs.NA cs.PL math.NA

TorchLean: Formalizing Neural Networks in Lean

TorchLean: 在 Lean 中形式化神经网络

Robert Joseph George, Jennifer Cruden, Will Adkisson, Xiangru Zhong, Huan Zhang, Anima Anandkumar

发表机构 * California Institute of Technology(加利福尼亚理工学院) Washington University in St. Louis(华盛顿大学圣路易斯分校) University of Illinois Urbana-Champaign(伊利诺伊大学厄巴纳-香槟分校)

AI总结 提出 TorchLean 框架,在 Lean 4 中统一神经网络的执行、验证与定理证明,通过共享语义弥合执行网络与分析工件之间的语义鸿沟。

Comments 55 pages

详情
AI中文摘要

神经网络越来越多地部署在科学、安全关键和任务关键型流程中,但验证和分析通常在定义和运行模型的编程环境之外进行。这在执行的网络与分析工件之间造成了语义鸿沟:保证可能依赖于关于算子语义、张量布局、预处理、浮点行为、图变换、加速内核和外部证书的隐式约定。我们提出 TorchLean,一个在 Lean 4 中形式化、执行和验证神经网络的统一框架。TorchLean 将学习模型视为可执行程序和数学对象,具有用于计算、验证和定理证明的共享语义。该框架为类型化张量、层、目标、优化器、自动微分和图程序提供了 PyTorch 风格的 API,具有急切和编译执行路径,这些路径降低到公共计算图表示。TorchLean 支持精确和有限精度张量语义、验证的反向模式微分、区间和仿射边界传播、CROWN/LiRPA 风格的证书检查、导入/导出工作流以及通过显式 FFI 边界的 CUDA 支持执行。它还包括用于注意力和 FlashAttention、状态空间序列模型、扩散和采样过程、概率核、强化学习目标和马尔可夫决策过程以及自监督目标(如掩码自编码、JEPA 风格的预测视图和基于方差/相关性的抗崩溃损失)的语义层。这些组件共同为验证机器学习提供了语义基础,其中可执行的神经网络工件、验证过程、运行时边界和数学声明可以在一个定理证明环境中陈述和关联。

英文摘要

Neural networks are increasingly deployed in scientific, safety critical, and mission critical pipelines, yet verification and analysis are often performed outside the programming environment that defines and runs the model. This creates a semantic gap between the executed network and the analyzed artifact: guarantees can depend on implicit conventions about operator semantics, tensor layouts, preprocessing, floating-point behavior, graph transformations, accelerated kernels, and external certificates. We present TorchLean, a unified framework for formalizing, executing, and verifying neural networks in Lean 4. TorchLean treats learned models as executable programs and mathematical objects with a shared semantics for computation, verification, and theorem proving. The framework provides a PyTorch style API for typed tensors, layers, objectives, optimizers, automatic differentiation, and graph programs, with eager and compiled execution paths that lower to a common computation-graph representation. TorchLean supports exact and finite-precision tensor semantics, verified reverse-mode differentiation, interval and affine bound propagation, CROWN/LiRPA style certificate checking, import/export workflows, and CUDA-backed execution through explicit FFI boundaries. It also includes semantic layers for attention and FlashAttention, state-space sequence models, diffusion and sampling processes, probability kernels, reinforcement-learning objectives and Markov decision processes, and self-supervised objectives such as masked autoencoding, JEPA-style predictive views, and variance/correlation-based anti-collapse losses. Together, these components provide a semantic foundation for verified machine learning, where executable neural network artifacts, verification procedures, runtime boundaries, and mathematical claims can be stated and related inside one theorem-proving environment.

2602.21479 2026-05-26 stat.ML cs.LG

Global Sequential Testing for Multi-Stream Auditing

多流审计的全局序贯检验

Beepul Bharti, Ambar Pal, Jeremias Sulam

发表机构 * Mathematical Institute for Data Science (MINDS), Johns Hopkins University(数据科学数学研究所(MINDS),约翰霍普金斯大学) Department of Biomedical Engineering, Johns Hopkins University(生物医学工程系,约翰霍普金斯大学) Amazon Responsible AI(亚马逊负责任人工智能)

AI总结 针对多数据流审计问题,提出基于鞅合并的序贯检验方法,在稀疏和密集备择假设下分别达到最优停止时间,并通过实验验证。

详情
AI中文摘要

在许多风险敏感领域,随着接收更多数据,持续审计机器学习系统以快速判断其是否按设计运行至关重要。该审计任务可建模为具有 $k$ 个数据流和全局零假设的序贯假设检验问题,其中全局零假设断言系统在所有 $k$ 个流上按预期运行。在备择假设下,使用 Bonferroni 校正的标准全局序贯检验,对于大 $k$ 和显著性水平 $α$,期望停止时间为 $O\left(\ln rac{k}{α} ight)$。在这项工作中,我们证明了依赖于通过平均和乘积规则合并鞅的高效序贯检验提供了改进的停止时间,从而对零假设具有更强的检验能力。利用这些结果,我们表明平衡检验在稀疏情形(仅少数非零流)下可以达到 Bonferroni 的 $O\left(\ln rac{k}{α} ight)$ 速率,同时在密集备择假设(许多非零流)下实现 $O\left( rac{1}{k}\ln rac{1}{α} ight)$。我们通过在合成数据和真实数据上的实验验证了我们的理论。

英文摘要

Across many risk-sensitive areas, it is critical to continuously audit machine learning systems as we receive more data to quickly determine if they are performing as designed. This auditing task can be modeled as a sequential hypothesis testing problem with $k$ data streams and a global null hypothesis that asserts the system operates as intended across all $k$ streams. Under the alternative, the standard global sequential test, which uses a Bonferroni correction, has an expected stopping time of $O\left(\ln \frac{k}α\right)$ for large $k$ and significance level $α$. In this work, we demonstrate that efficient sequential tests, relying on merging martingales via averaging and products rules, provide improved stopping times, and thus more powerful tests against the null. Using these results, we show that a balanced test can match the Bonferroni rate of $O\left(\ln \frac{k}α\right)$ in the sparse regime (just a few non-null streams) while achieving $O\left(\frac{1}{k}\ln \frac{1}α\right)$ under dense alternatives (many non-null steams). We validate our theory through experiments on both synthetic and real-world data.

2602.12224 2026-05-26 cs.GT cs.AI econ.TH

Two-Sided Time-Independent Regret for Matching Markets with Limited Interviews

有限面试匹配市场的双面时间无关遗憾

Amirmahdi Mirfakhar, Xuchuang Wang, Mengfan Xu, Hedyeh Beyhaghi, Mohammad Hajiesmaili

发表机构 * University of Massachusetts Amherst(马萨诸塞大学阿姆赫斯特分校)

AI总结 针对面试次数有限的匹配市场,提出利用面试作为提示进行双面学习,并通过策略性延迟纠正早期错误,实现与时间无关的遗憾界。

详情
AI中文摘要

双面匹配平台依赖双方的偏好,但参与者只能评估一小部分潜在伙伴。在实践中,他们使用低成本的匹配前筛选(例如面试、个人资料浏览或试用任务)在提交申请和录用之前形成有噪声的印象。我们研究了带有面试的匹配市场中的赌博机学习,将这些交互建模为查询的提示(hints)~\citep{DBLP:conf/innovations/BhaskaraGIKM23},这些提示向双方揭示部分偏好信息,同时限制后续申请。我们的框架还允许企业方的不确定性:企业像代理人一样学习自己的偏好,并可能犯早期招聘错误。为了解决这个问题,我们引入了策略性延迟(strategic deferral),这是一种企业方行动,允许临时空缺,纠正过早的承诺,并在粗略匿名反馈下实现去中心化学习。我们为中心化和去中心化市场设计了算法,并表明每轮恒定数量的面试足以实现与时间无关的遗憾,优于已知没有面试时的$O(\log T)$保证。我们的界是接近最优的:中心化保证在信息论下界的$m$倍以内,而去中心化算法在结构化市场中达到多项式因子,在一般市场中仍然与时间无关。

英文摘要

Two-sided matching platforms rely on preferences from both sides, yet participants can evaluate only a small fraction of potential partners. In practice, they use low-cost pre-match screening, e.g., interviews, profile views, or trial tasks, to form noisy impressions before committing to applications and offers. We study bandit learning in matching markets with interviews, modeling these interactions as queried \emph{hints}~\citep{DBLP:conf/innovations/BhaskaraGIKM23} that reveal partial preference information to both sides while constraining subsequent applications. Our framework also allows firm-side uncertainty: firms, like agents, learn their preferences and may make early hiring mistakes. To address this, we introduce strategic deferral, a firm-side action that permits temporary vacancy, corrects premature commitments, and enables decentralized learning under coarse anonymous feedback. We design algorithms for centralized and decentralized markets and show that a constant number of interviews per round suffices for horizon-independent regret, improving over the $O(\log T)$ guarantees known without interviews. Our bounds are near-optimal: the centralized guarantee is within a factor $m$ of an information-theoretic lower bound, while decentralized algorithms match it up to polynomial factors in structured markets and remain horizon-independent in general markets.

2602.09431 2026-05-26 cs.CR cs.CV

Grounding-Driven Attack: Improving Encoder-based Adversarial Transferability against Large Vision-Language Models

基于文本驱动的攻击:提升编码器对抗迁移性以攻击大型视觉-语言模型

Xinwei Zhang, Li Bai, Tianwei Zhang, Youqian Zhang, Qingqing Ye, Yingnan Zhao, Ruochen Du, Haibo Hu

发表机构 * The Hong Kong Polytechnic University(香港理工大学) Nanyang Technological University(南洋理工大学) Harbin Engineering University(哈尔滨工程大学)

AI总结 提出文本驱动攻击(GDA),通过将扰动优化与文本接地证据对齐,并采用接地感知扰动分配和接地中心证据破坏策略,显著提升编码器攻击在黑盒大型视觉-语言模型上的迁移性。

Comments Under review;

详情
AI中文摘要

大型视觉-语言模型(LVLMs)在多模态任务中取得了令人印象深刻的性能,但它们对视觉输入的依赖使其面临对抗性威胁。编码器攻击通过仅通过视觉编码器生成扰动,为端到端优化提供了一种高效的替代方案。然而,现有的编码器攻击通常假设替代编码器与受害LVLM的视觉编码器相同或相似。在这项工作中,我们系统研究了它们在具有异构LVLM架构的更现实的黑盒部署中的迁移性。我们发现,模型特定的视觉证据在不同模型间不一致,而文本条件接地区域与标题相关证据更紧密相关,并提供了更稳定的迁移目标。然而,现有攻击与这些区域的对齐较弱且不足以破坏它们。受这些发现启发,我们提出了文本驱动攻击(GDA),它将扰动优化与文本接地证据对齐。GDA结合了接地感知扰动分配(将扰动预算集中在接地证据区域)和接地中心证据破坏(增强其全局和局部破坏)。在多种受害模型和任务上的实验表明,GDA在黑盒迁移中始终优于现有的编码器攻击。这些结果突显了文本接地证据在对抗迁移性中的核心作用,并激励了接地感知的鲁棒性评估和防御设计。

英文摘要

Large vision-language models (LVLMs) have achieved impressive performance across multimodal tasks, but their reliance on visual inputs exposes them to adversarial threats. Encoder-based attacks provide an efficient alternative to end-to-end optimization by crafting perturbations through the vision encoder alone. However, existing encoder-based attacks often assume that the surrogate encoder is identical or similar to the victim LVLM's vision encoder. In this work, we present a systematic study of their transferability in more realistic black-box deployments with heterogeneous LVLM architectures. We find that model-specific visual evidence is inconsistent across models, whereas text-conditioned grounding regions are more closely tied to caption-relevant evidence and provide a more stable transfer target. However, existing attacks remain weakly aligned with and insufficiently disrupt these regions. Motivated by these findings, we propose Grounding-Driven Attack (GDA), which aligns perturbation optimization with text-grounded evidence. GDA combines Grounding-Aware Perturbation Allocation to concentrate perturbation budget on grounded evidence regions with Grounding-Centric Evidence Disruption to intensify their global and local disruption. Experiments across diverse victim models and tasks show that GDA consistently outperforms existing encoder-based attacks in black-box transfer. These results highlight the central role of text-grounded evidence in adversarial transferability and motivate grounding-aware robustness evaluation and defense design.

2602.04653 2026-05-26 cs.CR cs.LG

Inference-Time Backdoors via Chat Templates: From LLM Supply Chains to Agentic System Compromise

通过聊天模板的推理时后门:从LLM供应链到代理系统妥协

Ariel Fogel, Omer Hofman, Eilon Cohen, Roman Vainshtein

发表机构 * Fujitsu Research of Europe(富士通欧洲研究)

AI总结 提出一种通过恶意修改聊天模板实现推理时后门攻击的方法,无需修改模型权重或训练数据,在LLM、代理和多代理系统层面均能成功攻击,且能绕过现有防御。

Comments V3: Accepted to ICLR 2026 Trustworthy AI Workshop, V4: Submitted to CCS 2026

详情
AI中文摘要

开源权重语言模型越来越多地用于生产环境,带来了新的安全挑战。一个突出的威胁是后门攻击,攻击者嵌入在特定条件下激活的隐藏行为。先前的工作假设攻击者能够访问训练流程或部署基础设施。我们提出了一种新颖的攻击面,不需要这些:即“聊天模板”。聊天模板是在每次推理调用时执行的可执行程序,通常用Jinja2实现,占据用户输入和模型处理之间的特权位置。我们表明,分发带有恶意修改模板的模型的攻击者可以在不修改模型权重、投毒训练数据或控制运行时基础设施的情况下植入推理时后门。我们在三个部署层级评估了这种攻击。在LLM层面,触发的后门将事实准确性从平均90%降低到15%,并诱导攻击者控制的URL发射,成功率超过80%,而良性输入没有可测量的退化;这些结果在十八个模型上成立。在代理层面,模板后门在两个基准测试(涵盖3868个回合)中劫持了工具使用,绕过了基准测试提供的所有测试过的注入防御,同时在缺乏触发条件时完全休眠。在多代理系统层面,我们展示了单个投毒工件如何损害真实世界的代理部署,并向下游传播供应链代码投毒。投毒工件在最大的开源模型分发平台上逃避了所有安全扫描;并且由于负载在用户输入处理之前由模板渲染,它在架构上无法被输入级防御(如提示注入护栏)触及。这些结果确立了聊天模板在开源权重AI供应链中作为一种可靠且未受防御的攻击方式。

英文摘要

Open-weight language models are increasingly used in production settings, raising new security challenges. One prominent threat is backdoor attacks, in which adversaries embed hidden behaviors that activate under specific conditions. Previous work has assumed that adversaries have access to training pipelines or deployment infrastructure. We propose a novel attack surface requiring neither: the "chat template". Chat templates are executable programs invoked at every inference call, often implemented in Jinja2, that occupy a privileged position between user input and model processing. We show that an adversary who distributes a model with a maliciously modified template can implant an inference-time backdoor without modifying model weights, poisoning training data, or controlling runtime infrastructure. We evaluate this attack across three deployment tiers. At the LLM level, triggered backdoors reduce factual accuracy from 90% to 15% on average and induce attacker-controlled URL emission with success rates exceeding 80%, while benign inputs show no measurable degradation; these results hold across eighteen models. At the agent level, template backdoors hijack tool-use across two benchmarks spanning 3,868 episodes, bypassing every tested injection defense offered by the benchmarks while remaining fully dormant absent the trigger. At the multi-agent system level, we demonstrate how a single poisoned artifact compromises a real-world agentic deployment and propagates supply-chain code poisoning downstream. The poisoned artifacts evade all security scans on the largest open model distribution platform; and because the payload is rendered by the template before user input is processed, it is architecturally unreachable by input-level defenses such as prompt injection guardrails. These results establish chat templates as a reliable and undefended attack in the open-weight AI supply chain.

2602.02605 2026-05-26 cs.NE cs.AI cs.CL q-bio.NC

Fine-Tuning Language Models to Know What They Know

微调语言模型使其了解自身所知

Sangjun Park, Elliot Meyerson, Xin Qiu, Risto Miikkulainen

发表机构 * The University of Texas at Austin(德克萨斯大学奥斯汀分校) Cognizant AI Lab(认知人工智能实验室)

AI总结 本文提出一种框架,通过进化策略对齐方法(ESMA)在控制偏差的同时提升大语言模型的元认知能力,并在未见数据集、语言和新知识上展现出鲁棒泛化性。

Comments Preprint

详情
AI中文摘要

评估大语言模型(LLMs)的真实元认知能力因偏差和启发式方法而困难。本文提出一个框架,在控制这些偏差的同时测量和增强LLM的元认知能力。建立了使用$d'_{\rm type2}$指标的测量方法以隔离元认知能力。提出了元认知对齐进化策略(ESMA),在未见数据集、语言和新获取的知识上展现出鲁棒泛化性。最后,参数分析表明这些改进由一组稀疏参数驱动,为定向元认知优化提供了新途径。

英文摘要

Evaluating true metacognition in Large Language Models (LLMs) is difficult due to biases and heuristics. This paper presents a framework to measure and enhance LLM metacognition while controlling for these biases. A measurement method using the $d'_{\rm type2}$ metric is established to isolate metacognitive ability. The Evolution Strategy for Metacognitive Alignment (ESMA) is proposed, demonstrating robust generalization across unseen datasets, languages, and newly acquired knowledge. Finally, parameter analysis reveals that these improvements are driven by a sparse set of parameters, offering new pathways for targeted metacognitive optimization.

2601.22925 2026-05-26 cs.IR cs.AI cs.LG

BEAR: Towards Beam-Search-Aware Optimization for Recommendation with Large Language Models

BEAR:面向大语言模型推荐中束搜索感知的优化

Weiqin Yang, Bohao Wang, Zhenxiang Xu, Jiawei Chen, Shengjia Zhang, Jingbang Chen, Canghong Jin, Can Wang

发表机构 * Zhejiang University(浙江大学) The Chinese University of Hong Kong, Shenzhen(香港中文大学(深圳)) Hangzhou City University(杭州市城市大学)

AI总结 针对监督微调与束搜索推理之间的不一致性,提出BEAR正则化方法,通过确保正例每个token在解码步骤中排名前B来避免过早剪枝,显著提升推荐性能。

Comments Accepted by SIGIR 2026

详情
AI中文摘要

近年来,利用大语言模型(LLM)进行推荐的研究迅速增长。这些方法通常采用监督微调(SFT)使LLM适应推荐场景,并在推理时使用束搜索高效检索前B个推荐项。然而,我们发现了关键的训练-推理不一致性:虽然SFT优化正例的整体概率,但即使这些项具有高整体概率,也不能保证它们会被束搜索检索到。由于贪心剪枝机制,束搜索可能会在正例的前缀概率不足时过早丢弃它。为了解决这种不一致性,我们提出了BEAR(束搜索感知正则化),一种新的微调目标,在训练中显式考虑束搜索行为。BEAR不直接模拟每个训练实例的束搜索(计算代价过高),而是强制执行一个宽松的必要条件:正例中的每个token在每个解码步骤中必须排在前B个候选token中。该目标有效降低了错误剪枝的风险,同时与标准SFT相比仅增加可忽略的计算开销。在四个真实世界数据集上的大量实验表明,BEAR显著优于强基线。代码可在https://github.com/Tiny-Snow/BEAR-SIGIR-2026获取。

英文摘要

Recent years have seen a rapid surge in research leveraging Large Language Models (LLMs) for recommendation. These methods typically employ supervised fine-tuning (SFT) to adapt LLMs to recommendation scenarios, and utilize beam search during inference to efficiently retrieve $B$ top-ranked recommended items. However, we identify a critical training-inference inconsistency: while SFT optimizes the overall probability of positive items, it does not guarantee that such items will be retrieved by beam search even if they possess high overall probabilities. Due to the greedy pruning mechanism, beam search can prematurely discard a positive item once its prefix probability is insufficient. To address this inconsistency, we propose BEAR (Beam-SEarch-Aware Regularization), a novel fine-tuning objective that explicitly accounts for beam search behavior during training. Rather than directly simulating beam search for each instance during training, which is computationally prohibitive, BEAR enforces a relaxed necessary condition: each token in a positive item must rank within the top-$B$ candidate tokens at each decoding step. This objective effectively mitigates the risk of incorrect pruning while incurring negligible computational overhead compared to standard SFT. Extensive experiments across four real-world datasets demonstrate that BEAR significantly outperforms strong baselines. Code is available at https://github.com/Tiny-Snow/BEAR-SIGIR-2026 .

2601.14340 2026-05-26 cs.CR cs.LG

Turn-Based Structural Triggers: Prompt-Free Backdoors in Multi-Turn LLMs

基于回合的结构性触发器:多轮LLM中的无提示后门

Yiyang Lu, Jinwen He, Yue Zhao, Kai Chen, Ruigang Liang, Cheng Hong, Yingjun Zhang

发表机构 * School of Cyber Security, University of Chinese Academy of Sciences(中国科学院大学网络安全学院) Institute of Information Engineering, Chinese Academy of Sciences(中国科学院信息工程研究所) Institute of Software, Chinese Academy of Sciences(中国科学院软件研究所) Ant Group(蚂蚁集团)

AI总结 提出一种利用对话结构(回合索引)作为触发器的后门攻击方法TST,无需用户输入即可激活,实现高攻击成功率并绕过提示中心防御。

详情
AI中文摘要

大型语言模型(LLM)被广泛集成到交互式系统中,如对话代理和面向任务的助手。这一日益增长的生态系统也带来了供应链风险,攻击者可以分发被污染的模型,降低下游可靠性和用户信任。现有的后门攻击和防御大多以提示为中心,关注用户可见的触发器,而忽视了多轮对话中的结构信号。我们提出了基于回合的结构性触发器(TST),这是一种从对话结构激活的后门攻击,使用回合索引作为触发器,且独立于用户输入。这造成了一种结构条件性的可靠性风险:带有后门的模型可以通过以提示为中心的检查和标准效用评估,但在选定的对话位置执行攻击者指定的行为,而用户输入中没有任何触发器。在四个开源LLM家族中,TST实现了99.52%的平均攻击成功率,同时基本保持了非触发效用,并且在未见过的对话数据集和代表性防御中仍然有效。这些结果揭示了对话结构是一个被忽视的攻击面,并激励了超越提示检查的结构感知多轮审计。

英文摘要

Large Language Models (LLMs) are widely integrated into interactive systems such as dialogue agents and task-oriented assistants. This growing ecosystem also raises supply-chain risks, where adversaries can distribute poisoned models that degrade downstream reliability and user trust. Existing backdoor attacks and defenses are largely prompt-centric, focusing on user-visible triggers while overlooking structural signals in multi-turn conversations. We propose Turn-based Structural Trigger (TST), a backdoor attack that activates from dialogue structure, using the turn index as the trigger and remaining independent of user inputs. This creates a structure-conditioned reliability risk: a backdoored model can pass prompt-centric checks and standard utility evaluations, yet execute attacker-specified behaviors at selected dialogue positions without any trigger in the user input. Across four open-source LLM families, TST achieves a 99.52% average ASR while largely preserving non-triggered utility, and remains effective across unseen dialogue datasets and representative defenses. These results reveal dialogue structure as an overlooked attack surface and motivate structure-aware multi-turn auditing beyond prompt inspection.

2601.10494 2026-05-26 stat.ML cs.LG

CROCS: A Two-Stage Clustering Framework for Behaviour-Centric Consumer Segmentation with Smart Meter Data

CROCS:一种基于智能电表数据的以行为为中心的消费者细分的两阶段聚类框架

Luke W. Yerbury, Ricardo J. G. B. Campello, G. C. Livingston, Mark Goldsworthy, Lachlan O'Neil

发表机构 * Ausgrid(澳大利亚电网公司)

AI总结 提出CROCS两阶段聚类框架,通过消费者日常负荷曲线的独立聚类和基于加权最小距离的集合间比较,实现鲁棒且可扩展的消费者行为细分。

详情
AI中文摘要

随着电网运营商面临可再生能源整合和电气化推广带来的不确定性增加,需求侧管理(DSM)——特别是需求响应(DR)——作为一种平衡现代电力系统的成本效益机制引起了广泛关注。全球持续部署的智能电表提供了前所未有的消费数据量,使得基于实际用电行为的消费者细分成为可能,有望为设计更有效的DSM和DR计划提供信息。然而,现有的基于聚类的细分方法未能充分反映消费者的行为多样性,通常依赖于严格的时间对齐,并且在存在异常值、缺失数据或大规模部署时表现不佳。为了解决这些挑战,我们提出了一种新颖的两阶段聚类框架——优化消费者细分的聚类表示(CROCS)。在第一阶段,每个消费者的每日负荷曲线被独立聚类,形成代表性负荷集(RLS),提供其典型日间消费行为的紧凑摘要。在第二阶段,使用加权最小距离和(WSMD)对消费者进行聚类,这是一种新颖的集合间度量,通过考虑这些行为的普遍性和相似性来比较RLS。最后,对WSMD诱导图进行社区检测,揭示体现定义消费者群体的共享日间行为的高阶原型,从而增强所得聚类的可解释性。在合成和真实澳大利亚智能电表数据集上的大量实验表明,CROCS能够捕捉消费者内部变异性,发现同步和异步行为相似性,对异常值和缺失数据保持鲁棒性,并通过自然并行化实现高效扩展。这些结果...

英文摘要

With grid operators confronting rising uncertainty from renewable integration and a broader push toward electrification, Demand-Side Management (DSM) -- particularly Demand Response (DR) -- has attracted significant attention as a cost-effective mechanism for balancing modern electricity systems. Unprecedented volumes of consumption data from a continuing global deployment of smart meters enable consumer segmentation based on real usage behaviours, promising to inform the design of more effective DSM and DR programs. However, existing clustering-based segmentation methods insufficiently reflect the behavioural diversity of consumers, often relying on rigid temporal alignment, and faltering in the presence of anomalies, missing data, or large-scale deployments. To address these challenges, we propose a novel two-stage clustering framework -- Clustered Representations Optimising Consumer Segmentation (CROCS). In the first stage, each consumer's daily load profiles are clustered independently to form a Representative Load Set (RLS), providing a compact summary of their typical diurnal consumption behaviours. In the second stage, consumers are clustered using the Weighted Sum of Minimum Distances (WSMD), a novel set-to-set measure that compares RLSs by accounting for both the prevalence and similarity of those behaviours. Finally, community detection on the WSMD-induced graph reveals higher-order prototypes that embody the shared diurnal behaviours defining consumer groups, enhancing the interpretability of the resulting clusters. Extensive experiments on both synthetic and real Australian smart meter datasets demonstrate that CROCS captures intra-consumer variability, uncovers both synchronous and asynchronous behavioural similarities, and remains robust to anomalies and missing data, while scaling efficiently through natural parallelisation. These results...

2512.23956 2026-05-26 stat.ML cs.LG

Implicit geometric regularization in flow matching via density weighted Stein operators

通过密度加权Stein算子的流匹配中的隐式几何正则化

Shinto Eguchi

发表机构 * The Institute of Statistical Mathematics(统计数学研究所)

AI总结 提出γ-流匹配(γ-FM),通过动态密度加权策略隐式正则化高维空间中的回归几何,改善向量场平滑性和采样效率。

Comments Revised version

详情
AI中文摘要

流匹配(FM)已成为连续归一化流的一个强大范式,但标准FM隐式地在整个环境空间上进行未加权的$L^2$回归。在高维空间中,这导致了一个根本性的低效:绝大多数积分区域由低密度的“空洞”区域组成,其中目标速度场通常是混沌或定义不良的。在本文中,我们提出了γ-流匹配(γ-FM),一种密度加权变体,它将回归几何与底层概率流对齐。虽然密度加权是可取的,但朴素实现需要评估难以处理的目标密度。我们通过引入一种动态密度加权策略来规避这一点,该策略直接从训练粒子估计目标密度。这种方法使我们能够动态降低空洞区域中的回归损失,而不损害FM的无模拟特性。理论上,我们证明了γ-FM在赋予γ-Stein度量的统计流形上最小化传输成本。谱分析进一步表明,这种几何结构引入了隐式Sobolev正则化,有效地抑制了空洞区域中的高频振荡。实验上,γ-FM显著改善了高维潜在数据集上的向量场平滑性和采样效率,同时展示了对异常值的内在鲁棒性。

英文摘要

Flow Matching (FM) has emerged as a powerful paradigm for continuous normalizing flows, yet standard FM implicitly performs an unweighted $L^2$ regression over the entire ambient space. In high dimensions, this leads to a fundamental inefficiency: the vast majority of the integration domain consists of low-density ``void'' regions where the target velocity fields are often chaotic or ill-defined. In this paper, we propose {$γ$-Flow Matching ($γ$-FM)}, a density-weighted variant that aligns the regression geometry with the underlying probability flow. While density weighting is desirable, naive implementations would require evaluating the intractable target density. We circumvent this by introducing a Dynamic Density-Weighting strategy that estimates the \emph{target} density directly from training particles. This approach allows us to dynamically downweight the regression loss in void regions without compromising the simulation-free nature of FM. Theoretically, we establish that $γ$-FM minimizes the transport cost on a statistical manifold endowed with the $γ$-Stein metric. Spectral analysis further suggests that this geometry induces an implicit Sobolev regularization, effectively damping high-frequency oscillations in void regions. Empirically, $γ$-FM significantly improves vector field smoothness and sampling efficiency on high-dimensional latent datasets, while demonstrating intrinsic robustness to outliers.