arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 2154
专题追踪
2605.19629 2026-05-20 stat.ML cs.LG math.OC

Gaussian Approximation and Multiplier Bootstrap for Federated Linear Stochastic Approximation

高斯近似与乘数自助法用于联邦线性随机逼近

Ilya Levin, Maksim Shuklin, Eric Moulines, Paul Mangold, Sergey Samsonov

发表机构 * HSE University(莫斯科国立高等经济学院) MBZUAI(马克斯·普朗克智能系统研究所) CMAP, CNRS, École Polytechnique, Institut Polytechnique de Paris(巴黎高等理工学院应用数学与计算科学实验室,法国国家科学研究中心)

AI总结 本文建立了联邦线性随机逼近的Berry-Esseen型界,首次明确捕捉通信-计算权衡和异质性误差项的联邦高斯近似,量化了局部步长、局部更新次数和异质性对收敛速率的影响。

详情
AI中文摘要

在本文中,我们为联邦线性随机逼近(LSA)建立了Berry-Esseen型界。我们的结果提供了首个明确捕捉通信-计算权衡和异质性误差项的联邦高斯近似,量化了局部步长、局部更新次数和异质性对收敛速率的影响。我们为两种情况提供了结果:(i)常数步长域和(ii)递减步长与递增局部迭代次数,恢复了Bonnerjee等人[2025]最近的速率作为特殊情况。作为我们结果的主要应用,我们开发了一个在线乘数自助法用于最后迭代的推断,避免了显式估计渐近协方差矩阵,并获得了该过程的非渐近有效性保证。

英文摘要

In this paper, we establish Berry-Esseen-type bounds for federated linear stochastic approximation (LSA). Our results provide the first federated Gaussian approximations for LSA that explicitly capture communication-computation trade-offs and heterogeneity-aware error terms, quantifying the effects of local step size, number of local updates, and heterogeneity on convergence rates. We present results for both (i) constant step size regime and (ii) decreasing step size with an increasing number of local iterations, recovering the recent rates of Bonnerjee et al. [2025] as a special case. As a primary application of our results, we develop an online multiplier bootstrap procedure for inference on the last iterate, which avoids explicit estimation of the asymptotic covariance matrix, and obtain non-asymptotic validity guarantees for this procedure.

2605.19621 2026-05-20 eess.IV cs.LG cs.NA math.NA

Diffusion Graph Posterior Sampling for Nonlinear Inverse Problems with Application to Electrical Impedance Tomography

基于扩散后验采样的图结构数据非线性反问题求解方法及其在电阻抗断层成像中的应用

Giovanni S. Alberti, Damiana Lazzaro, Serena Morigi, Matteo Santacesaria, Shibo Wang

发表机构 * MaLGa Center, Department of Mathematics, University of Genova(马尔加中心,数学系,热那亚大学) Department of Mathematics, University of Bologna(数学系,博洛尼亚大学) Department of Mathematics, Harbin Institute of Technology(数学系,哈尔滨工业大学)

AI总结 本文提出了一种扩展扩散后验采样(DPS)到图结构数据的框架,通过在二维三角网格上开发无条件分数基于扩散模型来学习物理解空间的准确先验,并引入正则化变体RDPS,结合总变差和广义Tikhonov等显式正则化项,以缓解严重病态问题,实验表明RDPS在合成和真实2D EIT数据集上产生稳定且物理合理的重建。

详情
AI中文摘要

深度生成模型已发展为解决反问题的最先进方法,但将其应用于PDE反问题,如电阻抗断层成像(EIT)仍具挑战性。由于物理领域自然离散为无结构网格而非规则网格,标准卷积架构往往不足。本文提出了一种新的框架,将扩散后验采样(DPS)扩展到图结构数据。我们开发了直接在2D三角网格上无条件分数基于扩散模型,以学习物理解空间的准确先验。此外,我们引入正则化变体RDPS,结合总变差和广义Tikhonov等显式正则化项,以补充隐含扩散先验并缓解严重病态问题。在合成和真实2D EIT数据集上的广泛实验表明,RDPS产生稳定、物理合理的重建。我们的方法能够很好地推广到非分布包含几何形状,对测量噪声具有高度鲁棒性,并在重建准确性和伪影减少方面优于当前最先进的求解器(例如GPnP-BM3D、DP-SGS)

英文摘要

Deep generative models have emerged as state-of-the-art for solving inverse problems, but applying them to inverse problems for PDEs, like electrical impedance tomography (EIT) remains challenging. Because physical domains are naturally discretized as unstructured meshes rather than regular grids, standard convolutional architectures are often inadequate. In this paper, we propose a novel framework that extends diffusion posterior sampling (DPS) to graph-structured data. We develop an unconditional score-based diffusion model directly on a 2D triangular mesh to learn an accurate prior over the physical solution space. Furthermore, we introduce a regularized variant, RDPS, which incorporates explicit regularization terms, such as total variation and generalized Tikhonov, to complement the implicit diffusion prior and mitigate severe ill-posedness. Extensive experiments on synthetic and real 2D EIT datasets demonstrate that RDPS produces stable, physically plausible reconstructions. Our approach generalizes well to out-of-distribution inclusion geometries, is highly robust to measurement noise, and outperforms current state-of-the-art solvers (e.g., GPnP-BM3D, DP-SGS) in reconstruction accuracy and artifact reduction.

2605.19610 2026-05-20 stat.ML cs.LG

Posterior Contraction of Lévy Adaptive B-spline Regression in Besov Spaces

Lévy自适应B样条回归在Besov空间中的后验收缩

Jeunghun Oh, Sewon Park, Jaeyong Lee

发表机构 * Department of Statistics, Seoul National University(首尔国立大学统计系) Department of Statistics, Sookmyung Women’s University(淑明女子大学统计系)

AI总结 本文研究了Lévy自适应B样条回归模型在Besov空间中的后验收缩性质,证明了该模型在非参数回归框架中能够以接近最优的速率收敛到真实函数,同时自动适应未知的光滑度。

详情
AI中文摘要

我们研究了Lévy自适应B样条(LABS)回归模型的渐近性质,这是一种将B样条核纳入Lévy自适应回归核(LARK)模型的贝叶斯非参数方法。LABS应用具有不同次数的样条,并独立定义结点,从而获得一个灵活的模型类,能够适应真实函数的不规则和局部结构特征。在单变量随机设计和高斯误差的非参数回归框架中,我们证明了LABS后验在Besov类中以接近最优的速率收敛到真实函数,直至一个对数因子,同时自动适应未知的光滑度。本研究填补了文献中的空白,因为关于LARK模型在Besov空间中的后验收缩的理论结果仍然很少。在Besov空间的标准测试函数(包括Blocks、Bumps、HeaviSine和Doppler)上的模拟实验补充了理论结果,并展示了LABS的实用价值。

英文摘要

We investigate the asymptotic properties of the Lévy Adaptive B-spline (LABS) regression model, a Bayesian nonparametric method that incorporates B-spline kernels into the Lévy Adaptive Regression Kernel (LARK) model. LABS applies splines of varying degrees with independently defined knots, yielding a flexible model class capable of adapting to irregular and locally structured features of the true function. Within the nonparametric regression framework with univariate random design and Gaussian errors, we establish that the LABS posterior contracts around the true function in Besov classes at nearly minimax-optimal rates, up to a logarithmic factor, while adapting automatically to unknown smoothness. This study contributes to filling a gap in the literature, where theoretical results on posterior contraction of the LARK model in Besov spaces remain scarce. Simulation experiments on standard test functions in Besov spaces, including Blocks, Bumps, HeaviSine, and Doppler, complement the theoretical results and demonstrate the practical utility of LABS.

2605.19565 2026-05-20 physics.flu-dyn cs.LG

HiLiftAeroML: High-Fidelity Computational Fluid Dynamics Dataset for High-Lift Aircraft Aerodynamics

HiLiftAeroML:高保真计算流体力学数据集用于高升力飞机气动性能

Neil Ashton, Adam Clark, Liam Heidt, Christopher Ivey, Sanjeeb Bose, Rahul Agrawal, Konrad Goc, Rishi Ranade, Corey Adams, Peter Sharpe, Sheel Nidhan, Semit Akkurt, Daniel Leibovici, Jean Kossaifi

发表机构 * nvidia

AI总结 本文介绍了一个首个开源的高保真计算流体力学数据集,用于AI代理模型开发,该数据集包含1800个样本,源自180种几何变体和10个攻角的NASA通用研究模型(CRM)几何体,用于AIAA高升力预测工作坊系列。该数据集的创新之处在于使用GPU加速的高保真显式壁模式LES方法进行每个模拟,使用300M到500M的适应性网格,以确保在已知的稳态RANS方法在飞行包线部分的挑战下尽可能高的精度。整个数据集(几何体、时间平均体积和表面变量以及积分力)免费提供,带有宽松的开源许可(CC-BY-4.0)。通过公开发布此数据,我们旨在加速航空航天工业中AI代理建模的研究与开发。

详情
AI中文摘要

本文描述了首个开源的高保真计算流体力学数据集,用于AI代理模型开发。该数据集由1800个样本组成,源自180种几何变体和10个攻角的高升力NASA通用研究模型(CRM)几何体,用于AIAA高升力预测工作坊系列。该数据集的一个创新点是使用GPU加速的高保真显式壁模式LES方法进行每个模拟,使用300M到500M的适应性网格。这确保了在已知的稳态RANS方法在飞行包线部分的挑战下尽可能高的精度。整个数据集(几何体、时间平均体积和表面变量以及积分力)免费提供,带有宽松的开源许可(CC-BY-4.0)。通过公开发布此数据,我们旨在加速航空航天工业中AI代理建模的研究与开发。

英文摘要

This paper describes the first-ever open-source high-fidelity CFD dataset of a high-lift aircraft for the purpose of AI surrogate model development. The dataset is composed of 1800 samples, arising from 180 geometry variants and 10 angles of attack for the high-lift NASA Common Research Model (CRM) geometry, used within the AIAA High-Lift Prediction Workshop series. One of the novelties of this dataset is the use of a GPU-accelerated high-fidelity explicit, wall-modeled LES approach for each simulation, using solution-adapted grids between 300M and 500M cells. This ensures the greatest possible accuracy given known challenges in steady-state RANS approaches for these portions of the flight envelope. The entire dataset (geometries, time-averaged volume and surface variables and integral forces) are available, free of charge with a permissive open-source license (CC-BY-4.0). By making this data publicly available, we aim to accelerate the research and development of AI surrogate modeling within the aerospace industry.

2605.19557 2026-05-20 stat.ML cs.LG

Density-Ratio Losses for Post-Hoc Learning to Defer

基于密度比损失的后验学习延迟

Alexander Soen, Ragnar Thobaben, Joakim Jaldén, Richard Nock

发表机构 * KTH(皇家理工学院) Google Research(谷歌研究)

AI总结 本文研究了后验学习延迟(L2D)问题,通过理想分布的视角定义延迟,并提出基于密度比损失的CPE损失函数,通过阈值判断延迟决策,从而在不重新训练的情况下调整延迟率,同时揭示了Chow规则与专家倾斜贝叶斯后验之间的联系。

Comments Preprint

详情
AI中文摘要

我们通过理想分布的视角研究后验学习延迟(L2D)。理想分布被定义为在其中模型能够取得低损失的数据分布的密度比重加权。我们通过将密度比估计还原为类别概率估计,推导出用于后验L2D评分器的DR CPE损失。延迟决策通过阈值化评分器进行,允许在不重新训练的情况下调整延迟率。对于基于KL的理想分布,我们的延迟规则在原始分布下恢复Chow规则,并在理想分布是联合或边缘分布时与专家倾斜的贝叶斯后验建立联系。实验表明,我们的方法在与常见基线相比具有竞争力,并且在不同数据集设置下更加稳健。更广泛地说,我们的结果将后验L2D视为理想分布之间的密度比学习,连接了Chow式规则、专家比较以及阐明了与异常检测等其他学习设置的相关联系。

英文摘要

We study post-hoc Learning to Defer (L2D) through the lens of ideal distributions: divergence-regularized reweightings of the data distribution under which a model attains low loss. We define deferral via the density-ratio between a model's and an expert's ideals. Using the reduction from density-ratio estimation to class-probability estimation, we derive the DR CPE losses for post-hoc L2D scorers. Deferral decisions are then made by thresholding the scorer, allowing deferral rates to be adjusted without retraining. For KL-based ideal distributions, our deferral rules recovers Chow's rule under the original distribution and a connection to an expert-tilted Bayes posterior -- which incorporates the expert's performance -- depending on if the ideal distributions are joint or marginal distributions. Experimentally, our approach is competitive compared to common baselines and more robust across dataset settings. More broadly, our results cast post-hoc L2D as density-ratio learning between ideal distributions, bridging Chow-style rules, expert comparison, and elucidating connections to related learning settings including anomaly detection.

2605.19551 2026-05-20 cs.GR cs.CV

AnchorFlow: Editable SVG Reconstruction via Sparse Anchor Point Fields

AnchorFlow: 通过稀疏锚点场实现可编辑的SVG重建

Mengnan Jiang, Christian Franke, Michele Franco Adesso, Antonio Haas, Grace Li Zhang

发表机构 * Mercedes-Benz AG(梅赛德斯-奔驰公司) Technical University of Darmstadt(达姆施塔特技术大学)

AI总结 本文提出AnchorFlow框架,通过稀疏锚点场实现路径级锚点放置,解决图像到SVG重建中精度与可编辑性的平衡问题,实验表明其在保持高质量的同时显著降低可编辑复杂度。

详情
AI中文摘要

图像到SVG重建旨在生成忠实于位图输入且易于编辑的矢量图形。现有方法在如何参数化矢量结构上面临结构性权衡,包括图像由多少路径表示以及每个路径由多少锚点定义。高保真方法通常依赖大量路径或密集参数化曲线,而过于紧凑的SVG生成可能会偏离输入几何。这个问题在局部位图证据不完美时更加明显,其中边界跟随重建可能会引入冗余锚点和碎片化结构。我们主张应在锚点放置层面解决这一权衡,因为贝塞尔曲线上的锚点定义局部路径结构,并强烈影响精度和可编辑性。我们提出AnchorFlow,一个可编辑的SVG重建框架,通过稀疏锚点场建模路径级锚点放置。给定从位图图像中提取的路径状前景组件,AnchorFlow为每个组件预测一个图像条件的稀疏锚点场,并将其解析为有序的贝塞尔路径。渲染引导的反馈随后纠正局部结构错误后再进行重新解析。恢复的路径随后被组装和优化为最终的SVG。在孤立路径和完整图像上的实验表明,AnchorFlow在精度和可编辑性之间实现了有利的权衡,显著降低了可编辑复杂度,同时保持竞争性的位图保真度。

英文摘要

Image-to-SVG reconstruction aims to produce vector graphics that are faithful to raster inputs and easy to edit. Existing methods face a structural trade-off in how vector structure is parameterized, including how many paths represent an image and how many anchor points define each path. High-fidelity methods often rely on many paths or densely parameterized curves, whereas overly compact SVG generation may deviate from the input geometry. This issue becomes more pronounced when local raster evidence is imperfect, where boundary-following reconstruction can introduce redundant anchors and fragmented structures. We argue that this trade-off should be addressed at the level of anchor placement, since anchors on Bezier curves define local path structure and strongly affect both accuracy and editability. We propose AnchorFlow, an editable SVG reconstruction framework that models path-level anchor placement with sparse anchor point fields. Given path-like foreground components extracted from a raster image, AnchorFlow predicts an image-conditioned sparse anchor field for each component and resolves it into an ordered Bezier path. Rendering-guided feedback then corrects local structural errors before re-resolution. The recovered paths are then assembled and optimized into the final SVG. Experiments on isolated paths and full images show that AnchorFlow achieves a favorable fidelity-editability trade-off, substantially reducing editable complexity while preserving competitive raster fidelity.

2605.19549 2026-05-20 cs.SE cs.LG

Provable Fairness Repair for Deep Neural Networks

深度神经网络的可证公平修复

Jianan Ma, Jingyi Wang, Qi Xuan, Zhen Wang

发表机构 * Hangzhou Dianzi University, China(杭州电子科技大学) Zhejiang University, China(浙江大学) Zhejiang University of Technology, China(浙江工业大学)

AI总结 本文提出ProF框架,通过区间界限传播技术,为深度神经网络提供可证的公平性修复,实现对偏见样本周围整个集合的公平性保障,并在多个基准数据集上验证了其有效性。

Comments 15 pages, 6 figures, 7 tables. full version of the paper accepted by ASE 2025

Journal ref Proceedings of the 40th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2025

详情
AI中文摘要

深度神经网络(DNNs)正面临诸如个体歧视等伦理问题。为此,已开发出大量NN修复技术来调整模型并减轻此类不良行为。然而,现有公平性修复方法通常是数据驱动的,往往缺乏可证保证和对未见过样本的泛化能力。为克服这些限制,我们提出了ProF,一种具有可证保证的新型公平性修复框架。ProF的核心思想是利用区间界限传播(一种广泛使用的神经网络验证技术)来在偏见样本x周围的整个集合S(x)上准确捕捉模型输出。所推导的界限用于指导公平性修复,促使模型在S(x)上产生一致的输出。具体而言,我们将公平性约束和模型修改整合到统一的约束求解公式中,该公式可转换为可由现成求解器解决的混合整数线性规划(MILP)问题。MILP问题的解有效地诱导出一个具有整体S(x)公平性保障的修复模型。我们在四个广泛使用的基准数据集上评估了ProF,并证明其实现了可证公平性修复,在完整数据集上的泛化能力高达95.93%,在整个输入空间上为93.16%。值得注意的是,ProF可以轻松配置以支持多种敏感属性和更实际的公平性定义,同时提供可证修复保证,并实现约90%的公平性提升。我们的代码可在https://github.com/nninjn/ProF上获得。

英文摘要

Deep neural networks (DNNs) are suffering from ethical issues such as individual discrimination. In response, extensive NN repair techniques have been developed to adjust models and mitigate such undesired behaviors. However, existing fairness repair methods are typically data-centric, which often lack provable guarantees and generalization to unseen samples. To overcome these limitations, we propose ProF, a novel fairness repair framework with provable guarantees. The key intuition of ProF is to leverage interval bound propagation (a widely used NN verification technique) to soundly capture model outputs over the whole set $S(\mathbf{x})$ around a biased sample $\mathbf{x}$. The derived bounds are utilized to guide fairness repair which encourages the model to produce consistent outputs on $S(\mathbf{x})$. Specifically, we integrate fairness constraints and model modifications into a unified constraint-solving formulation, which can be transformed to a Mixed-Integer Linear Programming (MILP) problem solvable by off-the-shelf solvers. The solution to the MILP problem effectively induces a repaired model with guaranteed fairness over the whole set $S(\mathbf{x})$. We evaluate ProF on four widely used benchmark datasets and demonstrate that it achieves provable fairness repair, with generalization of up to 95.93\% on full datasets and 93.16\% on the entire input space. Notably, ProF can be easily configured to support multiple sensitive attributes and more practical fairness definitions, while providing provable repair guarantees and delivering around 90\% fairness improvement. Our code is available at https://github.com/nninjn/ProF.

2605.19478 2026-05-20 cs.CR cs.CV

Exposing Functional Fusion: A New Class of Strategic Backdoor in Dynamic Prompt Architectures

揭示功能融合:动态提示架构中一种新的战略后门类别

Zeyao Liu, Zhendong Zhao, Xiaojun Chen, Xin Zhao, Yuexin Xuan, Xiaoshuang Ji

发表机构 * Institute of Information Engineering, Chinese Academy of Sciences(中国科学院信息工程研究所) State Key Laboratory of Cyberspace Security Defense(网络空间安全防御国家重点实验室) School of Cyber Security, University of Chinese Academy of Sciences(中国科学院大学网络安全学院) PetroChina (Beijing) Digital Intelligent Research Institute Co., Ltd.(中石油北京数字智能研究院有限公司)

AI总结 本文提出VIPER攻击框架,揭示动态提示架构中通过功能融合产生的新风险,该框架在轻量级动态视觉提示生成器上实现,展示了恶意逻辑与良性任务功能的紧密融合,从而在剪枝时破坏良性性能,同时保持高ASR和低延迟。

详情
AI中文摘要

现有的基于背骨重写全调优的ViT后门攻击在计算上昂贵且会降低性能。这迫使攻击者转向以适配器为基础(例如LoRA)和提示为基础(例如VPT)的视觉参数高效微调(PEFT)范式。尽管适配器安全已有一些初步研究,但快速增长的提示基础生态系统中的风险仍严重未被探索。我们填补了这个关键缺口,揭示了VPT向动态和上下文感知架构演进如何促成一种更加危险和新兴的威胁。这种漏洞即使在这些动态模块解锁了优越良性性能的情况下也会出现。我们提出了VIPER,一个基于轻量级动态视觉提示生成器(VPG)的攻击框架,展示了这种漏洞。关键的是,这种动态架构使功能融合成为可能:恶意逻辑和良性任务功能紧密融合到同一个稀疏、高幅度参数核心中。这种融合创造了一个严峻的“人质”困境,因为剪枝攻击必然破坏良性性能。全面评估显示VIPER有效解决了攻击者的三重困境:VIPER不仅在干净数据上实现了最先进的性能,而且在90% VPG模块剪枝(LoRA攻击崩溃)的情况下仍保持近100%的ASR,同时仅增加可察觉的0.06ms(1.16%)推理延迟。VIPER的结果,由功能融合驱动,揭示了动态提示架构中一种新的、范式级别的风险。

英文摘要

Existing ViT backdoor attacks based on backbone-overwriting full-tuning are computationally expensive and inflict performance degradation. This has forced adversaries towards the Visual Parameter-Efficient Fine-Tuning (PEFT) paradigm, dominated by adapter-based (e.g., LoRA) and prompt-based (e.g., VPT) approaches. While adapter security has seen initial study, the risks of the burgeoning prompt-based ecosystem remain critically unexplored. We fill this critical gap, exposing how the evolution of VPT towards dynamic and context-aware architectures can facilitate a far more dangerous and emergent threat. This vulnerability arises even though these dynamic modules unlock superior benign performance. We propose VIPER, an attack framework built on a lightweight, dynamic Visual Prompt Generator (VPG) that demonstrates this vulnerability. Critically, this dynamic architecture enables Functional Fusion: an emergent phenomenon where malicious logic and benign task utility are tightly fused into the same sparse, high-magnitude parameter core. This fusion creates a formidable ``hostage" dilemma, as pruning the attack necessarily destroys the benign performance. Comprehensive evaluations show VIPER effectively addresses the attacker's trilemma: VIPER not only achieves state-of-the-art performance on clean data, but also maintains near-100% ASR even under 90% VPG-module pruning (where LoRA attacks collapse), while adding only an imperceptible 0.06ms (1.16%) of inference latency. VIPER's results, driven by Functional Fusion, expose a new, paradigm-level risk in dynamic prompt architectures.

2605.19452 2026-05-20 cs.DC cs.AI

Resilient Byzantine Agreement with Predictions

具有预测功能的容错一致性协议

Julien Dallot, Darya Melnyk, Tijana Milentijevic, Stefan Schmid, Patrik Welters

发表机构 * TU Berlin(柏林技术大学) Weizenbaum Institute(魏泽恩堡研究所) HU Berlin(柏林洪堡大学)

AI总结 本文研究了在有预测器辅助下解决拜占庭共识问题,通过算法容错性和预测器准确度的权衡分析,提出在非认证和认证设置下容忍不同数量故障节点的算法及不可能性结果。

详情
AI中文摘要

本文研究了在有预测器辅助下解决拜占庭共识问题。我们关注算法的容错性——算法能容忍的最大故障节点数,并提出了其容错性依赖于预测器准确度的算法和不可能性结果。我们的第一个主要结果是对非认证和认证设置下的一致性-鲁棒性权衡进行了完整刻画:对于n个节点和参数α∈[0,1],当预测器正确时,算法可以容忍最多α·n个故障节点(一致性);当预测器任意错误时,可以容忍最多(1-α)/2·n -1个故障节点(鲁棒性);在认证设置下,鲁棒性界限提高到(1-α)·n -1。这些权衡是精确的,因为我们证明再多一个故障节点会使问题变得不可能。我们的第二个主要结果刻画了平滑度:预测器准确性降低时,容错性下降的速率。我们证明只要错误预测的数量保持在n的常数比例内,容错性会线性减少。具体而言,在非认证设置下,每个额外的错误预测会损失一个单位的容错性,而在认证设置下,由于需要两个错误预测才能损失一个单位的容错性,因此下降幅度减半。

英文摘要

This paper studies the Byzantine Agreement problem where the nodes have access to a predictor that flags nodes for suspicion of faulty (Byzantine) behavior. We focus on algorithmic resilience -- the maximum number of faulty nodes an algorithm can tolerate -- and present algorithms and impossibility results whose resilience depend on the accuracy of the predictor. As our first main result, we bring a complete characterization of the consistency--robustness trade-offs in both the non-authenticated and authenticated settings: for $n$ nodes and a parameter $α\in [0, 1]$, we present algorithms that tolerate up to $α\cdot n$ faulty nodes when the predictor is correct (consistency), and up to $\frac{1-α}{2} \cdot n - 1$ faulty nodes when the predictor is arbitrarily wrong (robustness); in the authenticated setting the robustness bound improves to $(1-α) \cdot n - 1$. These trade-offs are exactly tight as we show that one additional faulty node renders the problem impossible. Our second main result characterizes smoothness: the rate at which resilience degrades as the predictor becomes less accurate. We show that resilience linearly decreases in the number of wrong predictions as long as that number stays within a constant fraction of $n$. Concretely, in the non-authenticated setting each additional wrong prediction loses one unit of resilience, whereas in the authenticated setting the decline is halved since two wrong predictions are needed to lose one unit of resilience.

2605.19391 2026-05-20 stat.ML cs.LG

Tweedie's Formulae and Diffusion Generative Models Beyond Gaussian

Tweedie's公式与超越高斯的扩散生成模型

Wenpin Tang, Nizar Touzi, Zikun Zhang, Xun Yu Zhou

发表机构 * Department of Industrial Engineering and Operations Research, Columbia University(哥伦比亚大学工业工程与运筹学系) Department of Finance and Risk Engineering, New York University(纽约大学金融与风险工程系)

AI总结 本文扩展了Tweedie公式以适用于重要的非高斯过程,如几何布朗运动、平方贝塞尔过程和Cox-Ingersoll-Ross过程,并利用这些公式在图像和金融时间序列生成以及经验贝叶斯估计中应用非高斯扩散模型,展示了非高斯模型的潜力。

Comments 27 pages, 18 figures

详情
AI中文摘要

扩散模型在生成未知数据分布的样本方面取得了显著成功。大多数流行的基于随机微分方程的扩散模型通过向目标分布添加高斯噪声,将其转换为简单的先验分布,然后使用去噪分数匹配,这是Tweedie公式的结果,来学习分数函数并从噪声中生成干净的样本。然而,具有状态依赖扩散系数的非高斯扩散模型以及相应的Tweedie公式一直被忽视。在本文中,我们扩展了Tweedie公式以适用于重要的非高斯过程,包括几何布朗运动(GBM)、平方贝塞尔(BESQ)过程和Cox-Ingersoll-Ross(CIR)过程,从而得到相应的去噪分数匹配目标。然后,我们应用推导出的公式,使用基于GBM和CIR的扩散模型进行图像和金融时间序列生成,并在BESQ设置下进行经验贝叶斯估计。报告的实验结果展示了非高斯模型的潜力。

英文摘要

Diffusion models have achieved remarkable success in generating samples from unknown data distributions. Most popular stochastic differential equation-based diffusion models perturb the target distribution by adding Gaussian noise, transforming it into a simple prior, and then use denoising score matching, a consequence of Tweedie's formula, to learn the score function and generate clean samples from noise. However, non-Gaussian diffusion models with state-dependent diffusion coefficient have been largely underexplored, as have the corresponding Tweedie's formulae. In this work, we extend Tweedie's formula to important non-Gaussian processes, including geometric Brownian motion (GBM), squared Bessel (BESQ) processes, and Cox-Ingersoll-Ross (CIR) processes, thereby yielding the corresponding denoising score-matching objectives. We then apply the derived formulae to image and financial time series generation using GBM- and CIR-based diffusion models, and to empirical Bayes estimation under the BESQ setting. The reported experimental results demonstrate the potential of non-Gaussian models.

2605.19373 2026-05-20 cs.DC cs.AI cs.LG

Conflict-Free Replicated Data Types for Neural Network Model Merging: A Two-Layer Architecture Enabling CRDT-Compliant Model Merging Across 26 Strategies

用于神经网络模型融合的无冲突复制数据类型:一种双层架构,使26种策略兼容CRDT模型融合

Ryan Gillespie

发表机构 * Independent researcher(独立研究者)

AI总结 本文提出了一种双层架构CRDTMergeState,通过将任何融合策略封装在CRDT兼容层中,解决了26种神经网络融合策略在分布式操作中无法满足交换律、结合律和幂等律的结构性问题,实现了强最终一致性。

详情
AI中文摘要

我们测试的所有26种神经网络融合策略,包括加权平均、SLERP、TIES、DARE、Fisher融合和进化方法,均无法满足用于无冲突分布式操作所需的代数属性(交换性、结合性和幂等性)。我们证明这种失败是结构性的:基于规范化的方法无法同时满足这三个属性。为了解决这个问题,我们提出了一种双层架构——CRDTMergeState,它将任何融合策略封装在CRDT兼容(无冲突复制数据类型)层中。第一层通过OR-Set CRDT语义管理贡献,其中融合操作是集合并集——这显然具有交换性、结合性和幂等性。第二层将融合策略作为确定性纯函数应用于一个规范有序的贡献集上,随机性从Merkle根中播种。我们证明这种分离保证了强最终一致性:所有接收相同贡献的副本计算出相同的融合模型,无论消息顺序如何。实证验证涵盖三个层次:受控的4x4张量(104/104测试通过)、生产规模的模型(最高7.24B参数,208种策略级测试,43,368种层级属性检查在受限张量分辨率下)以及多节点收敛在 gossip 和分区修复(100个节点,20种顺序)中,CRDT开销低于0.5毫秒。由于封装器是透明的,下游性能由构造保证,通过字节相同输出验证确认。参考实现可用作crdt-merge v0.9.4。

英文摘要

All 26 neural network merge strategies we tested including weight averaging, SLERP, TIES, DARE, Fisher merging, and evolutionary approaches -- fail the algebraic properties (commutativity, associativity, idempotency) required for conflict-free distributed operation. We prove that this failure is structural: normalisation-based merges cannot simultaneously satisfy all three properties. To resolve this, we present a two-layer architecture -- CRDTMergeState -- that wraps any merge strategy in a CRDT-compliant (Conflict-Free Replicated Data Type) layer. Layer 1 manages contributions via OR-Set CRDT semantics, where the merge operation is set union -- trivially commutative, associative, and idempotent. Layer 2 applies merge strategies as deterministic pure functions over a canonically-ordered contribution set, with randomness seeded from the Merkle root. We prove that this separation guarantees Strong Eventual Consistency: all replicas receiving the same contributions compute identical merged models, regardless of message ordering. Empirical validation spans three tiers: controlled 4x4 tensors (104/104 tests pass), production-scale models up to 7.24B parameters (208 strategy-level tests, 43,368 layer-level property checks at capped tensor resolution), and multi-node convergence under gossip and partition healing (100 nodes, 20 orderings), with CRDT overhead below 0.5 ms. Because the wrapper is transparent, downstream performance is identical by construction, confirmed via byte-identical output verification. The reference implementation is available as crdt-merge v0.9.4.

2605.19355 2026-05-20 cs.GR cs.AI cs.CV cs.LG

Skinned Motion Retargeting with Spatially Adaptive Interaction Guidance

具有空间自适应交互引导的皮肤运动重定向

Soojin Choi, Seokhyeon Hong, Chaelin Kim, Junghyun Nam, Junhyuk Jeon, Junyong Noh

发表机构 * Visual Media Lab(视觉媒体实验室) KAIST(韩国科学技术院)

AI总结 本文提出了一种几何感知的运动重定向框架,通过在空间自适应锚点上进行接近匹配,保留交互语义,以解决在不同身体形状角色之间重定向运动时保持交互语义(如自接触和近身体接近)的挑战。

Comments SIGGRAPH 2026 / ACM TOG. Project page available at https://suzyn.github.io/space_page/

详情
AI中文摘要

在不同身体形状的角色之间进行运动重定向,同时保持交互语义,如自接触和近身体接近,仍是一个具有挑战性的问题。尽管最近的几何感知方法通过维持预定义对应区域之间的空间关系来解决这一问题,但它们对静态对应关系的依赖在目标角色表现出夸张的身体比例时往往遇到困难。在本文中,我们提出了一种几何感知的运动重定向框架,通过在空间自适应锚点上进行接近匹配来保留交互语义。与以往具有静态锚点定义的方法不同,所提出的方法动态地将锚点重新定位到目标角色上可到达的区域。这通过基于Transformer的锚点细化策略实现,该策略预测锚点位移,并通过可微的软投影将转换后的锚点限制在目标角色的几何结构上。通过结合源角色的姿势依赖空间结构,适应的锚点为交互感知的重定向提供结构上连贯的指导。在这些锚点的条件下,基于图的自编码器预测目标骨骼运动,以保持源的空问配置。为了鼓励锚点适应和运动重定向之间的任务对齐优化,我们采用交替训练方案,其中每个模块依次优化。通过广泛的评估,我们证明了我们的方法在保持交互保真度方面优于最先进的方法,适用于多样化的角色几何结构。

英文摘要

Retargeting motion across characters with varying body shapes while preserving interaction semantics, such as self-contact and near-body proximity, remains a challenging problem. While recent geometry-aware approaches address this by maintaining spatial relationships between predefined corresponding regions, their reliance on static correspondences often struggles when the target character exhibits exaggerated body proportions. In this paper, we present a geometry-aware motion retargeting framework that preserves interaction semantics by performing proximity matching over spatially adaptive anchors. Unlike prior methods with static anchor definitions, the proposed method dynamically repositions anchors to reachable regions on the target character. This is achieved via a Transformer-based anchor refinement strategy that predicts anchor displacements and constrains the translated anchors to remain on the target character geometry through differentiable soft projection. By incorporating pose-dependent spatial structures from the source character, the adapted anchors provide structurally coherent guidance for interaction-aware retargeting. Conditioned on these anchors, a graph-based autoencoder predicts target skeletal motion that preserves the spatial configuration of the source. To encourage task-aligned optimization between anchor adaptation and motion retargeting, we adopt an alternating training scheme in which each module is optimized in turn. Through extensive evaluations, we demonstrate that our method outperforms state-of-the-art approaches in preserving interaction fidelity across diverse character geometries.

2605.19352 2026-05-20 q-bio.NC cs.AI cs.LG

Brain alignment of reasoning and action representations from vision-language and action models during naturalistic gameplay

在自然主义游戏过程中,视觉语言和动作模型的推理与动作表示的脑部对齐

Subba Reddy Oota, Anant Khandelwal, Khushbu Pahwa, Satya Sai Srinath Namburi, Tanmoy Chakraborty, Bapi S. Raju, Manish Gupta

发表机构 * Independent(独立) Microsoft Research(微软研究院) AWS AI Labs(AWS人工智能实验室) GE HealthCare(通用电气医疗) IIT Delhi(德里理工学院) IIIT-Hyderabad(海得拉巴理工学院) Microsoft(微软)

AI总结 本文研究了在自然主义游戏过程中,视觉语言模型和大动作模型的推理与动作表示在脑部活动中的对齐情况,发现动作聚焦和推理聚焦的提示影响模型内部表示与fMRI脑活动的对齐程度。

Comments 21 pages, 11 figures

详情
AI中文摘要

理解人类和人工智能系统如何通过与环境互动来预测和规划是一个在神经科学和机器学习交汇处的基本挑战。大多数脑编码研究集中在将人工模型与大脑活动对齐,特别是在语言理解和被动视觉处理期间,而交互式脑对齐研究迄今为止大多局限于强化学习(RL)代理和理论模型。为了解决这一差距,我们使用fMRI记录参与者玩自然主义的Atari风格视频游戏,研究了来自两个基础模型家族(即视觉语言模型(VLMs)和大动作模型(LAMs))的代表性模型的脑部对齐情况。具体而言,我们研究了动作聚焦和推理聚焦的提示如何影响模型的内部表示并与其fMRI脑活动对齐。首先,我们发现VLMs和LAMs在每个体素编码性能上显著优于RL基线,即使在匹配的特征维度下,优势依然存在。其次,提示驱动的增益与皮层处理层次结构成比例:最大的改进出现在前额叶和运动规划区域,而早期视觉皮层的增益大约只有后者的二分之一。第三,方差分区揭示了不同的表征组织:VLM是提示对称的(12.5%独特的动作vs.13.6%独特的推理),而LAM是提示不对称的(27%独特的动作vs.-5%独特的推理),不对称性在前额运动皮层最强。总的来说,这些结果表明,即使在全脑预测准确性在统计上相等的情况下,动作专门化的微调也会将多模态表示重新组织到与动作相关的神经计算中。

英文摘要

Understanding how humans and artificial intelligence systems predict and plan by interacting with their environment is a fundamental challenge at the intersection of neuroscience and machine learning. Most brain-encoding studies focus on aligning artificial models with brain activity during language comprehension or passive visual processing, while interactive brain-alignment studies have to date been largely limited to reinforcement-learning (RL) agents and theory-based models. To address this gap, we study brain alignment of representative models from two foundation-model families, namely vision-language models (VLMs) and large-action models (LAMs), using fMRI recordings from participants playing naturalistic Atari-style video games. Specifically, we examine how action-focused and reasoning-focused prompts shape model's internal representations and align with fMRI brain activity. First, we find that both VLMs and LAMs exhibit significantly exhibit voxel-wise encoding performance than RL baselines, with the advantage holding even under matched feature dimensionality. Second, prompt-driven gains scale with the cortical processing hierarchy: the largest improvements appear in frontal-parietal and motor-planning regions, while early visual cortex gains roughly half as much. Third, variance partitioning reveals a qualitatively different representational organization: VLM is prompt-symmetric (12.5% unique action vs. 13.6% unique reasoning), whereas LAM is prompt-asymmetric (27% unique action vs. -5% unique reasoning), with the asymmetry strongest in frontal-motor cortex. Together, these results demonstrate that action-specialized fine-tuning reorganizes multimodal representations toward action-relevant neural computations even when whole-brain prediction accuracy is statistically equivalent between VLM and LAM.

2605.19351 2026-05-20 cs.MA cs.AI cs.CL

PAVE: A Cognitive Architecture for Legitimate Violation in Generative Agent Societies

PAVE:生成代理社会中的合法违规认知架构

Ahmad Yehia, Abduallah Mohamed, Kun Qian, Tianyi Wang, Jiseop Byeon, Omar Hassanin, Christian Claudel

发表机构 * The University of Texas at Austin(德克萨斯大学奥斯汀分校) Meta Reality Labs(Meta现实实验室) University of Calgary(卡尔加里大学)

AI总结 本文提出PAVE认知架构,通过四个模块处理生成代理在需要违规的场景中的推理问题,实现了合法违规、对权威的服从、有限的范围和恢复四个特性,同时提高了决策的结构化和可解释性。

Comments Preprint. 23 pages, 4 figures. Code and environment will be released upon publication

详情
AI中文摘要

基于大语言模型的生成代理在合作环境中能够产生可信的人类行为,但在需要违规的场景中,如火灾疏散或受监督的紧急情况,如何推理仍不明确。我们提出PAVE(感知、评估、裁决、模拟),一种新的四模块认知架构,旨在解决这一差距:(i)感知提取一个结构化的上下文,包括明确的权威距离、同伴行为和严重标记的情境线索;(ii)评估在五个标量上评分上下文,包括一个明确的合法性判断,检查必要性、比例性和无替代方案;(iii)裁决在硬合法性门下决定服从或违规,每个代理的阈值从角色中提取;(iv)模拟执行裁决并限制违规到触发所证明的规则。我们将在Voville中实例化PAVE,这是一个从Smallville衍生的基于瓷砖的交通环境,并在三个场景、四个LLM后端和一个聚焦的消融中进行评估。PAVE代理同时满足四个属性:合法违规(只有当触发证明时)、权威服从(军官指令即使高合法性也优先)、有限范围(违规限制在目标规则内)和恢复(触发结束时恢复基准)。PAVE代理在所有四个属性上比vanilla更结构化和可解释,人类评估者认为它们更合理。消融合法性门会重现vanilla-like的失败。我们发布了Voville、PAVE提示和代码以及评估流程。

英文摘要

Generative agents based on large language models reproduce believable human behavior in cooperative settings, but how they should reason in situations where rule-breaking may be required, such as fire evacuation or authority-supervised emergency, remains poorly characterized. We propose PAVE (Perception, Assessment, Verdict, Emulation), a novel four-module cognitive architecture that addresses this gap end to end: (i) Perception extracts a structured context with explicit authority distance, peer behaviors, and severity-tagged situational cues; (ii) Assessment scores the context along five scalars including an explicit legitimacy judgment that checks necessity, proportionality, and absence of alternatives; (iii) Verdict decides to comply or violate under a hard legitimacy gate, with a per-agent threshold elicited from the persona; (iv) Emulation enacts the verdict and scopes the violation to the rule the trigger justifies. We instantiate PAVE in Voville, a tile-based traffic environment forked from Smallville, and evaluate across three scenarios, four LLM backbones, and a focused ablation. PAVE agents satisfy four properties simultaneously: legitimate violation (only when a trigger justifies it), authority deference (officer instructions override even high legitimacy), bounded scope (violations confined to the targeted rule), and recovery (baseline restored once the trigger ends). PAVE agents make more structured and interpretable decisions than vanilla across all four properties, and human evaluators rate them as more plausible. Ablating the legitimacy gate reproduces vanilla-like failures. We release Voville, the PAVE prompts and code, and the evaluation pipeline.

2605.19350 2026-05-20 cs.GR cs.LG

CompoSE: Compositional Synthesis and Editing of 3D Shapes via Part-Aware Control

CompoSE:通过部分感知控制进行3D形状的组合合成与编辑

Habib Slim, Shariq Farooq Bhat, Mohamed Elhoseiny, Yifan Wang, Mike Roberts

发表机构 * King Abdullah University of Science and Technology (KAUST)(卡布斯大学) Adobe Research(Adobe研究)

AI总结 本文提出CompoSE方法,通过部分感知控制实现3D形状的组合合成与编辑,核心方法是使用扩散变压器架构在局部和全局之间交替处理部分,并通过新颖的条件技术确保对用户输入的强遵循,主要贡献是无需部分级文本提示即可直接从用户粗略布局指导中学习部分语义和对称性。

详情
AI中文摘要

创建和编辑高质量3D内容仍然是计算机图形学中的核心挑战。我们通过引入CompoSE,一种新颖的方法,通过部分感知控制进行3D形状的组合合成与编辑来解决这一挑战。我们的方法以一组粗略的几何基础原始体(例如,包围盒)作为输入,这些原始体代表不同的物体部分并以特定的空间配置排列,输出部分分离的3D对象,支持对单个部分的局部细粒度(即组合式)编辑。使方法可行的关键见解是使用扩散变压器架构,该架构在局部处理每个部分和跨部分全局聚合上下文信息之间交替,并具有新颖的条件技术,确保对用户输入的强遵循。重要的是,我们的方法学会直接从用户粗略布局指导中推断部分语义和对称性,并不需要部分级文本提示。我们证明我们的方法能够实现强大的部分级编辑能力,包括上下文感知的替换、添加、删除和风格保持的缩放操作。通过广泛的实验,我们显示我们的方法在引导合成方面显著优于现有方法,这通过客观指标和基于LLM的评估来衡量。

英文摘要

Creating and editing high-quality 3D content remains a central challenge in computer graphics. We address this challenge by introducing CompoSE, a novel method for Compositional Synthesis and Editing of 3D shapes via part-aware control. Our method takes as input a set of coarse geometric primitives (e.g., bounding boxes) that represent distinct object parts arranged in a particular spatial configuration, and synthesizes as output part-separated 3D objects that support localized granular (i.e., compositional) editing of individual parts. The key insight that enables our method is our use of a diffusion transformer architecture that alternates between processing each part locally and aggregating contextual information across parts globally, and features a novel conditioning technique that ensures strong adherence to the user's input. Importantly, our method learns to infer part semantics and symmetries directly from the user's coarse layout guidance, and does not require part-level text prompts. We demonstrate that our method enables powerful part-level editing capabilities, including context-aware substitution, addition, deletion, and style-preserving resizing operations. We show through extensive experiments that our method significantly outperforms existing approaches on guided synthesis, as measured by objective metrics and LLM-based evaluations.

2605.19338 2026-05-20 cs.MA cs.AI cs.CL

STAR-PólyaMath: Multi-Agent Reasoning under Persistent Meta-Strategic Supervision

STAR-PólyaMath: 多智能体在持久元策略监督下的推理

Jiaao Wu, Xian Zhang, Hanzhang Liu, Sophia Zhang, Fan Yang, Yinpeng Dong

发表机构 * Tsinghua University(清华大学) Microsoft Research(微软研究院) New York University(纽约大学) MIT(麻省理工学院)

AI总结 本文提出STAR-PólyaMath多智能体框架,通过元级监督和结构化的推理-验证交互系统性解决数学推理中的幻觉积累、记忆碎片化和推理工具平衡问题,并在多个顶级竞赛基准上取得最佳成绩。

Comments 25 pages, 4 figures. Code: https://github.com/Julius-Woo/STAR-PolyaMath

详情
AI中文摘要

前沿AI模型和多智能体系统在数学推理方面取得了显著进步。然而,对于需要扩展、长周期推理的问题,现有系统仍然存在根本性可靠性问题:幻觉积累、记忆碎片化以及推理工具之间的不平衡。在本文中,我们引入了STAR-PólyaMath,一个通过元级监督和结构化的推理-验证交互来系统性解决这些挑战的多智能体框架。STAR-PólyaMath被构造成一个由Python orchestrator控制的协同状态机,包含嵌套的挑战-步骤-重计划循环,该orchestrator通过分离控制与推理并利用回溯和重计划来限制误差传播。我们的关键创新是一个持续的元策略师,它通过发布高层战略指导或强制指令来维护跨尝试的记忆并执行元级控制,使系统能够逃离无生产力的循环,而不是停滞或过度依赖工具。STAR-PólyaMath在所有八个顶级竞赛基准上取得了最先进的结果:AIME 2025-2026、MathArena Apex Shortlist、MathArena Apex 2025、Putnam 2025、IMO 2025、HMMT February 2026和USAMO 2026。它在AIME、Putnam和HMMT上获得满分,并在Apex 2025上表现出最大的优势,得分93.75%相比最强基线GPT-5.5的80.21%。消融研究显示,收益来自框架的协调而非模型级多样性,因为移除关键组件或替换为混合backbone会一致削弱性能。代码可在https://github.com/Julius-Woo/STAR-PolyaMath获取。

英文摘要

Frontier AI models and multi-agent systems have led to significant improvements in mathematical reasoning. However, for problems requiring extended, long-horizon reasoning, existing systems continue to suffer from fundamental reliability issues: hallucination accumulation, memory fragmentation, and imbalanced reasoning-tool trade-offs. In this paper, we introduce STAR-PólyaMath, a multi-agent framework that systematically addresses these challenges through meta-level supervision and structured Reasoner-Verifier interaction. STAR-PólyaMath is structured as an orchestrated state machine with nested challenge-step-replan loops, governed by a reasoning-free Python orchestrator that separates control from inference and bounds error propagation through trace-back and re-planning. Our key innovation is a persistent Meta-Strategist that maintains cross-attempt memory and exercises meta-level control by issuing high-level strategic guidance or mandatory directives, so the system can escape unproductive loops rather than stagnate or over-rely on tools. STAR-PólyaMath achieves state-of-the-art results on all eight top-tier competition benchmarks: AIME 2025-2026, MathArena Apex Shortlist, MathArena Apex 2025, Putnam 2025, IMO 2025, HMMT February 2026, and USAMO 2026. It obtains perfect scores on AIMEs, Putnam, and HMMT, and shows its largest margin on Apex 2025, scoring 93.75% compared with 80.21% by the strongest baseline GPT-5.5. Ablation studies show that the gains arise from the framework's orchestration rather than from model-level diversity since removing key components or substituting in mixed backbones consistently weakens performance. Code is available at https://github.com/Julius-Woo/STAR-PolyaMath.

2605.19328 2026-05-20 cs.CR cs.RO

RoboJailBench: Benchmarking Adversarial Attacks and Defenses in Embodied Robotic Agents

RoboJailBench: 对具身体验机器人代理中对抗攻击和防御的基准测试

Doguhuan Yeke, Yanming Zhou, Leo Y. Lin, Hongyu Cai, Antonio Bianchi, Z. Berkay Celik

发表机构 * Purdue University(普渡大学)

AI总结 本文提出RoboJailBench,通过建立安全分类学、引入意图对比数据集管道以及提供一个演进的存储库,为具身体验人工智能中的对抗攻击和防御提供了标准化评估框架,同时构建了一个新的分类平衡数据集并增强了五个现有数据集。

详情
AI中文摘要

最近在视觉-语言模型(VLMs)上的进展促进了新的具身体验人工智能系统类别,其中这些模型被集成到物理平台中,例如机器人和自动驾驶车辆,以在多样环境中解释视觉场景并执行自然语言命令。先前的研究已经引入了针对具身体验人工智能的劫持攻击和防御。然而,其评估却依赖于随意的数据集、有限的指标,并强调攻击成功率,而忽略了安全性和执行良性命令能力之间的权衡。现有的基准和评估框架要么针对传统的聊天式模型,要么专注于非对抗性安全评估;既没有捕捉到具身体验人工智能系统中劫持攻击所需的输入、后果和评估标准。在本文中,我们通过RoboJailBench填补这一空白,其包含三个核心组件。我们基于ISO标准、监管规则和记录的事件建立了安全分类学,这一努力产生了18类具身体验人工智能的安全违规后果。我们引入了一个意图对比数据集管道,通过配对对抗性和良性目标来增强现有数据集,以衡量安全性和实用性。最后,我们提供了一个演进的存储库,包含标准化指标和统一的评估和整合新攻击和防御的流程。通过这个基准,我们构建了一个新的分类平衡数据集并增强了五个现有数据集。我们整合了四种攻击和两种防御,以在领先的具身体验VLMs上评估其性能。这个基准为具身体验人工智能中的劫持攻击提供了第一个标准化评估框架,并支持未来研究。我们发布了我们的代码、数据集和成果,并在https://purseclab.github.io/benchmark-for-robotics-security维护了一个排行榜。

英文摘要

Recent advances in Vision-Language Models (VLMs) facilitate a new class of embodied AI systems, where these models are integrated into physical platforms, e.g. robots and autonomous vehicles, to interpret visual scenes and execute natural language commands in diverse environments. Previous research has introduced jailbreak attacks and defenses for embodied AI. Their evaluations, however, rely on ad-hoc datasets, limited metrics, and emphasize attack success while neglecting the trade-off between security and the ability to follow benign commands. Existing benchmarks and evaluation frameworks either target traditional chat-based models or focus on non-adversarial safety evaluation for embodied AI; neither captures the adversarial risks, inputs, consequences, and evaluation criteria necessary for jailbreak attacks in embodied AI systems. In this paper, we address this gap with RoboJailBench, which consists of three core components. We establish a security taxonomy derived from ISO standards, regulatory rules, and documented incidents. This effort yields 18 categories of security violation consequences for embodied AI. We introduce an intent contrast dataset pipeline that augments existing datasets with paired adversarial and benign goals to measure both security and utility. Lastly, we provide an evolving repository with standardized metrics and a unified process for assessing and integrating new attacks and defenses. With this benchmark, we construct a new taxonomy-balanced dataset and augment five existing datasets. We integrate four attacks and two defenses to evaluate their performance on leading embodied VLMs. This benchmark provides the first standardized evaluation framework for jailbreak attacks in embodied AI and supports future research. We release our code, datasets, and artifacts, and maintain a leaderboard at https://purseclab.github.io/benchmark-for-robotics-security.

2605.19321 2026-05-20 cs.CR cs.AI

Exploring and Developing a Pre-Model Safeguard with Draft Models

探索和开发预模型安全防护机制

Hongyu Cai, Arjun Arunasalam, Yiming Liang, Antonio Bianchi, Z. Berkay Celik

发表机构 * Purdue University(普渡大学) Florida International University(佛罗里达国际大学)

AI总结 本文研究了如何通过利用 jailbreak 攻击的可转移性,在目标模型推理前确保提示的安全性,提出了一种新的安全防护设计,减少了预模型防护的误报率,并提供了一种低开销的替代方案。

Journal ref ACM Conference on AI and Agentic Systems (ACM CAIS 2026)

详情
AI中文摘要

Large Language Model (LLM) 对齐仍然容易受到 jailbreak 攻击的影响,这些攻击会引发不安全的响应,推动了预模型和后模型防护的发展。预模型防护在调用目标模型前审计提示的安全性。然而,仅依赖提示往往导致高误报率(即 jailbreak 攻击未被检测到)。后模型防护通过审计用户提示和目标模型的响应来解决这个问题,但它们会带来较高的计算成本,包括增加的 token 使用和处理时间,因为它们在目标模型推理之后运行。在本文中,我们介绍了一种安全防护设计,利用 jailbreak 攻击的可转移性来在目标模型推理前强制提示的安全性。我们首先对 jailbreak 可转移性进行了系统研究,特别是从 LLM 到小型语言模型 (SLM) 的转移。通过这些实验,我们识别了影响可转移性的关键因素。基于这些见解,我们观察到较小的草稿模型的响应反映了大型目标模型的安全性影响;即给定一个为 LLM 构建的 jailbreak 提示,SLM 很可能被触发以生成不一致的响应。基于这一观察,我们的安全防护设计利用 SLM 进行推测推理生成一组草稿响应。然后,它将原始提示和这些草稿输入现有的防护措施以预测其安全性。我们证明这种设计减少了预模型防护的误报率,并提供了一种低效率的替代方案给后模型防护。注意:本文包含有害语言的例子。

英文摘要

Large Language Model (LLM) alignment remains vulnerable to jailbreak attacks that elicit unsafe responses, motivating pre-model and post-model guards. Pre-model guards audit the safety of prompts before invoking target models. However, relying solely on the prompt often leads to high false-negative rates (i.e., jailbreak attacks go undetected). Post-model guards address this issue by auditing both the user prompt and the target model's response. However, they incur a high computational cost, including increased token usage and processing time, because they operate after target model inference. In this paper, we introduce a safeguard design that leverages the transferability of jailbreak attacks to enforce prompt safety before target model inference. We first conduct a systematic study of jailbreak transferability, particularly from LLMs to small language models (SLMs). Through these experiments, we identify key factors influencing transferability. Building on these insights, we observe that responses from smaller draft models reflect the safety implications of those from large target models; \ie given a jailbreak prompt constructed for an LLM, an SLM is likely to be triggered to generate an unaligned response. Based on this observation, our safeguard design leverages speculative inference with SLMs to generate a set of draft responses. It then feeds the original prompt and these drafts into existing guards to predict their safety. We demonstrate that this design reduces the false-negative rate of pre-model guards and offers a low \Efficiency alternative to post-model guards. \textcolor{red}{\bf Notice: This paper contains examples of harmful language.}

2605.19313 2026-05-20 stat.ML cs.LG stat.ME

A Unified Framework for Structure-Aware Clustering and Heterogeneous Causal Graph Learning

一种用于结构感知聚类和异质因果图学习的统一框架

Honglin Du, Muxuan Liang, Xiang Zhong

发表机构 * Department of Industrial and Systems Engineering, University of Florida(佛罗里达大学工业与系统工程系) Department of Biostatistics, MD Anderson Cancer Center(MD安德森癌症中心生物统计学系)

AI总结 本文提出了一种基于有向无环图的依赖聚类方法,通过交替方向乘子法解决结构异质性问题,实现对子群体依赖结构的鲁棒发现。

详情
AI中文摘要

在复杂的多变量系统中,变量间的相互作用由依赖结构定义,通常编码为有向无环图(DAGs)。然而,依赖结构可能在不同个体间变化,忽略这种结构异质性会引入偏差并掩盖子群体特定的依赖关系。为此,我们提出了一种基于有向无环图的依赖聚类方法,通过交替方向乘子法(ADMM)解决结构异质性问题,构建在结构方程模型(SEM)之上,联合学习聚类分配和子群体特定的依赖结构。我们通过平滑约束编码无环性,并整合一个组内截断Lasso融合惩罚(gTLP)以根据结构相似性聚类个体。这产生了一个非凸优化问题,结合稀疏性、无环性和结构一致性约束。我们通过增广拉格朗日方法解决非凸性,并使用适应的交替方向乘子法(ADMM)求解差分凸程序。对于某些图结构,如上三角邻接矩阵,我们的算法保证能收敛到KKT点。实验表明,我们的方法能够以高真阳性率和低假发现率恢复子群体特定的因果依赖结构。这种能力使我们能够在子群体标签未知的情况下,鲁棒地发现跨个体的异质依赖关系。

英文摘要

In complex multivariate systems, interactions among variables are defined by dependency structures, often encoded as directed acyclic graphs ($\text{DAGs}$). However, dependency structures can vary across subjects, and ignoring this structural heterogeneity introduces bias and obscures subpopulation-specific dependencies. To address this, we propose Directed Acyclic Graph-based Dependency Clustering via Alternating Direction Method of Multipliers (DAG-DC-ADMM), a unified framework built upon Structural Equation Modeling (SEM) that jointly learns cluster assignments and cluster-specific dependency structures. We encode acyclicity via a smooth constraint and integrate a groupwise truncated Lasso fusion penalty (gTLP) to cluster subjects based on their structural similarity. This yields a nonconvex optimization problem that incorporates sparsity, acyclicity, and structural consensus constraints. We address the nonconvexity by using the augmented Lagrangian method and solve it with an adapted version of the Alternating Direction Method of Multipliers (ADMM) for difference-of-convex programs. For certain graph structures, such as upper triangular adjacency matrices, our algorithm is guaranteed to converge to a Karush-Kuhn-Tucker (KKT) point. Experiments demonstrate that our method recovers cluster-specific causal dependency structures with a high true positive rate and a low false discovery rate. This capability enables the robust discovery of heterogeneous dependencies across subjects where the subpopulation label is unknown.

2605.19305 2026-05-20 cs.GR cs.CV cs.LG

Matérn Noise for Triangulation-Agnostic Flow Matching on Meshes

Matérn噪声用于三角化无关的网格上流匹配

Tianshu Kuai, Arman Maesumi, Daniel Ritchie, Noam Aigerman

发表机构 * Université de Montréal & Mila(蒙特利尔大学及Mila) Brown University(布朗大学)

AI总结 本文提出了一种三角化无关的流匹配方法,通过Matérn过程生成网格信号,实现高效且高质量的网格生成。

Comments In ACM Transactions on Graphics (SIGGRAPH 2026). Project page: https://matern-fm.github.io/

详情
AI中文摘要

本文针对在三角网格上学习生成信号的任务,提出了三角化无关的流匹配方法。理论部分提出了一种三角化无关的噪声分布,用于流匹配模型的去噪过程。通过数学定义了分布的三角化无关性,证明了Matérn过程的离散化具有所需性质,并提供了一种高效的采样算法。使用该噪声模型,并结合PoissonNet作为去噪器,实现了三角化无关的流匹配。实验显示,该方法在超过一百万三角形的网格上能够生成高质量和多样化的结果,显著超越了现有最佳水平。

英文摘要

This paper tackles the task of learning to generate signals over triangle meshes in a triangulation-agnostic manner, meaning the trained model can be applied to different meshes and triangulations effectively. Practically, the paper adapts the flow matching (FM) paradigm to a mesh-based, triangulation-agnostic setting. Theoretically, it proposes a specific noise distribution which is triangulation agnostic, to be used inside the FM model's denoising process. While noise distributions are usually trivial to devise for, e.g., images, devising a triangulation-agnostic distribution proves to be a much more difficult task. We formulate a mathematical definition of triangulation agnosticism of distributions, via their spectrum. We then show that a discretization of a specific Gaussian random field called a Matérn process holds these desired properties, and provides a simple and efficient sampling algorithm. We use it as our noise model, and adapt FM to the triangulation-agnostic setting by using a state-of-the-art approach for learning signals on meshes in the gradient domain -- PoissonNet -- as the denoiser. We conduct experiments on elaborate tasks such as sampling elastic rest states, and generating poses of humanoids. Our method is shown to be capable of producing highly realistic results for meshes of over one million triangles, significantly exceeding the state-of-the-art in quality and diversity.

2605.19293 2026-05-20 cs.IT cs.LG cs.RO math.IT

Domain-Adaptive Communication-Rate Optimization for Sim-to-Real Humanoid-Robot Wireless XR Teleoperation

领域自适应的通信速率优化用于仿真到现实的人形机器人无线XR远程操作

Caolu Xu, Zhiyong Chen, Meixia Tao, Li Song, Feng Yang, Wenjun Zhang

发表机构 * Cooperative Medianet Innovation Center(协作中位网创新中心) School of Information Science and Electronic Engineering(信息科学与电子工程学院) Shanghai Jiao Tong University(上海交通大学)

AI总结 本文提出了一种领域自适应的通信速率优化方法,通过在仿真到现实的分布偏移中平衡重建误差和通信能耗,利用PAC-Bayes泛化特性分析和密度比加权的PPO方法,结合离线真实域数据校正,以提高人形机器人无线XR远程操作的通信效率和重建精度。

Comments submitted to IEEE journal

详情
AI中文摘要

无线扩展现实(XR)远程操作为收集人形机器人演示提供了具身交互能力,但大规模应用受到高频运动传输开销的限制。本文开发了一个系统框架,集成了采样、传输、插值和重建,并制定了通信速率优化,旨在通过维度采样率控制最小化通信能耗,同时保持机器人运动轨迹的重建精度。由于从物理机器人获取实时反馈受限于硬件成本,必须通过与离线真实域数据校正的仿真交互来解决问题。为了指导仿真到现实的适应,我们提供了一种PAC-Bayes泛化特性刻画,揭示了潜在密度比估计、有限样本偏差和编码器偏差的影响。基于此分析,我们提出了一种具有密度比加权和信任区域正则化的近端策略优化(PPO)方法。在公共人形远程操作数据集上的实验表明,所提出的方法在仿真到现实分布偏移中改善了重建误差和通信能耗之间的权衡。我们进一步分析了所提出算法在各种无线信道和动态运动轨迹中的有效性。

英文摘要

Wireless extended reality (XR) teleoperation provides embodied interaction capability for collecting humanoid robot demonstrations, but the large-scale adoption is restricted by the overhead of high-frequency motion transmission. This paper develops a system framework that integrates sampling, transmission, interpolation, and reconstruction and formulates a communication-rate optimization that aims to minimize the communication energy while maintaining the reconstruction accuracy of robot motion trajectories through dimension-wise sampling-rate control. Since acquiring real-time feedback from physical robots is limited by hardware costs, it is necessary to solve the problem through simulator interaction with offline real-domain data correction. To guide sim-to-real adaptation, we provide a PAC-Bayes generalization characterization that reveals the effects of latent density-ratio estimation, finite-sample deviation, and encoder bias. Building on this analysis, we propose a proximal policy optimization (PPO) method with density-ratio weighting and trust-region regularization. Experiments on public humanoid teleoperation dataset show that the proposed method improves the tradeoff between reconstruction error and communication energy consumption under sim-to-real distribution shift. We further analyze the effectiveness of the proposed algorithm across various wireless channels and dynamic motion trajectories.

2605.19291 2026-05-20 stat.ML cs.LG math.ST stat.TH

Factor Augmented High-Dimensional SGD

因子增强的高维SGD

Shubo Li, Yuefeng Han, Xiufan Yu

发表机构 * Department of Statistics(统计学系) The Pennsylvania State University(宾夕法尼亚州立大学) Department of Applied and Computational Mathematics and Statistics(应用与计算数学与统计学系) University of Notre Dame(圣母大学)

AI总结 本文提出了一种新的优化方法Factor-Augmented SGD (FSGD),通过利用高维学习任务中的潜在因子表示,解决了传统两阶段降维方法在数据存储和在线学习中的限制,并建立了首个将潜在因子估计误差纳入SGD分析的理论框架,提供了在衰减步长和小批量更新下的$\ell^s$范数矩收敛性。

详情
AI中文摘要

随机梯度下降(SGD)是现代机器学习中广泛使用的基础优化算法。在本文中,我们提出Factor-Augmented SGD(FSGD),一种新的优化方法,利用高维学习任务中的潜在因子表示。与依赖于离线表示学习和完整数据存储的传统两阶段降维方法不同,FSGD的关键创新在于它完全在流数据上操作,使其能够扩展到大规模和高维问题。此外,我们建立了首个明确将潜在因子估计误差纳入SGD分析的理论框架,并在衰减步长和小批量更新下提供了$\ell^s$范数的矩收敛性。我们的结果为在高维机器学习系统中可靠和可扩展地使用SGD提供了新的基础。

英文摘要

Stochastic gradient descent (SGD) is a fundamental optimization algorithm widely used in modern machine learning. In this paper, we propose Factor-Augmented SGD (FSGD), a new optimization method that leverages latent factor representations in high-dimensional learning tasks. Unlike standard two-stage dimension reduction approaches that rely on offline representation learning and full data storage, a key novelty of FSGD is that it operates purely on streaming data, making it scalable to large-scale and high-dimensional problems. Furthermore, we establish the first theoretical framework that explicitly incorporates latent factor estimation error into the analysis of SGD, and provide moment convergence in $\ell^s$ norm under decaying step sizes and mini-batch updates. Our results provide a new foundation for employing SGD reliably and scalably in high-dimensional machine learning systems.

2605.19261 2026-05-20 cs.SE cs.AI cs.HC cs.PL

When Web Apps Heal Themselves: A MAPE-K Based Approach to Fault Tolerance and Adaptive Recovery

当Web应用自我修复:基于MAPE-K模型的故障容忍与自适应恢复方法

Sales Aribe, Rov Japheth Oracion

发表机构 * Information Technology Department, Bukidnon State University(布基农州大学信息科技系)

AI总结 本文提出一种基于MAPE-K模型的模块化自我修复框架,结合AutoFix机制实现自适应故障恢复,通过实验验证该框架在故障检测和恢复中的有效性,提高了Web应用的容错性和适应性。

Comments 12 pages, 3 figures, 2 tables

Journal ref Aribe, Sales G. & Oracion, R. J. G. (2026). When web apps heal themselves- A MAPE-K based approach to fault tolerance and adaptive recovery. International Journal of Informatics and Communication Technology, 15(2), 729-740

详情
AI中文摘要

确保现代Web应用的可靠性和韧性仍然是一个关键挑战,由于系统复杂性和动态运行环境的增加。本研究提出了一种基于共享知识库的监控-分析-计划-执行(MAPE-K)模型的模块化自我修复框架,并整合了受AutoFix启发的自适应故障恢复机制。通过设计和开发研究(DDR)方法,该系统在二十种运行故障场景中进行了实施和评估,包括服务崩溃、内存泄漏和数据库断开。实验结果表明,所提出的框架实现了平均故障检测F1得分为90.7%,恢复成功率为93.2%。AutoFix模块将平均恢复时间(TTR)减少了56.2%,实现了平均恢复时间为3.92秒。系统吞吐量在故障条件下保持在88%至95%之间,响应时间仅增加了3.1%。此外,迭代反馈机制通过多个循环提高了恢复效率18.6%。这些发现表明,所提出的框架通过反馈驱动的适应性提供了一种实用且可扩展的方法,以通过反馈驱动的适应性增强Web应用的容错性。尽管当前实现依赖于预定义的恢复策略,但学习导向的反馈为未来更自主的自我修复系统的开发奠定了基础。

英文摘要

Ensuring the reliability and resilience of modern web applications remains a critical challenge due to increasing system complexity and dynamic runtime environments. This study proposes a modular self-healing framework based on the monitor-analyze-plan-execute over a shared knowledge base (MAPE-K) model, integrated with an AutoFix-inspired mechanism for adaptive fault recovery. Using a design and development research (DDR) approach, the system was implemented and evaluated through controlled fault injection experiments across twenty runtime failure scenarios, including service crashes, memory leaks, and database disconnections. Experimental results demonstrate that the proposed framework achieved a mean fault detection F1-score of 90.7% and a recovery success rate of 93.2%. The AutoFix module reduced the average time-to-recovery (TTR) by 56.2%, achieving an average recovery time of 3.92 seconds. System throughput was maintained between 88% and 95% during fault conditions, with only a 3.1% increase in response time. Additionally, iterative feedback mechanisms improved recovery efficiency by 18.6% over multiple cycles. These findings indicate that the proposed framework provides a practical and extensible approach to enhancing fault tolerance in web applications through feedback-driven adaptation. While the current implementation relies on predefined recovery strategies, the integration of learning-oriented feedback establishes a foundation for future development of more autonomous self-healing systems.

2605.19227 2026-05-20 cs.CR cs.AI

Token by Token, Compromised: Backdoor Vulnerabilities in Unified Autoregressive Models

逐token被入侵:统一自回归模型中的后门漏洞

Tobias Braun, Jonas Henry Grebe, Hossein Shakibania, Anna Rohrbach, Marcus Rohrbach

发表机构 * TU Darmstadt(图宾根大学)

AI总结 本文研究了统一自回归模型中的后门漏洞问题,提出了一种名为Token by Token Backdoor Attack (ToBAC)的攻击方法,展示了如何通过数据和模型污染策略在多模态生成中引发有害行为。

详情
AI中文摘要

统一自回归模型(UAMs)是变压器模型,能够在单次自回归传递中生成文本和图像标记。共享参数和多模态词汇简化了训练流程并促进了灵活的多模态生成,但可能会引入新的漏洞。特别是,我们首次证明这种统一架构使多模态后门攻击成为可能,其中触发器可以跨多个输出模态传播恶意影响。具体而言,我们提出了Token by Token Backdoor Attack(ToBAC),这是首个针对UAMs的后门攻击,探索了基于数据和基于模型的污染策略。我们展示了无害的字符或甚至常见的单词可以被转换为触发器,从而在自回归图像生成中引发有害行为。ToBAC可以联合操控视觉输出和伴随文本,增加伪造内容的感知真实性。通过模型访问,ToBAC可以在统一的液体模型中发起攻击,其中微妙的词(例如,``cool'')在55%的生成中导致模态对齐的品牌推广或意识形态影响。在没有模型访问的情况下,ToBAC可以通过数据污染诱导,对JanusPro实现平均成功率为63.1%。

英文摘要

Unified autoregressive models (UAMs) are transformer models that generate text as well as image tokens within a single autoregressive pass. Shared parameters and a multimodal vocabulary simplify the training pipeline and facilitate flexible multimodal generation, yet might introduce new vulnerabilities. In particular, we are the first to show that this unified architecture enables multimodal backdoor attacks, where a trigger can propagate malicious effects across multiple output modalities. Specifically, we present the Token by Token Backdoor Attack (ToBAC), the first backdoor attack targeting UAMs, exploring both data-based and model-based poisoning strategies. We demonstrate that innocuous characters or even common words can be transformed into triggers that elicit harmful behavior in autoregressive image generation. ToBAC can jointly manipulate visual outputs and accompanying text, increasing the perceived authenticity of fabricated content. With model access, ToBAC enables attacks on the unified Liquid model in which a subtle word (e.g., ``cool'') induces modality-aligned brand promotion or ideological influence in 55% of generations. Without model access, ToBAC can be induced through data poisoning, achieving an average success rate of 63.1% against JanusPro.

2605.19208 2026-05-20 stat.AP cs.LG stat.ML

Precision Physical Activity Prescription via Reinforcement Learning for Functional Actions

通过强化学习实现功能动作的精准体育活动处方

Gefei Lin, Rui Miao, Jennifer Sacheck, Xiaoke Zhang

发表机构 * Department of Statistics, The George Washington University(统计系,乔治·华盛顿大学) Department of Mathematical Sciences, The University of Texas at Dallas(数学科学系,德克萨斯大学达拉斯分校) Department of Behavioral and Social Sciences, Brown University(行为与社会科学系,布朗大学)

AI总结 本文提出了一种基于强化学习的算法,用于根据心血管代谢风险个性化优化每日步数分布,通过All of Us研究数据验证了该方法在提高健康生物标志物方面的有效性。

详情
AI中文摘要

体育活动(PA)在维持和改善健康方面起着重要作用。日常步数已成为一种关键的PA测量指标,可通过常见的可穿戴设备轻松获取。然而,缺乏推荐个性化最优每日步数分布的方法以最佳改善某些健康生物标志物。本文基于All of Us研究数据,该数据包括数月的步数计数以及关键健康生物标志物的重复测量,开发了一种新的离线强化学习(RL)算法,以学习与心血管代谢风险相关的个性化和最优PA分布,其中动作是一个函数,表示一段时间内每日步数分布。模拟研究显示,所提出的方法在现有连续动作RL方法中具有优势。从All of Us数据中学习到的最优策略通常建议人们增加日常步数,并在时间上遵循更一致的PA模式,同时为血糖水平、体重指数、血压、年龄和性别等亚组提供定制推荐。

英文摘要

Physical activity (PA) plays an important role in maintaining and improving health. Daily steps have been a key PA measure that is easily accessible with common wearable devices. However, methods are lacking to recommend a personalized optimal distribution of daily steps over a period of time for the best of certain health biomarkers. In this paper, we fill this void based on the data from the All of Us Research Program which includes months of step counts as well as repeated measurements of key health biomarkers. We develop a new offline reinforcement learning (RL) algorithm to learn personalized and optimal PA distributions associated with cardiometabolic risk, where the action is a function representing the daily step distribution over a period of time. Simulation studies demonstrate the advantage of the proposed approach over existing continuous-action RL methods. The learned optimal policy from the All of Us data generally suggests people take more daily steps and also follow a more consistent pattern of PA over time while offering tailored recommendations for subgroups in blood glucose level, body mass index, blood pressure, age, and sex.

2605.19190 2026-05-20 cs.CY cs.AI cs.HC

Going PLACES: Participatory Localized Red Teaming for Text-to-Image Safety in the Global South

Going PLACES: 参与式本地化红队测试用于全球南方的文本到图像安全

Charvi Rastogi, Mukul Bhutani, Minsuk Kahng, Shamsuddeen Hassan Muhammad, Evgeniia Razumovskaia, Priyanka Suresh, Ibrahim Said Ahmad, Charu Kalia, Yaaseen Mahomed, Madhurima Maji, Minjae Lee, Alicia Parrish, Jessica Quaye, Vijay Janapa Reddi, Aishwarya Verma, Lora Aroyo

发表机构 * Google DeepMind(谷歌深Mind) Yonsei University(延世大学) Imperial College(帝国理工学院) University of Wisconsin–Stevens Point(威斯康星州立大学斯普林特分校) Google Research(谷歌研究) Harvard University(哈佛大学)

AI总结 本文提出PLACES数据集,通过在非洲和亚洲的本地社区进行参与式红队测试,收集了26000多个文本到图像模型失败案例,揭示了全球南方在文化和社会规范方面的独特对抗模式和安全框架的结构性缺失。

Comments Published at ACM Conference on FAccT 2026

详情
AI中文摘要

尽管文本到图像(T2I)模型已在全球范围内部署,但其安全框架大多基于西方默认设置,这为其他地区带来了显著的漏洞。为了拥抱文化多元主义并引入历史上代表性不足的视角,我们在全球南方进行了本地化的社区中心红队测试研究。我们的双重视角优先考虑本地化和参与,通过关注这些地区的次级城市中心,并开展社区参与和培训研讨会,以 contextualize 本地规范。结果,我们提出了PLACES数据集,其中包括与加纳、尼日利亚以及印度两个地区(卡纳塔克和旁遮普)的大学合作收集的超过26,000个T2I模型失败示例。分析收集的提示揭示了与现有地理无关的众包红队测试数据相比,社会文化和语言属性的广泛多样性。我们观察到由本地文化和语言细微差别所启用的独特对抗模式,以及在地区内围绕特定主题(如印度的宗教)的明显聚类。此外,我们通过识别新的危害,揭示了现有安全框架的结构性缺失,这些危害表现出规范不一致(例如,违反宗教规范、忽视本地习俗和 ominous 的象征意义)。这项工作认为,扩展T2I安全需要超越单纯的规模,转而采用深入本地化和参与性的数据收集和情境化方法。内容警示:本文包含可能有害或冒犯性内容的示例。

英文摘要

Despite the global deployment of text-to-image (T2I) models, their safety frameworks are largely calibrated to a Western-centric default, creating significant vulnerabilities for the rest of the world. To embrace cultural pluralism and bring historically under-represented perspectives in T2I safety, we conduct localised community-centered red teaming studies in the Global South. Our two-fold approach prioritizes localization and participation, by focusing on secondary urban centers in these regions, and conducting community engagement and training workshops to contextualize local norms. As a result, we present PLACES, a dataset comprising over 26,000 examples of T2I model failures collected in partnership with universities in Ghana, Nigeria, and two regions of India (Karnataka and Punjab). Analysis of prompts collected reveals a wide-ranging diversity in socio-cultural and linguistic attributes, when compared to existing geography-agnostic crowdsourced red-teaming data. We observe unique adversarial patterns enabled by local cultural and linguistic nuances, and distinct clusters within region around specific themes, such as religion in India. Moreover, we uncover structural contextual gaps in existing safety frameworks by identifying novel harms showing normative dissonance (e.g., violating religious norms, ignoring local customs, and ominous symbolism). This work argues that expanding T2I safety requires moving beyond mere scale to incorporate deeply localised, participatory methodologies for data collection and contextualization. Content warning: This paper includes examples containing potentially harmful or offensive content.

2605.19179 2026-05-20 astro-ph.EP astro-ph.IM cs.LG

A Cloud-Based Tool for Meteorite Recovery Using Drones and Machine Learning

基于云技术的陨石回收工具:利用无人机和机器学习

Seamus L. Anderson, Hadrien A. R. Devillepoix, Lewis Lakerink, Sawitchaya Tippaya, Dale P. Giancono, Martin C. Towner, Iona Clemente, Martin Cupák, Ashley F. Rogers, John H. Fairweather, Mia Walker, Daniel Burgin, Michael A. Frazer, Sophie E. Deam, Veronika Pazderová, Eleanor K. Sansom, Benjamin A. D. Hartig, Hely C. Branco, Thomas Stevenson, Isabella Hatty, Anna Zappatini, Anthony Lagain, Tom Lovelock, Auriane Egal, Lucy Forman, David Belton, Simon Windsor, Shibli Saleheen, Asher Leslie, Gregory B. Poole, Andrew Langendam, Rachel S. Kirby, Andrew G. Tomkins

发表机构 * NASA Goddard Space Flight Center(美国国家航空航天局戈达德太空飞行中心) Space Science and Technology Centre(空间科学与技术中心) International Centre for Radio Astronomy Research(国际射电天文研究中心) Astronomy Data and Computing Services (ADACS)(天文数据与计算服务) Curtin Institute for Data Science(Curtin数据科学研究所) Centre for Rock Art Research and Management(岩画研究与管理中心) Faculty of Mathematics, Physics and Informatics, Comenius University Bratislava(布拉迪斯拉发大学数学、物理与信息学学院) Institute of Geology, University of Bern(伯尔尼大学地质研究所) Aix-Marseille University, CNRS, IRD, INRA, CEREGE, Institut Origines(阿维尼翁大学,CNRS,IRD,INRA,CEREGE,Origines研究所) Royal Holloway University of London(皇家霍洛威大学) Planétarium de Montréal, Espace pour la Vie(蒙特利尔天文馆,生命空间) Department of Physics and Astronomy, The University of Western Ontario(滑铁卢大学物理与天文学系) School of Earth and Planetary Sciences, Curtin University(Curtin大学地球与行星科学学院) Australian Nuclear Science and Technology Organisation(澳大利亚核科学与技术组织) School of Earth, Atmosphere and Environment, Monash University(莫纳什大学地球、大气与环境学院)

AI总结 本文提出一种基于云技术的工具,利用无人机和机器学习帮助恢复通过仪器观测到的陨石坠落。该工具展示了系统迭代改进的成果,并详细说明了该技术在澳大利亚南部和西海岸陨石坠落中的成功与局限性。

Comments 23 pages, 3 figures

详情
AI中文摘要

我们提出了一种基于云技术的工具,利用无人机和机器学习来帮助恢复通过仪器观测到的陨石坠落。我们展示了一 series of improvements made upon previous iterations of our system, as well as detail the successes and limitations of this technique when applied to observed meteorite falls in South and Western Australia. This tool is available to the meteoritics research community upon request at https://find.gfo.rocks.

英文摘要

We present a cloud-based tool that uses drones and machine learning to help recover instrumentally observed meteorite falls. We showcase a collection of improvements made upon previous iterations of our system, as well as detail the successes and limitations of this technique when applied to observed meteorite falls in South and Western Australia. This tool is available to the meteoritics research community upon request at https://find.gfo.rocks.

2605.19178 2026-05-20 cond-mat.dis-nn cond-mat.stat-mech cs.LG physics.data-an

Activation Functions, Statistics and Learning of Higher-Order Interactions in Restricted Boltzmann Machines

激活函数、统计学和受限玻尔兹曼机中高阶相互作用的学习

Giovanni di Sarra, Yasser Roudi

发表机构 * Kavli Institute for Systems Neuroscience, Norwegian University of Science and Technology(Kavli系统神经科学研究所,挪威科学技术大学) Department of Mathematics, King’s College London(伦敦国王学院数学系)

AI总结 本文研究了受限玻尔兹曼机中激活函数对高阶相互作用统计学和学习的影响,分析了四种常见激活函数在不同参数范围内的表示和学习能力。

Comments 38 pages, 27 figures

详情
AI中文摘要

神经网络在复杂数据中识别隐藏模式和相关性的巨大成功,归功于它们利用大量参数和非线性单单元激活函数的方式。受限玻尔兹曼机(RBMs)提供了一个简单而强大的框架,用于研究激活非线性对性能和表示的影响。在本工作中,我们利用RBMs与相互作用二元变量模型之间的双重性,研究了不同隐藏单元激活函数的RBM集合所诱导的相互作用的统计学。我们以四种常用激活函数(线性、阶跃、ReLU和指数)的诱导相互作用分布的矩来分析可表示模型的空间。对学习的定量预测与训练过程模拟的结果有很好的一致。特别是,我们的分析表明,某些数据结构,即由具有大相互作用项的相互作用变量模型生成的结构,对于任何RBM来说都难以表示和学习。然而,我们发现快速增加的非线性,如指数函数,可以促进特定参数范围内的此类数据结构的表示和学习。

英文摘要

The great success of neural networks in recognizing hidden patterns and correlations in complex data lies in the way they take advantage of the large number of parameters and nonlinear single-unit activation, jointly. Restricted Boltzmann Machines (RBMs) provide a simple yet powerful framework for studying the impact of activation nonlinearities on performance and representation. In this work, we exploit the duality between RBMs and models of interacting binary variables to study the statistics of the interactions induced by RBM ensembles with different hidden unit activation functions. We characterize the space of representable models analytically in terms of moments of the distribution of induced interactions for four commonly used activation functions: Linear, Step, ReLU, and Exponential. Quantitative predictions of the analytical calculations on learning show a very good agreement with results of the simulations of the training process. In particular, our analysis shows that there are certain data structures, namely those generated by models of interacting variables with large interaction terms beyond pairwise, that are difficult to represent, and thus to learn, for any RBM. Yet, we find that rapidly increasing nonlinearities, such as the Exponential function, can facilitate the representation and learning of such data structures for a specific range of parameters that is determined analytically.

2605.19147 2026-05-20 cs.CR cs.AI cs.LG

Be Kind, Rewrite: Benign Projections via Rewriting Defend Against LLM Data Poisoning Attacks

仁者重写:通过重写实现良性投影以防御大语言模型数据中毒攻击

John T. Halloran, Noopur S. Bhatt

发表机构 * Leidos University of Pennsylvania(宾夕法尼亚大学)

AI总结 本文提出了一种基于重写的良性投影方法(OBBR),通过利用开放书本的良性样本来提高大语言模型对数据中毒攻击的防御能力,实验表明OBBR在多种已知攻击模式中表现出更高的安全性能,并且在计算效率和模型性能方面具有优势。

Comments 15 pages, 2 Figures, 5 Tables

详情
AI中文摘要

大型语言模型(LLMs)对后门攻击(BAs)非常敏感,其中训练样本通过基于触发器的有害内容进行中毒。此外,现有防御措施在广泛测试不同BA模式时已被证明无效。为了更好地对抗BAs,我们探索了使用LLM重写作为主动防御数据中毒的方法。首先,我们理论证明,当LLM重写利用开放书本良性样本(称为开放书本良性重写,OBBR)时,重写输出为良性的概率严格大于闭合书本重写。因此,OBBR通过将训练样本投影到良性提示空间来中和有害内容。我们随后表明,与以往的防御措施不同,OBBR有效缓解了大量现有的BAs:在五种已知BAs和四种广泛使用的LLMs中,OBBR相比最先进的BA防御措施平均提高了51%的安全性能,相比闭合书本重写方法提高了25.7%。最后,我们表明OBBR在计算效率方面优于其他BA防御措施,经过微调后不会降低模型在自然语言任务上的性能,并且能够防御非触发基于的数据中毒攻击。

英文摘要

Large language models (LLMs) are highly susceptible to backdoor attacks (BAs), wherein training samples are poisoned using trigger-based harmful content. Furthermore, existing defenses have proven ineffective when extensively tested across BA patterns. To better combat BAs, we explore the use of LLM rewriting as a proactive defense against data poisoning. First, we theoretically show that when LLM rewriting utilizes open-book benign samples--termed open-book benign rewriting (OBBR)--the probability of a rewritten output being benign is strictly greater than that of closed-book rewriting. Thus, OBBR neutralizes harmful content by projecting training samples to the space of benign prompts. We then show that, in contrast to previous defenses, OBBR effectively mitigates a large number of existing BAs: across five known BAs and four widely used LLMs, OBBR increases safety performance by an average 51% compared to state-of-the-art BA defenses and 25.7% compared to closed-book rewriting methods. Finally, we show that OBBR is computationally efficient relative to other BA defenses, does not degrade model performance on natural language tasks after fine-tuning, and is capable of defending against non-trigger based data poisoning attacks.

2605.19124 2026-05-20 cond-mat.mtrl-sci cond-mat.dis-nn cs.LG physics.chem-ph

Atomistic Modeling of Chemical Disorder in Materials: Bridging Classical Methods and AI-Assisted Approaches

材料中化学无序的原子模型:连接经典方法和AI辅助方法

Jiayu Peng, Peichen Zhong

发表机构 * Department of Materials Design and Innovation, University at Buffalo(布法罗大学材料设计与创新系) Department of Materials Science and Engineering, National University of Singapore(新加坡国立大学材料科学与工程系)

AI总结 本文探讨了如何通过结合经典方法和AI技术来解决材料中化学无序的表示差距问题,重点介绍了如何利用计算方法将平均无序描述转换为具有代表性的构型集合,并平衡成本、偏差和保真度。

详情
AI中文摘要

化学无序源于多种元素占据晶格位置的混合占据,广泛存在于合金、陶瓷和成分复杂的材料中,其中短程和长程有序可以显著影响性质。一个核心障碍是实验与模拟之间的表示差距:实验通常报告无序为部分占据和集体平均行为,而原子模拟和AI工作流程通常需要完全指定的配置。解决这一差距需要能够将平均无序描述转换为代表性构型集合的计算方法,同时平衡成本、偏差和保真度。这一挑战在AI驱动的计算发现中变得更加紧迫,因为忽略无序可能导致AI工作流程错误排名稳定性、错误判断新颖性和误导实验,使用过于理想化的表示。本文综述了经典方法和AI驱动方法如何弥合这一表示差距。我们评估了从平均场理论、簇扩展、准随机近似、蒙特卡洛以及新兴的由通用原子间势能和生成模型驱动的方法的优缺点。我们还强调了AI如何通过降低微状态评估、构型探索和原子到热力学闭合的成本来加速经典计算方案。我们还强调了AI如何使无序原生能力成为可能,包括工作流程优先级、对有序敏感和化学表示、生成模型的无序结构和分布以及对动力学敏感的无序预测。共同,这一框架概述了通往无序原生AI的实用路线图,将化学无序从一个表示障碍转变为现实AI加速材料发现中的可控变量。

英文摘要

Chemical disorder, originating from the mixed occupation of crystallographic sites by multiple elements, is widespread in alloys, ceramics, and compositionally complex materials, where short- and long-range orderings can strongly influence properties. A central obstacle is the representation gap between experiments and simulations: experiments often report disorder as partial occupancies and ensemble-averaged behaviors, whereas atomistic simulations and AI workflows usually require fully specified configurations. Tackling this gap requires computational methods that convert averaged disorder descriptions into representative configurational ensembles while balancing cost, bias, and fidelity. This challenge has become more urgent in AI-driven computational discovery, where ignoring disorder may cause AI workflows to misrank stability, misjudge novelty, and misdirect experiments with too-idealized representations. This Review highlights how classical and AI-driven methods can bridge this representation gap. We assess the strengths and limitations of approaches spanning mean-field theories, cluster expansion, quasi-random approximations, Monte Carlo, and emerging schemes powered by universal interatomic potentials and generative models. We further highlight how AI can accelerate classical computational schemes by lowering the cost of microstate evaluation, configurational exploration, and atomistic-to-thermodynamic closure. We also emphasize how AI can enable disorder-native capabilities, including workflow triage, ordering-sensitive and alchemical representations, generative models of disordered structures and distributions, and kinetics-aware disorder prediction. Together, this framework outlines a practical roadmap toward disorder-native AI, which can transform chemical disorder from a representational obstacle into a controllable variable for realistic AI-accelerated materials discovery.