arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2157
2605.19549 2026-05-20 cs.SE cs.LG

Provable Fairness Repair for Deep Neural Networks

深度神经网络的可证公平修复

Jianan Ma, Jingyi Wang, Qi Xuan, Zhen Wang

AI总结 本文提出ProF框架,通过区间界限传播技术,为深度神经网络提供可证的公平性修复,实现对偏见样本周围整个集合的公平性保障,并在多个基准数据集上验证了其有效性。

Comments 15 pages, 6 figures, 7 tables. full version of the paper accepted by ASE 2025

详情
Journal ref
Proceedings of the 40th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2025
AI中文摘要

深度神经网络(DNNs)正面临诸如个体歧视等伦理问题。为此,已开发出大量NN修复技术来调整模型并减轻此类不良行为。然而,现有公平性修复方法通常是数据驱动的,往往缺乏可证保证和对未见过样本的泛化能力。为克服这些限制,我们提出了ProF,一种具有可证保证的新型公平性修复框架。ProF的核心思想是利用区间界限传播(一种广泛使用的神经网络验证技术)来在偏见样本x周围的整个集合S(x)上准确捕捉模型输出。所推导的界限用于指导公平性修复,促使模型在S(x)上产生一致的输出。具体而言,我们将公平性约束和模型修改整合到统一的约束求解公式中,该公式可转换为可由现成求解器解决的混合整数线性规划(MILP)问题。MILP问题的解有效地诱导出一个具有整体S(x)公平性保障的修复模型。我们在四个广泛使用的基准数据集上评估了ProF,并证明其实现了可证公平性修复,在完整数据集上的泛化能力高达95.93%,在整个输入空间上为93.16%。值得注意的是,ProF可以轻松配置以支持多种敏感属性和更实际的公平性定义,同时提供可证修复保证,并实现约90%的公平性提升。我们的代码可在https://github.com/nninjn/ProF上获得。

英文摘要

Deep neural networks (DNNs) are suffering from ethical issues such as individual discrimination. In response, extensive NN repair techniques have been developed to adjust models and mitigate such undesired behaviors. However, existing fairness repair methods are typically data-centric, which often lack provable guarantees and generalization to unseen samples. To overcome these limitations, we propose ProF, a novel fairness repair framework with provable guarantees. The key intuition of ProF is to leverage interval bound propagation (a widely used NN verification technique) to soundly capture model outputs over the whole set $S(\mathbf{x})$ around a biased sample $\mathbf{x}$. The derived bounds are utilized to guide fairness repair which encourages the model to produce consistent outputs on $S(\mathbf{x})$. Specifically, we integrate fairness constraints and model modifications into a unified constraint-solving formulation, which can be transformed to a Mixed-Integer Linear Programming (MILP) problem solvable by off-the-shelf solvers. The solution to the MILP problem effectively induces a repaired model with guaranteed fairness over the whole set $S(\mathbf{x})$. We evaluate ProF on four widely used benchmark datasets and demonstrate that it achieves provable fairness repair, with generalization of up to 95.93\% on full datasets and 93.16\% on the entire input space. Notably, ProF can be easily configured to support multiple sensitive attributes and more practical fairness definitions, while providing provable repair guarantees and delivering around 90\% fairness improvement. Our code is available at https://github.com/nninjn/ProF.

2605.19478 2026-05-20 cs.CR cs.CV

Exposing Functional Fusion: A New Class of Strategic Backdoor in Dynamic Prompt Architectures

揭示功能融合:动态提示架构中一种新的战略后门类别

Zeyao Liu, Zhendong Zhao, Xiaojun Chen, Xin Zhao, Yuexin Xuan, Xiaoshuang Ji

AI总结 本文提出VIPER攻击框架,揭示动态提示架构中通过功能融合产生的新风险,该框架在轻量级动态视觉提示生成器上实现,展示了恶意逻辑与良性任务功能的紧密融合,从而在剪枝时破坏良性性能,同时保持高ASR和低延迟。

详情
AI中文摘要

现有的基于背骨重写全调优的ViT后门攻击在计算上昂贵且会降低性能。这迫使攻击者转向以适配器为基础(例如LoRA)和提示为基础(例如VPT)的视觉参数高效微调(PEFT)范式。尽管适配器安全已有一些初步研究,但快速增长的提示基础生态系统中的风险仍严重未被探索。我们填补了这个关键缺口,揭示了VPT向动态和上下文感知架构演进如何促成一种更加危险和新兴的威胁。这种漏洞即使在这些动态模块解锁了优越良性性能的情况下也会出现。我们提出了VIPER,一个基于轻量级动态视觉提示生成器(VPG)的攻击框架,展示了这种漏洞。关键的是,这种动态架构使功能融合成为可能:恶意逻辑和良性任务功能紧密融合到同一个稀疏、高幅度参数核心中。这种融合创造了一个严峻的“人质”困境,因为剪枝攻击必然破坏良性性能。全面评估显示VIPER有效解决了攻击者的三重困境:VIPER不仅在干净数据上实现了最先进的性能,而且在90% VPG模块剪枝(LoRA攻击崩溃)的情况下仍保持近100%的ASR,同时仅增加可察觉的0.06ms(1.16%)推理延迟。VIPER的结果,由功能融合驱动,揭示了动态提示架构中一种新的、范式级别的风险。

英文摘要

Existing ViT backdoor attacks based on backbone-overwriting full-tuning are computationally expensive and inflict performance degradation. This has forced adversaries towards the Visual Parameter-Efficient Fine-Tuning (PEFT) paradigm, dominated by adapter-based (e.g., LoRA) and prompt-based (e.g., VPT) approaches. While adapter security has seen initial study, the risks of the burgeoning prompt-based ecosystem remain critically unexplored. We fill this critical gap, exposing how the evolution of VPT towards dynamic and context-aware architectures can facilitate a far more dangerous and emergent threat. This vulnerability arises even though these dynamic modules unlock superior benign performance. We propose VIPER, an attack framework built on a lightweight, dynamic Visual Prompt Generator (VPG) that demonstrates this vulnerability. Critically, this dynamic architecture enables Functional Fusion: an emergent phenomenon where malicious logic and benign task utility are tightly fused into the same sparse, high-magnitude parameter core. This fusion creates a formidable ``hostage" dilemma, as pruning the attack necessarily destroys the benign performance. Comprehensive evaluations show VIPER effectively addresses the attacker's trilemma: VIPER not only achieves state-of-the-art performance on clean data, but also maintains near-100% ASR even under 90% VPG-module pruning (where LoRA attacks collapse), while adding only an imperceptible 0.06ms (1.16%) of inference latency. VIPER's results, driven by Functional Fusion, expose a new, paradigm-level risk in dynamic prompt architectures.

2605.19452 2026-05-20 cs.DC cs.AI

Resilient Byzantine Agreement with Predictions

具有预测功能的容错一致性协议

Julien Dallot, Darya Melnyk, Tijana Milentijevic, Stefan Schmid, Patrik Welters

AI总结 本文研究了在有预测器辅助下解决拜占庭共识问题,通过算法容错性和预测器准确度的权衡分析,提出在非认证和认证设置下容忍不同数量故障节点的算法及不可能性结果。

详情
AI中文摘要

本文研究了在有预测器辅助下解决拜占庭共识问题。我们关注算法的容错性——算法能容忍的最大故障节点数,并提出了其容错性依赖于预测器准确度的算法和不可能性结果。我们的第一个主要结果是对非认证和认证设置下的一致性-鲁棒性权衡进行了完整刻画:对于n个节点和参数α∈[0,1],当预测器正确时,算法可以容忍最多α·n个故障节点(一致性);当预测器任意错误时,可以容忍最多(1-α)/2·n -1个故障节点(鲁棒性);在认证设置下,鲁棒性界限提高到(1-α)·n -1。这些权衡是精确的,因为我们证明再多一个故障节点会使问题变得不可能。我们的第二个主要结果刻画了平滑度:预测器准确性降低时,容错性下降的速率。我们证明只要错误预测的数量保持在n的常数比例内,容错性会线性减少。具体而言,在非认证设置下,每个额外的错误预测会损失一个单位的容错性,而在认证设置下,由于需要两个错误预测才能损失一个单位的容错性,因此下降幅度减半。

英文摘要

This paper studies the Byzantine Agreement problem where the nodes have access to a predictor that flags nodes for suspicion of faulty (Byzantine) behavior. We focus on algorithmic resilience -- the maximum number of faulty nodes an algorithm can tolerate -- and present algorithms and impossibility results whose resilience depend on the accuracy of the predictor. As our first main result, we bring a complete characterization of the consistency--robustness trade-offs in both the non-authenticated and authenticated settings: for $n$ nodes and a parameter $α\in [0, 1]$, we present algorithms that tolerate up to $α\cdot n$ faulty nodes when the predictor is correct (consistency), and up to $\frac{1-α}{2} \cdot n - 1$ faulty nodes when the predictor is arbitrarily wrong (robustness); in the authenticated setting the robustness bound improves to $(1-α) \cdot n - 1$. These trade-offs are exactly tight as we show that one additional faulty node renders the problem impossible. Our second main result characterizes smoothness: the rate at which resilience degrades as the predictor becomes less accurate. We show that resilience linearly decreases in the number of wrong predictions as long as that number stays within a constant fraction of $n$. Concretely, in the non-authenticated setting each additional wrong prediction loses one unit of resilience, whereas in the authenticated setting the decline is halved since two wrong predictions are needed to lose one unit of resilience.

2605.19391 2026-05-20 stat.ML cs.LG

Tweedie's Formulae and Diffusion Generative Models Beyond Gaussian

Tweedie's公式与超越高斯的扩散生成模型

Wenpin Tang, Nizar Touzi, Zikun Zhang, Xun Yu Zhou

AI总结 本文扩展了Tweedie公式以适用于重要的非高斯过程,如几何布朗运动、平方贝塞尔过程和Cox-Ingersoll-Ross过程,并利用这些公式在图像和金融时间序列生成以及经验贝叶斯估计中应用非高斯扩散模型,展示了非高斯模型的潜力。

Comments 27 pages, 18 figures

详情
AI中文摘要

扩散模型在生成未知数据分布的样本方面取得了显著成功。大多数流行的基于随机微分方程的扩散模型通过向目标分布添加高斯噪声,将其转换为简单的先验分布,然后使用去噪分数匹配,这是Tweedie公式的结果,来学习分数函数并从噪声中生成干净的样本。然而,具有状态依赖扩散系数的非高斯扩散模型以及相应的Tweedie公式一直被忽视。在本文中,我们扩展了Tweedie公式以适用于重要的非高斯过程,包括几何布朗运动(GBM)、平方贝塞尔(BESQ)过程和Cox-Ingersoll-Ross(CIR)过程,从而得到相应的去噪分数匹配目标。然后,我们应用推导出的公式,使用基于GBM和CIR的扩散模型进行图像和金融时间序列生成,并在BESQ设置下进行经验贝叶斯估计。报告的实验结果展示了非高斯模型的潜力。

英文摘要

Diffusion models have achieved remarkable success in generating samples from unknown data distributions. Most popular stochastic differential equation-based diffusion models perturb the target distribution by adding Gaussian noise, transforming it into a simple prior, and then use denoising score matching, a consequence of Tweedie's formula, to learn the score function and generate clean samples from noise. However, non-Gaussian diffusion models with state-dependent diffusion coefficient have been largely underexplored, as have the corresponding Tweedie's formulae. In this work, we extend Tweedie's formula to important non-Gaussian processes, including geometric Brownian motion (GBM), squared Bessel (BESQ) processes, and Cox-Ingersoll-Ross (CIR) processes, thereby yielding the corresponding denoising score-matching objectives. We then apply the derived formulae to image and financial time series generation using GBM- and CIR-based diffusion models, and to empirical Bayes estimation under the BESQ setting. The reported experimental results demonstrate the potential of non-Gaussian models.

2605.19373 2026-05-20 cs.DC cs.AI cs.LG

Conflict-Free Replicated Data Types for Neural Network Model Merging: A Two-Layer Architecture Enabling CRDT-Compliant Model Merging Across 26 Strategies

用于神经网络模型融合的无冲突复制数据类型:一种双层架构,使26种策略兼容CRDT模型融合

Ryan Gillespie

AI总结 本文提出了一种双层架构CRDTMergeState,通过将任何融合策略封装在CRDT兼容层中,解决了26种神经网络融合策略在分布式操作中无法满足交换律、结合律和幂等律的结构性问题,实现了强最终一致性。

详情
AI中文摘要

我们测试的所有26种神经网络融合策略,包括加权平均、SLERP、TIES、DARE、Fisher融合和进化方法,均无法满足用于无冲突分布式操作所需的代数属性(交换性、结合性和幂等性)。我们证明这种失败是结构性的:基于规范化的方法无法同时满足这三个属性。为了解决这个问题,我们提出了一种双层架构——CRDTMergeState,它将任何融合策略封装在CRDT兼容(无冲突复制数据类型)层中。第一层通过OR-Set CRDT语义管理贡献,其中融合操作是集合并集——这显然具有交换性、结合性和幂等性。第二层将融合策略作为确定性纯函数应用于一个规范有序的贡献集上,随机性从Merkle根中播种。我们证明这种分离保证了强最终一致性:所有接收相同贡献的副本计算出相同的融合模型,无论消息顺序如何。实证验证涵盖三个层次:受控的4x4张量(104/104测试通过)、生产规模的模型(最高7.24B参数,208种策略级测试,43,368种层级属性检查在受限张量分辨率下)以及多节点收敛在 gossip 和分区修复(100个节点,20种顺序)中,CRDT开销低于0.5毫秒。由于封装器是透明的,下游性能由构造保证,通过字节相同输出验证确认。参考实现可用作crdt-merge v0.9.4。

英文摘要

All 26 neural network merge strategies we tested including weight averaging, SLERP, TIES, DARE, Fisher merging, and evolutionary approaches -- fail the algebraic properties (commutativity, associativity, idempotency) required for conflict-free distributed operation. We prove that this failure is structural: normalisation-based merges cannot simultaneously satisfy all three properties. To resolve this, we present a two-layer architecture -- CRDTMergeState -- that wraps any merge strategy in a CRDT-compliant (Conflict-Free Replicated Data Type) layer. Layer 1 manages contributions via OR-Set CRDT semantics, where the merge operation is set union -- trivially commutative, associative, and idempotent. Layer 2 applies merge strategies as deterministic pure functions over a canonically-ordered contribution set, with randomness seeded from the Merkle root. We prove that this separation guarantees Strong Eventual Consistency: all replicas receiving the same contributions compute identical merged models, regardless of message ordering. Empirical validation spans three tiers: controlled 4x4 tensors (104/104 tests pass), production-scale models up to 7.24B parameters (208 strategy-level tests, 43,368 layer-level property checks at capped tensor resolution), and multi-node convergence under gossip and partition healing (100 nodes, 20 orderings), with CRDT overhead below 0.5 ms. Because the wrapper is transparent, downstream performance is identical by construction, confirmed via byte-identical output verification. The reference implementation is available as crdt-merge v0.9.4.

2605.19355 2026-05-20 cs.GR cs.AI cs.CV cs.LG

Skinned Motion Retargeting with Spatially Adaptive Interaction Guidance

具有空间自适应交互引导的皮肤运动重定向

Soojin Choi, Seokhyeon Hong, Chaelin Kim, Junghyun Nam, Junhyuk Jeon, Junyong Noh

AI总结 本文提出了一种几何感知的运动重定向框架,通过在空间自适应锚点上进行接近匹配,保留交互语义,以解决在不同身体形状角色之间重定向运动时保持交互语义(如自接触和近身体接近)的挑战。

Comments SIGGRAPH 2026 / ACM TOG. Project page available at https://suzyn.github.io/space_page/

详情
AI中文摘要

在不同身体形状的角色之间进行运动重定向,同时保持交互语义,如自接触和近身体接近,仍是一个具有挑战性的问题。尽管最近的几何感知方法通过维持预定义对应区域之间的空间关系来解决这一问题,但它们对静态对应关系的依赖在目标角色表现出夸张的身体比例时往往遇到困难。在本文中,我们提出了一种几何感知的运动重定向框架,通过在空间自适应锚点上进行接近匹配来保留交互语义。与以往具有静态锚点定义的方法不同,所提出的方法动态地将锚点重新定位到目标角色上可到达的区域。这通过基于Transformer的锚点细化策略实现,该策略预测锚点位移,并通过可微的软投影将转换后的锚点限制在目标角色的几何结构上。通过结合源角色的姿势依赖空间结构,适应的锚点为交互感知的重定向提供结构上连贯的指导。在这些锚点的条件下,基于图的自编码器预测目标骨骼运动,以保持源的空问配置。为了鼓励锚点适应和运动重定向之间的任务对齐优化,我们采用交替训练方案,其中每个模块依次优化。通过广泛的评估,我们证明了我们的方法在保持交互保真度方面优于最先进的方法,适用于多样化的角色几何结构。

英文摘要

Retargeting motion across characters with varying body shapes while preserving interaction semantics, such as self-contact and near-body proximity, remains a challenging problem. While recent geometry-aware approaches address this by maintaining spatial relationships between predefined corresponding regions, their reliance on static correspondences often struggles when the target character exhibits exaggerated body proportions. In this paper, we present a geometry-aware motion retargeting framework that preserves interaction semantics by performing proximity matching over spatially adaptive anchors. Unlike prior methods with static anchor definitions, the proposed method dynamically repositions anchors to reachable regions on the target character. This is achieved via a Transformer-based anchor refinement strategy that predicts anchor displacements and constrains the translated anchors to remain on the target character geometry through differentiable soft projection. By incorporating pose-dependent spatial structures from the source character, the adapted anchors provide structurally coherent guidance for interaction-aware retargeting. Conditioned on these anchors, a graph-based autoencoder predicts target skeletal motion that preserves the spatial configuration of the source. To encourage task-aligned optimization between anchor adaptation and motion retargeting, we adopt an alternating training scheme in which each module is optimized in turn. Through extensive evaluations, we demonstrate that our method outperforms state-of-the-art approaches in preserving interaction fidelity across diverse character geometries.

2605.19352 2026-05-20 q-bio.NC cs.AI cs.LG

Brain alignment of reasoning and action representations from vision-language and action models during naturalistic gameplay

在自然主义游戏过程中,视觉语言和动作模型的推理与动作表示的脑部对齐

Subba Reddy Oota, Anant Khandelwal, Khushbu Pahwa, Satya Sai Srinath Namburi, Tanmoy Chakraborty, Bapi S. Raju, Manish Gupta

AI总结 本文研究了在自然主义游戏过程中,视觉语言模型和大动作模型的推理与动作表示在脑部活动中的对齐情况,发现动作聚焦和推理聚焦的提示影响模型内部表示与fMRI脑活动的对齐程度。

Comments 21 pages, 11 figures

详情
AI中文摘要

理解人类和人工智能系统如何通过与环境互动来预测和规划是一个在神经科学和机器学习交汇处的基本挑战。大多数脑编码研究集中在将人工模型与大脑活动对齐,特别是在语言理解和被动视觉处理期间,而交互式脑对齐研究迄今为止大多局限于强化学习(RL)代理和理论模型。为了解决这一差距,我们使用fMRI记录参与者玩自然主义的Atari风格视频游戏,研究了来自两个基础模型家族(即视觉语言模型(VLMs)和大动作模型(LAMs))的代表性模型的脑部对齐情况。具体而言,我们研究了动作聚焦和推理聚焦的提示如何影响模型的内部表示并与其fMRI脑活动对齐。首先,我们发现VLMs和LAMs在每个体素编码性能上显著优于RL基线,即使在匹配的特征维度下,优势依然存在。其次,提示驱动的增益与皮层处理层次结构成比例:最大的改进出现在前额叶和运动规划区域,而早期视觉皮层的增益大约只有后者的二分之一。第三,方差分区揭示了不同的表征组织:VLM是提示对称的(12.5%独特的动作vs.13.6%独特的推理),而LAM是提示不对称的(27%独特的动作vs.-5%独特的推理),不对称性在前额运动皮层最强。总的来说,这些结果表明,即使在全脑预测准确性在统计上相等的情况下,动作专门化的微调也会将多模态表示重新组织到与动作相关的神经计算中。

英文摘要

Understanding how humans and artificial intelligence systems predict and plan by interacting with their environment is a fundamental challenge at the intersection of neuroscience and machine learning. Most brain-encoding studies focus on aligning artificial models with brain activity during language comprehension or passive visual processing, while interactive brain-alignment studies have to date been largely limited to reinforcement-learning (RL) agents and theory-based models. To address this gap, we study brain alignment of representative models from two foundation-model families, namely vision-language models (VLMs) and large-action models (LAMs), using fMRI recordings from participants playing naturalistic Atari-style video games. Specifically, we examine how action-focused and reasoning-focused prompts shape model's internal representations and align with fMRI brain activity. First, we find that both VLMs and LAMs exhibit significantly exhibit voxel-wise encoding performance than RL baselines, with the advantage holding even under matched feature dimensionality. Second, prompt-driven gains scale with the cortical processing hierarchy: the largest improvements appear in frontal-parietal and motor-planning regions, while early visual cortex gains roughly half as much. Third, variance partitioning reveals a qualitatively different representational organization: VLM is prompt-symmetric (12.5% unique action vs. 13.6% unique reasoning), whereas LAM is prompt-asymmetric (27% unique action vs. -5% unique reasoning), with the asymmetry strongest in frontal-motor cortex. Together, these results demonstrate that action-specialized fine-tuning reorganizes multimodal representations toward action-relevant neural computations even when whole-brain prediction accuracy is statistically equivalent between VLM and LAM.

2605.19351 2026-05-20 cs.MA cs.AI cs.CL

PAVE: A Cognitive Architecture for Legitimate Violation in Generative Agent Societies

PAVE:生成代理社会中的合法违规认知架构

Ahmad Yehia, Abduallah Mohamed, Kun Qian, Tianyi Wang, Jiseop Byeon, Omar Hassanin, Christian Claudel

AI总结 本文提出PAVE认知架构,通过四个模块处理生成代理在需要违规的场景中的推理问题,实现了合法违规、对权威的服从、有限的范围和恢复四个特性,同时提高了决策的结构化和可解释性。

Comments Preprint. 23 pages, 4 figures. Code and environment will be released upon publication

详情
AI中文摘要

基于大语言模型的生成代理在合作环境中能够产生可信的人类行为,但在需要违规的场景中,如火灾疏散或受监督的紧急情况,如何推理仍不明确。我们提出PAVE(感知、评估、裁决、模拟),一种新的四模块认知架构,旨在解决这一差距:(i)感知提取一个结构化的上下文,包括明确的权威距离、同伴行为和严重标记的情境线索;(ii)评估在五个标量上评分上下文,包括一个明确的合法性判断,检查必要性、比例性和无替代方案;(iii)裁决在硬合法性门下决定服从或违规,每个代理的阈值从角色中提取;(iv)模拟执行裁决并限制违规到触发所证明的规则。我们将在Voville中实例化PAVE,这是一个从Smallville衍生的基于瓷砖的交通环境,并在三个场景、四个LLM后端和一个聚焦的消融中进行评估。PAVE代理同时满足四个属性:合法违规(只有当触发证明时)、权威服从(军官指令即使高合法性也优先)、有限范围(违规限制在目标规则内)和恢复(触发结束时恢复基准)。PAVE代理在所有四个属性上比vanilla更结构化和可解释,人类评估者认为它们更合理。消融合法性门会重现vanilla-like的失败。我们发布了Voville、PAVE提示和代码以及评估流程。

英文摘要

Generative agents based on large language models reproduce believable human behavior in cooperative settings, but how they should reason in situations where rule-breaking may be required, such as fire evacuation or authority-supervised emergency, remains poorly characterized. We propose PAVE (Perception, Assessment, Verdict, Emulation), a novel four-module cognitive architecture that addresses this gap end to end: (i) Perception extracts a structured context with explicit authority distance, peer behaviors, and severity-tagged situational cues; (ii) Assessment scores the context along five scalars including an explicit legitimacy judgment that checks necessity, proportionality, and absence of alternatives; (iii) Verdict decides to comply or violate under a hard legitimacy gate, with a per-agent threshold elicited from the persona; (iv) Emulation enacts the verdict and scopes the violation to the rule the trigger justifies. We instantiate PAVE in Voville, a tile-based traffic environment forked from Smallville, and evaluate across three scenarios, four LLM backbones, and a focused ablation. PAVE agents satisfy four properties simultaneously: legitimate violation (only when a trigger justifies it), authority deference (officer instructions override even high legitimacy), bounded scope (violations confined to the targeted rule), and recovery (baseline restored once the trigger ends). PAVE agents make more structured and interpretable decisions than vanilla across all four properties, and human evaluators rate them as more plausible. Ablating the legitimacy gate reproduces vanilla-like failures. We release Voville, the PAVE prompts and code, and the evaluation pipeline.

2605.19350 2026-05-20 cs.GR cs.LG

CompoSE: Compositional Synthesis and Editing of 3D Shapes via Part-Aware Control

CompoSE:通过部分感知控制进行3D形状的组合合成与编辑

Habib Slim, Shariq Farooq Bhat, Mohamed Elhoseiny, Yifan Wang, Mike Roberts

AI总结 本文提出CompoSE方法,通过部分感知控制实现3D形状的组合合成与编辑,核心方法是使用扩散变压器架构在局部和全局之间交替处理部分,并通过新颖的条件技术确保对用户输入的强遵循,主要贡献是无需部分级文本提示即可直接从用户粗略布局指导中学习部分语义和对称性。

详情
AI中文摘要

创建和编辑高质量3D内容仍然是计算机图形学中的核心挑战。我们通过引入CompoSE,一种新颖的方法,通过部分感知控制进行3D形状的组合合成与编辑来解决这一挑战。我们的方法以一组粗略的几何基础原始体(例如,包围盒)作为输入,这些原始体代表不同的物体部分并以特定的空间配置排列,输出部分分离的3D对象,支持对单个部分的局部细粒度(即组合式)编辑。使方法可行的关键见解是使用扩散变压器架构,该架构在局部处理每个部分和跨部分全局聚合上下文信息之间交替,并具有新颖的条件技术,确保对用户输入的强遵循。重要的是,我们的方法学会直接从用户粗略布局指导中推断部分语义和对称性,并不需要部分级文本提示。我们证明我们的方法能够实现强大的部分级编辑能力,包括上下文感知的替换、添加、删除和风格保持的缩放操作。通过广泛的实验,我们显示我们的方法在引导合成方面显著优于现有方法,这通过客观指标和基于LLM的评估来衡量。

英文摘要

Creating and editing high-quality 3D content remains a central challenge in computer graphics. We address this challenge by introducing CompoSE, a novel method for Compositional Synthesis and Editing of 3D shapes via part-aware control. Our method takes as input a set of coarse geometric primitives (e.g., bounding boxes) that represent distinct object parts arranged in a particular spatial configuration, and synthesizes as output part-separated 3D objects that support localized granular (i.e., compositional) editing of individual parts. The key insight that enables our method is our use of a diffusion transformer architecture that alternates between processing each part locally and aggregating contextual information across parts globally, and features a novel conditioning technique that ensures strong adherence to the user's input. Importantly, our method learns to infer part semantics and symmetries directly from the user's coarse layout guidance, and does not require part-level text prompts. We demonstrate that our method enables powerful part-level editing capabilities, including context-aware substitution, addition, deletion, and style-preserving resizing operations. We show through extensive experiments that our method significantly outperforms existing approaches on guided synthesis, as measured by objective metrics and LLM-based evaluations.

2605.19338 2026-05-20 cs.MA cs.AI cs.CL

STAR-PólyaMath: Multi-Agent Reasoning under Persistent Meta-Strategic Supervision

STAR-PólyaMath: 多智能体在持久元策略监督下的推理

Jiaao Wu, Xian Zhang, Hanzhang Liu, Sophia Zhang, Fan Yang, Yinpeng Dong

AI总结 本文提出STAR-PólyaMath多智能体框架,通过元级监督和结构化的推理-验证交互系统性解决数学推理中的幻觉积累、记忆碎片化和推理工具平衡问题,并在多个顶级竞赛基准上取得最佳成绩。

Comments 25 pages, 4 figures. Code: https://github.com/Julius-Woo/STAR-PolyaMath

详情
AI中文摘要

前沿AI模型和多智能体系统在数学推理方面取得了显著进步。然而,对于需要扩展、长周期推理的问题,现有系统仍然存在根本性可靠性问题:幻觉积累、记忆碎片化以及推理工具之间的不平衡。在本文中,我们引入了STAR-PólyaMath,一个通过元级监督和结构化的推理-验证交互来系统性解决这些挑战的多智能体框架。STAR-PólyaMath被构造成一个由Python orchestrator控制的协同状态机,包含嵌套的挑战-步骤-重计划循环,该orchestrator通过分离控制与推理并利用回溯和重计划来限制误差传播。我们的关键创新是一个持续的元策略师,它通过发布高层战略指导或强制指令来维护跨尝试的记忆并执行元级控制,使系统能够逃离无生产力的循环,而不是停滞或过度依赖工具。STAR-PólyaMath在所有八个顶级竞赛基准上取得了最先进的结果:AIME 2025-2026、MathArena Apex Shortlist、MathArena Apex 2025、Putnam 2025、IMO 2025、HMMT February 2026和USAMO 2026。它在AIME、Putnam和HMMT上获得满分,并在Apex 2025上表现出最大的优势,得分93.75%相比最强基线GPT-5.5的80.21%。消融研究显示,收益来自框架的协调而非模型级多样性,因为移除关键组件或替换为混合backbone会一致削弱性能。代码可在https://github.com/Julius-Woo/STAR-PolyaMath获取。

英文摘要

Frontier AI models and multi-agent systems have led to significant improvements in mathematical reasoning. However, for problems requiring extended, long-horizon reasoning, existing systems continue to suffer from fundamental reliability issues: hallucination accumulation, memory fragmentation, and imbalanced reasoning-tool trade-offs. In this paper, we introduce STAR-PólyaMath, a multi-agent framework that systematically addresses these challenges through meta-level supervision and structured Reasoner-Verifier interaction. STAR-PólyaMath is structured as an orchestrated state machine with nested challenge-step-replan loops, governed by a reasoning-free Python orchestrator that separates control from inference and bounds error propagation through trace-back and re-planning. Our key innovation is a persistent Meta-Strategist that maintains cross-attempt memory and exercises meta-level control by issuing high-level strategic guidance or mandatory directives, so the system can escape unproductive loops rather than stagnate or over-rely on tools. STAR-PólyaMath achieves state-of-the-art results on all eight top-tier competition benchmarks: AIME 2025-2026, MathArena Apex Shortlist, MathArena Apex 2025, Putnam 2025, IMO 2025, HMMT February 2026, and USAMO 2026. It obtains perfect scores on AIMEs, Putnam, and HMMT, and shows its largest margin on Apex 2025, scoring 93.75% compared with 80.21% by the strongest baseline GPT-5.5. Ablation studies show that the gains arise from the framework's orchestration rather than from model-level diversity since removing key components or substituting in mixed backbones consistently weakens performance. Code is available at https://github.com/Julius-Woo/STAR-PolyaMath.

2605.19328 2026-05-20 cs.CR cs.RO

RoboJailBench: Benchmarking Adversarial Attacks and Defenses in Embodied Robotic Agents

RoboJailBench: 对具身体验机器人代理中对抗攻击和防御的基准测试

Doguhuan Yeke, Yanming Zhou, Leo Y. Lin, Hongyu Cai, Antonio Bianchi, Z. Berkay Celik

AI总结 本文提出RoboJailBench,通过建立安全分类学、引入意图对比数据集管道以及提供一个演进的存储库,为具身体验人工智能中的对抗攻击和防御提供了标准化评估框架,同时构建了一个新的分类平衡数据集并增强了五个现有数据集。

详情
AI中文摘要

最近在视觉-语言模型(VLMs)上的进展促进了新的具身体验人工智能系统类别,其中这些模型被集成到物理平台中,例如机器人和自动驾驶车辆,以在多样环境中解释视觉场景并执行自然语言命令。先前的研究已经引入了针对具身体验人工智能的劫持攻击和防御。然而,其评估却依赖于随意的数据集、有限的指标,并强调攻击成功率,而忽略了安全性和执行良性命令能力之间的权衡。现有的基准和评估框架要么针对传统的聊天式模型,要么专注于非对抗性安全评估;既没有捕捉到具身体验人工智能系统中劫持攻击所需的输入、后果和评估标准。在本文中,我们通过RoboJailBench填补这一空白,其包含三个核心组件。我们基于ISO标准、监管规则和记录的事件建立了安全分类学,这一努力产生了18类具身体验人工智能的安全违规后果。我们引入了一个意图对比数据集管道,通过配对对抗性和良性目标来增强现有数据集,以衡量安全性和实用性。最后,我们提供了一个演进的存储库,包含标准化指标和统一的评估和整合新攻击和防御的流程。通过这个基准,我们构建了一个新的分类平衡数据集并增强了五个现有数据集。我们整合了四种攻击和两种防御,以在领先的具身体验VLMs上评估其性能。这个基准为具身体验人工智能中的劫持攻击提供了第一个标准化评估框架,并支持未来研究。我们发布了我们的代码、数据集和成果,并在https://purseclab.github.io/benchmark-for-robotics-security维护了一个排行榜。

英文摘要

Recent advances in Vision-Language Models (VLMs) facilitate a new class of embodied AI systems, where these models are integrated into physical platforms, e.g. robots and autonomous vehicles, to interpret visual scenes and execute natural language commands in diverse environments. Previous research has introduced jailbreak attacks and defenses for embodied AI. Their evaluations, however, rely on ad-hoc datasets, limited metrics, and emphasize attack success while neglecting the trade-off between security and the ability to follow benign commands. Existing benchmarks and evaluation frameworks either target traditional chat-based models or focus on non-adversarial safety evaluation for embodied AI; neither captures the adversarial risks, inputs, consequences, and evaluation criteria necessary for jailbreak attacks in embodied AI systems. In this paper, we address this gap with RoboJailBench, which consists of three core components. We establish a security taxonomy derived from ISO standards, regulatory rules, and documented incidents. This effort yields 18 categories of security violation consequences for embodied AI. We introduce an intent contrast dataset pipeline that augments existing datasets with paired adversarial and benign goals to measure both security and utility. Lastly, we provide an evolving repository with standardized metrics and a unified process for assessing and integrating new attacks and defenses. With this benchmark, we construct a new taxonomy-balanced dataset and augment five existing datasets. We integrate four attacks and two defenses to evaluate their performance on leading embodied VLMs. This benchmark provides the first standardized evaluation framework for jailbreak attacks in embodied AI and supports future research. We release our code, datasets, and artifacts, and maintain a leaderboard at https://purseclab.github.io/benchmark-for-robotics-security.

2605.19321 2026-05-20 cs.CR cs.AI

Exploring and Developing a Pre-Model Safeguard with Draft Models

探索和开发预模型安全防护机制

Hongyu Cai, Arjun Arunasalam, Yiming Liang, Antonio Bianchi, Z. Berkay Celik

AI总结 本文研究了如何通过利用 jailbreak 攻击的可转移性,在目标模型推理前确保提示的安全性,提出了一种新的安全防护设计,减少了预模型防护的误报率,并提供了一种低开销的替代方案。

详情
Journal ref
ACM Conference on AI and Agentic Systems (ACM CAIS 2026)
AI中文摘要

Large Language Model (LLM) 对齐仍然容易受到 jailbreak 攻击的影响,这些攻击会引发不安全的响应,推动了预模型和后模型防护的发展。预模型防护在调用目标模型前审计提示的安全性。然而,仅依赖提示往往导致高误报率(即 jailbreak 攻击未被检测到)。后模型防护通过审计用户提示和目标模型的响应来解决这个问题,但它们会带来较高的计算成本,包括增加的 token 使用和处理时间,因为它们在目标模型推理之后运行。在本文中,我们介绍了一种安全防护设计,利用 jailbreak 攻击的可转移性来在目标模型推理前强制提示的安全性。我们首先对 jailbreak 可转移性进行了系统研究,特别是从 LLM 到小型语言模型 (SLM) 的转移。通过这些实验,我们识别了影响可转移性的关键因素。基于这些见解,我们观察到较小的草稿模型的响应反映了大型目标模型的安全性影响;即给定一个为 LLM 构建的 jailbreak 提示,SLM 很可能被触发以生成不一致的响应。基于这一观察,我们的安全防护设计利用 SLM 进行推测推理生成一组草稿响应。然后,它将原始提示和这些草稿输入现有的防护措施以预测其安全性。我们证明这种设计减少了预模型防护的误报率,并提供了一种低效率的替代方案给后模型防护。注意:本文包含有害语言的例子。

英文摘要

Large Language Model (LLM) alignment remains vulnerable to jailbreak attacks that elicit unsafe responses, motivating pre-model and post-model guards. Pre-model guards audit the safety of prompts before invoking target models. However, relying solely on the prompt often leads to high false-negative rates (i.e., jailbreak attacks go undetected). Post-model guards address this issue by auditing both the user prompt and the target model's response. However, they incur a high computational cost, including increased token usage and processing time, because they operate after target model inference. In this paper, we introduce a safeguard design that leverages the transferability of jailbreak attacks to enforce prompt safety before target model inference. We first conduct a systematic study of jailbreak transferability, particularly from LLMs to small language models (SLMs). Through these experiments, we identify key factors influencing transferability. Building on these insights, we observe that responses from smaller draft models reflect the safety implications of those from large target models; \ie given a jailbreak prompt constructed for an LLM, an SLM is likely to be triggered to generate an unaligned response. Based on this observation, our safeguard design leverages speculative inference with SLMs to generate a set of draft responses. It then feeds the original prompt and these drafts into existing guards to predict their safety. We demonstrate that this design reduces the false-negative rate of pre-model guards and offers a low \Efficiency alternative to post-model guards. \textcolor{red}{\bf Notice: This paper contains examples of harmful language.}

2605.19313 2026-05-20 stat.ML cs.LG stat.ME

A Unified Framework for Structure-Aware Clustering and Heterogeneous Causal Graph Learning

一种用于结构感知聚类和异质因果图学习的统一框架

Honglin Du, Muxuan Liang, Xiang Zhong

AI总结 本文提出了一种基于有向无环图的依赖聚类方法,通过交替方向乘子法解决结构异质性问题,实现对子群体依赖结构的鲁棒发现。

详情
AI中文摘要

在复杂的多变量系统中,变量间的相互作用由依赖结构定义,通常编码为有向无环图(DAGs)。然而,依赖结构可能在不同个体间变化,忽略这种结构异质性会引入偏差并掩盖子群体特定的依赖关系。为此,我们提出了一种基于有向无环图的依赖聚类方法,通过交替方向乘子法(ADMM)解决结构异质性问题,构建在结构方程模型(SEM)之上,联合学习聚类分配和子群体特定的依赖结构。我们通过平滑约束编码无环性,并整合一个组内截断Lasso融合惩罚(gTLP)以根据结构相似性聚类个体。这产生了一个非凸优化问题,结合稀疏性、无环性和结构一致性约束。我们通过增广拉格朗日方法解决非凸性,并使用适应的交替方向乘子法(ADMM)求解差分凸程序。对于某些图结构,如上三角邻接矩阵,我们的算法保证能收敛到KKT点。实验表明,我们的方法能够以高真阳性率和低假发现率恢复子群体特定的因果依赖结构。这种能力使我们能够在子群体标签未知的情况下,鲁棒地发现跨个体的异质依赖关系。

英文摘要

In complex multivariate systems, interactions among variables are defined by dependency structures, often encoded as directed acyclic graphs ($\text{DAGs}$). However, dependency structures can vary across subjects, and ignoring this structural heterogeneity introduces bias and obscures subpopulation-specific dependencies. To address this, we propose Directed Acyclic Graph-based Dependency Clustering via Alternating Direction Method of Multipliers (DAG-DC-ADMM), a unified framework built upon Structural Equation Modeling (SEM) that jointly learns cluster assignments and cluster-specific dependency structures. We encode acyclicity via a smooth constraint and integrate a groupwise truncated Lasso fusion penalty (gTLP) to cluster subjects based on their structural similarity. This yields a nonconvex optimization problem that incorporates sparsity, acyclicity, and structural consensus constraints. We address the nonconvexity by using the augmented Lagrangian method and solve it with an adapted version of the Alternating Direction Method of Multipliers (ADMM) for difference-of-convex programs. For certain graph structures, such as upper triangular adjacency matrices, our algorithm is guaranteed to converge to a Karush-Kuhn-Tucker (KKT) point. Experiments demonstrate that our method recovers cluster-specific causal dependency structures with a high true positive rate and a low false discovery rate. This capability enables the robust discovery of heterogeneous dependencies across subjects where the subpopulation label is unknown.

2605.19305 2026-05-20 cs.GR cs.CV cs.LG

Matérn Noise for Triangulation-Agnostic Flow Matching on Meshes

Matérn噪声用于三角化无关的网格上流匹配

Tianshu Kuai, Arman Maesumi, Daniel Ritchie, Noam Aigerman

AI总结 本文提出了一种三角化无关的流匹配方法,通过Matérn过程生成网格信号,实现高效且高质量的网格生成。

Comments In ACM Transactions on Graphics (SIGGRAPH 2026). Project page: https://matern-fm.github.io/

详情
AI中文摘要

本文针对在三角网格上学习生成信号的任务,提出了三角化无关的流匹配方法。理论部分提出了一种三角化无关的噪声分布,用于流匹配模型的去噪过程。通过数学定义了分布的三角化无关性,证明了Matérn过程的离散化具有所需性质,并提供了一种高效的采样算法。使用该噪声模型,并结合PoissonNet作为去噪器,实现了三角化无关的流匹配。实验显示,该方法在超过一百万三角形的网格上能够生成高质量和多样化的结果,显著超越了现有最佳水平。

英文摘要

This paper tackles the task of learning to generate signals over triangle meshes in a triangulation-agnostic manner, meaning the trained model can be applied to different meshes and triangulations effectively. Practically, the paper adapts the flow matching (FM) paradigm to a mesh-based, triangulation-agnostic setting. Theoretically, it proposes a specific noise distribution which is triangulation agnostic, to be used inside the FM model's denoising process. While noise distributions are usually trivial to devise for, e.g., images, devising a triangulation-agnostic distribution proves to be a much more difficult task. We formulate a mathematical definition of triangulation agnosticism of distributions, via their spectrum. We then show that a discretization of a specific Gaussian random field called a Matérn process holds these desired properties, and provides a simple and efficient sampling algorithm. We use it as our noise model, and adapt FM to the triangulation-agnostic setting by using a state-of-the-art approach for learning signals on meshes in the gradient domain -- PoissonNet -- as the denoiser. We conduct experiments on elaborate tasks such as sampling elastic rest states, and generating poses of humanoids. Our method is shown to be capable of producing highly realistic results for meshes of over one million triangles, significantly exceeding the state-of-the-art in quality and diversity.

2605.19293 2026-05-20 cs.IT cs.LG cs.RO math.IT

Domain-Adaptive Communication-Rate Optimization for Sim-to-Real Humanoid-Robot Wireless XR Teleoperation

领域自适应的通信速率优化用于仿真到现实的人形机器人无线XR远程操作

Caolu Xu, Zhiyong Chen, Meixia Tao, Li Song, Feng Yang, Wenjun Zhang

AI总结 本文提出了一种领域自适应的通信速率优化方法,通过在仿真到现实的分布偏移中平衡重建误差和通信能耗,利用PAC-Bayes泛化特性分析和密度比加权的PPO方法,结合离线真实域数据校正,以提高人形机器人无线XR远程操作的通信效率和重建精度。

Comments submitted to IEEE journal

详情
AI中文摘要

无线扩展现实(XR)远程操作为收集人形机器人演示提供了具身交互能力,但大规模应用受到高频运动传输开销的限制。本文开发了一个系统框架,集成了采样、传输、插值和重建,并制定了通信速率优化,旨在通过维度采样率控制最小化通信能耗,同时保持机器人运动轨迹的重建精度。由于从物理机器人获取实时反馈受限于硬件成本,必须通过与离线真实域数据校正的仿真交互来解决问题。为了指导仿真到现实的适应,我们提供了一种PAC-Bayes泛化特性刻画,揭示了潜在密度比估计、有限样本偏差和编码器偏差的影响。基于此分析,我们提出了一种具有密度比加权和信任区域正则化的近端策略优化(PPO)方法。在公共人形远程操作数据集上的实验表明,所提出的方法在仿真到现实分布偏移中改善了重建误差和通信能耗之间的权衡。我们进一步分析了所提出算法在各种无线信道和动态运动轨迹中的有效性。

英文摘要

Wireless extended reality (XR) teleoperation provides embodied interaction capability for collecting humanoid robot demonstrations, but the large-scale adoption is restricted by the overhead of high-frequency motion transmission. This paper develops a system framework that integrates sampling, transmission, interpolation, and reconstruction and formulates a communication-rate optimization that aims to minimize the communication energy while maintaining the reconstruction accuracy of robot motion trajectories through dimension-wise sampling-rate control. Since acquiring real-time feedback from physical robots is limited by hardware costs, it is necessary to solve the problem through simulator interaction with offline real-domain data correction. To guide sim-to-real adaptation, we provide a PAC-Bayes generalization characterization that reveals the effects of latent density-ratio estimation, finite-sample deviation, and encoder bias. Building on this analysis, we propose a proximal policy optimization (PPO) method with density-ratio weighting and trust-region regularization. Experiments on public humanoid teleoperation dataset show that the proposed method improves the tradeoff between reconstruction error and communication energy consumption under sim-to-real distribution shift. We further analyze the effectiveness of the proposed algorithm across various wireless channels and dynamic motion trajectories.

2605.19291 2026-05-20 stat.ML cs.LG math.ST stat.TH

Factor Augmented High-Dimensional SGD

因子增强的高维SGD

Shubo Li, Yuefeng Han, Xiufan Yu

AI总结 本文提出了一种新的优化方法Factor-Augmented SGD (FSGD),通过利用高维学习任务中的潜在因子表示,解决了传统两阶段降维方法在数据存储和在线学习中的限制,并建立了首个将潜在因子估计误差纳入SGD分析的理论框架,提供了在衰减步长和小批量更新下的$\ell^s$范数矩收敛性。

详情
AI中文摘要

随机梯度下降(SGD)是现代机器学习中广泛使用的基础优化算法。在本文中,我们提出Factor-Augmented SGD(FSGD),一种新的优化方法,利用高维学习任务中的潜在因子表示。与依赖于离线表示学习和完整数据存储的传统两阶段降维方法不同,FSGD的关键创新在于它完全在流数据上操作,使其能够扩展到大规模和高维问题。此外,我们建立了首个明确将潜在因子估计误差纳入SGD分析的理论框架,并在衰减步长和小批量更新下提供了$\ell^s$范数的矩收敛性。我们的结果为在高维机器学习系统中可靠和可扩展地使用SGD提供了新的基础。

英文摘要

Stochastic gradient descent (SGD) is a fundamental optimization algorithm widely used in modern machine learning. In this paper, we propose Factor-Augmented SGD (FSGD), a new optimization method that leverages latent factor representations in high-dimensional learning tasks. Unlike standard two-stage dimension reduction approaches that rely on offline representation learning and full data storage, a key novelty of FSGD is that it operates purely on streaming data, making it scalable to large-scale and high-dimensional problems. Furthermore, we establish the first theoretical framework that explicitly incorporates latent factor estimation error into the analysis of SGD, and provide moment convergence in $\ell^s$ norm under decaying step sizes and mini-batch updates. Our results provide a new foundation for employing SGD reliably and scalably in high-dimensional machine learning systems.

2605.19261 2026-05-20 cs.SE cs.AI cs.HC cs.PL

When Web Apps Heal Themselves: A MAPE-K Based Approach to Fault Tolerance and Adaptive Recovery

当Web应用自我修复:基于MAPE-K模型的故障容忍与自适应恢复方法

Sales Aribe, Rov Japheth Oracion

AI总结 本文提出一种基于MAPE-K模型的模块化自我修复框架,结合AutoFix机制实现自适应故障恢复,通过实验验证该框架在故障检测和恢复中的有效性,提高了Web应用的容错性和适应性。

Comments 12 pages, 3 figures, 2 tables

详情
Journal ref
Aribe, Sales G. & Oracion, R. J. G. (2026). When web apps heal themselves- A MAPE-K based approach to fault tolerance and adaptive recovery. International Journal of Informatics and Communication Technology, 15(2), 729-740
AI中文摘要

确保现代Web应用的可靠性和韧性仍然是一个关键挑战,由于系统复杂性和动态运行环境的增加。本研究提出了一种基于共享知识库的监控-分析-计划-执行(MAPE-K)模型的模块化自我修复框架,并整合了受AutoFix启发的自适应故障恢复机制。通过设计和开发研究(DDR)方法,该系统在二十种运行故障场景中进行了实施和评估,包括服务崩溃、内存泄漏和数据库断开。实验结果表明,所提出的框架实现了平均故障检测F1得分为90.7%,恢复成功率为93.2%。AutoFix模块将平均恢复时间(TTR)减少了56.2%,实现了平均恢复时间为3.92秒。系统吞吐量在故障条件下保持在88%至95%之间,响应时间仅增加了3.1%。此外,迭代反馈机制通过多个循环提高了恢复效率18.6%。这些发现表明,所提出的框架通过反馈驱动的适应性提供了一种实用且可扩展的方法,以通过反馈驱动的适应性增强Web应用的容错性。尽管当前实现依赖于预定义的恢复策略,但学习导向的反馈为未来更自主的自我修复系统的开发奠定了基础。

英文摘要

Ensuring the reliability and resilience of modern web applications remains a critical challenge due to increasing system complexity and dynamic runtime environments. This study proposes a modular self-healing framework based on the monitor-analyze-plan-execute over a shared knowledge base (MAPE-K) model, integrated with an AutoFix-inspired mechanism for adaptive fault recovery. Using a design and development research (DDR) approach, the system was implemented and evaluated through controlled fault injection experiments across twenty runtime failure scenarios, including service crashes, memory leaks, and database disconnections. Experimental results demonstrate that the proposed framework achieved a mean fault detection F1-score of 90.7% and a recovery success rate of 93.2%. The AutoFix module reduced the average time-to-recovery (TTR) by 56.2%, achieving an average recovery time of 3.92 seconds. System throughput was maintained between 88% and 95% during fault conditions, with only a 3.1% increase in response time. Additionally, iterative feedback mechanisms improved recovery efficiency by 18.6% over multiple cycles. These findings indicate that the proposed framework provides a practical and extensible approach to enhancing fault tolerance in web applications through feedback-driven adaptation. While the current implementation relies on predefined recovery strategies, the integration of learning-oriented feedback establishes a foundation for future development of more autonomous self-healing systems.

2605.19227 2026-05-20 cs.CR cs.AI

Token by Token, Compromised: Backdoor Vulnerabilities in Unified Autoregressive Models

逐token被入侵:统一自回归模型中的后门漏洞

Tobias Braun, Jonas Henry Grebe, Hossein Shakibania, Anna Rohrbach, Marcus Rohrbach

AI总结 本文研究了统一自回归模型中的后门漏洞问题,提出了一种名为Token by Token Backdoor Attack (ToBAC)的攻击方法,展示了如何通过数据和模型污染策略在多模态生成中引发有害行为。

详情
AI中文摘要

统一自回归模型(UAMs)是变压器模型,能够在单次自回归传递中生成文本和图像标记。共享参数和多模态词汇简化了训练流程并促进了灵活的多模态生成,但可能会引入新的漏洞。特别是,我们首次证明这种统一架构使多模态后门攻击成为可能,其中触发器可以跨多个输出模态传播恶意影响。具体而言,我们提出了Token by Token Backdoor Attack(ToBAC),这是首个针对UAMs的后门攻击,探索了基于数据和基于模型的污染策略。我们展示了无害的字符或甚至常见的单词可以被转换为触发器,从而在自回归图像生成中引发有害行为。ToBAC可以联合操控视觉输出和伴随文本,增加伪造内容的感知真实性。通过模型访问,ToBAC可以在统一的液体模型中发起攻击,其中微妙的词(例如,``cool'')在55%的生成中导致模态对齐的品牌推广或意识形态影响。在没有模型访问的情况下,ToBAC可以通过数据污染诱导,对JanusPro实现平均成功率为63.1%。

英文摘要

Unified autoregressive models (UAMs) are transformer models that generate text as well as image tokens within a single autoregressive pass. Shared parameters and a multimodal vocabulary simplify the training pipeline and facilitate flexible multimodal generation, yet might introduce new vulnerabilities. In particular, we are the first to show that this unified architecture enables multimodal backdoor attacks, where a trigger can propagate malicious effects across multiple output modalities. Specifically, we present the Token by Token Backdoor Attack (ToBAC), the first backdoor attack targeting UAMs, exploring both data-based and model-based poisoning strategies. We demonstrate that innocuous characters or even common words can be transformed into triggers that elicit harmful behavior in autoregressive image generation. ToBAC can jointly manipulate visual outputs and accompanying text, increasing the perceived authenticity of fabricated content. With model access, ToBAC enables attacks on the unified Liquid model in which a subtle word (e.g., ``cool'') induces modality-aligned brand promotion or ideological influence in 55% of generations. Without model access, ToBAC can be induced through data poisoning, achieving an average success rate of 63.1% against JanusPro.

2605.19208 2026-05-20 stat.AP cs.LG stat.ML

Precision Physical Activity Prescription via Reinforcement Learning for Functional Actions

通过强化学习实现功能动作的精准体育活动处方

Gefei Lin, Rui Miao, Jennifer Sacheck, Xiaoke Zhang

AI总结 本文提出了一种基于强化学习的算法,用于根据心血管代谢风险个性化优化每日步数分布,通过All of Us研究数据验证了该方法在提高健康生物标志物方面的有效性。

详情
AI中文摘要

体育活动(PA)在维持和改善健康方面起着重要作用。日常步数已成为一种关键的PA测量指标,可通过常见的可穿戴设备轻松获取。然而,缺乏推荐个性化最优每日步数分布的方法以最佳改善某些健康生物标志物。本文基于All of Us研究数据,该数据包括数月的步数计数以及关键健康生物标志物的重复测量,开发了一种新的离线强化学习(RL)算法,以学习与心血管代谢风险相关的个性化和最优PA分布,其中动作是一个函数,表示一段时间内每日步数分布。模拟研究显示,所提出的方法在现有连续动作RL方法中具有优势。从All of Us数据中学习到的最优策略通常建议人们增加日常步数,并在时间上遵循更一致的PA模式,同时为血糖水平、体重指数、血压、年龄和性别等亚组提供定制推荐。

英文摘要

Physical activity (PA) plays an important role in maintaining and improving health. Daily steps have been a key PA measure that is easily accessible with common wearable devices. However, methods are lacking to recommend a personalized optimal distribution of daily steps over a period of time for the best of certain health biomarkers. In this paper, we fill this void based on the data from the All of Us Research Program which includes months of step counts as well as repeated measurements of key health biomarkers. We develop a new offline reinforcement learning (RL) algorithm to learn personalized and optimal PA distributions associated with cardiometabolic risk, where the action is a function representing the daily step distribution over a period of time. Simulation studies demonstrate the advantage of the proposed approach over existing continuous-action RL methods. The learned optimal policy from the All of Us data generally suggests people take more daily steps and also follow a more consistent pattern of PA over time while offering tailored recommendations for subgroups in blood glucose level, body mass index, blood pressure, age, and sex.

2605.19190 2026-05-20 cs.CY cs.AI cs.HC

Going PLACES: Participatory Localized Red Teaming for Text-to-Image Safety in the Global South

Going PLACES: 参与式本地化红队测试用于全球南方的文本到图像安全

Charvi Rastogi, Mukul Bhutani, Minsuk Kahng, Shamsuddeen Hassan Muhammad, Evgeniia Razumovskaia, Priyanka Suresh, Ibrahim Said Ahmad, Charu Kalia, Yaaseen Mahomed, Madhurima Maji, Minjae Lee, Alicia Parrish, Jessica Quaye, Vijay Janapa Reddi, Aishwarya Verma, Lora Aroyo

AI总结 本文提出PLACES数据集,通过在非洲和亚洲的本地社区进行参与式红队测试,收集了26000多个文本到图像模型失败案例,揭示了全球南方在文化和社会规范方面的独特对抗模式和安全框架的结构性缺失。

Comments Published at ACM Conference on FAccT 2026

详情
AI中文摘要

尽管文本到图像(T2I)模型已在全球范围内部署,但其安全框架大多基于西方默认设置,这为其他地区带来了显著的漏洞。为了拥抱文化多元主义并引入历史上代表性不足的视角,我们在全球南方进行了本地化的社区中心红队测试研究。我们的双重视角优先考虑本地化和参与,通过关注这些地区的次级城市中心,并开展社区参与和培训研讨会,以 contextualize 本地规范。结果,我们提出了PLACES数据集,其中包括与加纳、尼日利亚以及印度两个地区(卡纳塔克和旁遮普)的大学合作收集的超过26,000个T2I模型失败示例。分析收集的提示揭示了与现有地理无关的众包红队测试数据相比,社会文化和语言属性的广泛多样性。我们观察到由本地文化和语言细微差别所启用的独特对抗模式,以及在地区内围绕特定主题(如印度的宗教)的明显聚类。此外,我们通过识别新的危害,揭示了现有安全框架的结构性缺失,这些危害表现出规范不一致(例如,违反宗教规范、忽视本地习俗和 ominous 的象征意义)。这项工作认为,扩展T2I安全需要超越单纯的规模,转而采用深入本地化和参与性的数据收集和情境化方法。内容警示:本文包含可能有害或冒犯性内容的示例。

英文摘要

Despite the global deployment of text-to-image (T2I) models, their safety frameworks are largely calibrated to a Western-centric default, creating significant vulnerabilities for the rest of the world. To embrace cultural pluralism and bring historically under-represented perspectives in T2I safety, we conduct localised community-centered red teaming studies in the Global South. Our two-fold approach prioritizes localization and participation, by focusing on secondary urban centers in these regions, and conducting community engagement and training workshops to contextualize local norms. As a result, we present PLACES, a dataset comprising over 26,000 examples of T2I model failures collected in partnership with universities in Ghana, Nigeria, and two regions of India (Karnataka and Punjab). Analysis of prompts collected reveals a wide-ranging diversity in socio-cultural and linguistic attributes, when compared to existing geography-agnostic crowdsourced red-teaming data. We observe unique adversarial patterns enabled by local cultural and linguistic nuances, and distinct clusters within region around specific themes, such as religion in India. Moreover, we uncover structural contextual gaps in existing safety frameworks by identifying novel harms showing normative dissonance (e.g., violating religious norms, ignoring local customs, and ominous symbolism). This work argues that expanding T2I safety requires moving beyond mere scale to incorporate deeply localised, participatory methodologies for data collection and contextualization. Content warning: This paper includes examples containing potentially harmful or offensive content.

2605.19179 2026-05-20 astro-ph.EP astro-ph.IM cs.LG

A Cloud-Based Tool for Meteorite Recovery Using Drones and Machine Learning

基于云技术的陨石回收工具:利用无人机和机器学习

Seamus L. Anderson, Hadrien A. R. Devillepoix, Lewis Lakerink, Sawitchaya Tippaya, Dale P. Giancono, Martin C. Towner, Iona Clemente, Martin Cupák, Ashley F. Rogers, John H. Fairweather, Mia Walker, Daniel Burgin, Michael A. Frazer, Sophie E. Deam, Veronika Pazderová, Eleanor K. Sansom, Benjamin A. D. Hartig, Hely C. Branco, Thomas Stevenson, Isabella Hatty, Anna Zappatini, Anthony Lagain, Tom Lovelock, Auriane Egal, Lucy Forman, David Belton, Simon Windsor, Shibli Saleheen, Asher Leslie, Gregory B. Poole, Andrew Langendam, Rachel S. Kirby, Andrew G. Tomkins

AI总结 本文提出一种基于云技术的工具,利用无人机和机器学习帮助恢复通过仪器观测到的陨石坠落。该工具展示了系统迭代改进的成果,并详细说明了该技术在澳大利亚南部和西海岸陨石坠落中的成功与局限性。

Comments 23 pages, 3 figures

详情
AI中文摘要

我们提出了一种基于云技术的工具,利用无人机和机器学习来帮助恢复通过仪器观测到的陨石坠落。我们展示了一 series of improvements made upon previous iterations of our system, as well as detail the successes and limitations of this technique when applied to observed meteorite falls in South and Western Australia. This tool is available to the meteoritics research community upon request at https://find.gfo.rocks.

英文摘要

We present a cloud-based tool that uses drones and machine learning to help recover instrumentally observed meteorite falls. We showcase a collection of improvements made upon previous iterations of our system, as well as detail the successes and limitations of this technique when applied to observed meteorite falls in South and Western Australia. This tool is available to the meteoritics research community upon request at https://find.gfo.rocks.

2605.19178 2026-05-20 cond-mat.dis-nn cond-mat.stat-mech cs.LG physics.data-an

Activation Functions, Statistics and Learning of Higher-Order Interactions in Restricted Boltzmann Machines

激活函数、统计学和受限玻尔兹曼机中高阶相互作用的学习

Giovanni di Sarra, Yasser Roudi

AI总结 本文研究了受限玻尔兹曼机中激活函数对高阶相互作用统计学和学习的影响,分析了四种常见激活函数在不同参数范围内的表示和学习能力。

Comments 38 pages, 27 figures

详情
AI中文摘要

神经网络在复杂数据中识别隐藏模式和相关性的巨大成功,归功于它们利用大量参数和非线性单单元激活函数的方式。受限玻尔兹曼机(RBMs)提供了一个简单而强大的框架,用于研究激活非线性对性能和表示的影响。在本工作中,我们利用RBMs与相互作用二元变量模型之间的双重性,研究了不同隐藏单元激活函数的RBM集合所诱导的相互作用的统计学。我们以四种常用激活函数(线性、阶跃、ReLU和指数)的诱导相互作用分布的矩来分析可表示模型的空间。对学习的定量预测与训练过程模拟的结果有很好的一致。特别是,我们的分析表明,某些数据结构,即由具有大相互作用项的相互作用变量模型生成的结构,对于任何RBM来说都难以表示和学习。然而,我们发现快速增加的非线性,如指数函数,可以促进特定参数范围内的此类数据结构的表示和学习。

英文摘要

The great success of neural networks in recognizing hidden patterns and correlations in complex data lies in the way they take advantage of the large number of parameters and nonlinear single-unit activation, jointly. Restricted Boltzmann Machines (RBMs) provide a simple yet powerful framework for studying the impact of activation nonlinearities on performance and representation. In this work, we exploit the duality between RBMs and models of interacting binary variables to study the statistics of the interactions induced by RBM ensembles with different hidden unit activation functions. We characterize the space of representable models analytically in terms of moments of the distribution of induced interactions for four commonly used activation functions: Linear, Step, ReLU, and Exponential. Quantitative predictions of the analytical calculations on learning show a very good agreement with results of the simulations of the training process. In particular, our analysis shows that there are certain data structures, namely those generated by models of interacting variables with large interaction terms beyond pairwise, that are difficult to represent, and thus to learn, for any RBM. Yet, we find that rapidly increasing nonlinearities, such as the Exponential function, can facilitate the representation and learning of such data structures for a specific range of parameters that is determined analytically.

2605.19147 2026-05-20 cs.CR cs.AI cs.LG

Be Kind, Rewrite: Benign Projections via Rewriting Defend Against LLM Data Poisoning Attacks

仁者重写:通过重写实现良性投影以防御大语言模型数据中毒攻击

John T. Halloran, Noopur S. Bhatt

AI总结 本文提出了一种基于重写的良性投影方法(OBBR),通过利用开放书本的良性样本来提高大语言模型对数据中毒攻击的防御能力,实验表明OBBR在多种已知攻击模式中表现出更高的安全性能,并且在计算效率和模型性能方面具有优势。

Comments 15 pages, 2 Figures, 5 Tables

详情
AI中文摘要

大型语言模型(LLMs)对后门攻击(BAs)非常敏感,其中训练样本通过基于触发器的有害内容进行中毒。此外,现有防御措施在广泛测试不同BA模式时已被证明无效。为了更好地对抗BAs,我们探索了使用LLM重写作为主动防御数据中毒的方法。首先,我们理论证明,当LLM重写利用开放书本良性样本(称为开放书本良性重写,OBBR)时,重写输出为良性的概率严格大于闭合书本重写。因此,OBBR通过将训练样本投影到良性提示空间来中和有害内容。我们随后表明,与以往的防御措施不同,OBBR有效缓解了大量现有的BAs:在五种已知BAs和四种广泛使用的LLMs中,OBBR相比最先进的BA防御措施平均提高了51%的安全性能,相比闭合书本重写方法提高了25.7%。最后,我们表明OBBR在计算效率方面优于其他BA防御措施,经过微调后不会降低模型在自然语言任务上的性能,并且能够防御非触发基于的数据中毒攻击。

英文摘要

Large language models (LLMs) are highly susceptible to backdoor attacks (BAs), wherein training samples are poisoned using trigger-based harmful content. Furthermore, existing defenses have proven ineffective when extensively tested across BA patterns. To better combat BAs, we explore the use of LLM rewriting as a proactive defense against data poisoning. First, we theoretically show that when LLM rewriting utilizes open-book benign samples--termed open-book benign rewriting (OBBR)--the probability of a rewritten output being benign is strictly greater than that of closed-book rewriting. Thus, OBBR neutralizes harmful content by projecting training samples to the space of benign prompts. We then show that, in contrast to previous defenses, OBBR effectively mitigates a large number of existing BAs: across five known BAs and four widely used LLMs, OBBR increases safety performance by an average 51% compared to state-of-the-art BA defenses and 25.7% compared to closed-book rewriting methods. Finally, we show that OBBR is computationally efficient relative to other BA defenses, does not degrade model performance on natural language tasks after fine-tuning, and is capable of defending against non-trigger based data poisoning attacks.

2605.19124 2026-05-20 cond-mat.mtrl-sci cond-mat.dis-nn cs.LG physics.chem-ph

Atomistic Modeling of Chemical Disorder in Materials: Bridging Classical Methods and AI-Assisted Approaches

材料中化学无序的原子模型:连接经典方法和AI辅助方法

Jiayu Peng, Peichen Zhong

AI总结 本文探讨了如何通过结合经典方法和AI技术来解决材料中化学无序的表示差距问题,重点介绍了如何利用计算方法将平均无序描述转换为具有代表性的构型集合,并平衡成本、偏差和保真度。

详情
AI中文摘要

化学无序源于多种元素占据晶格位置的混合占据,广泛存在于合金、陶瓷和成分复杂的材料中,其中短程和长程有序可以显著影响性质。一个核心障碍是实验与模拟之间的表示差距:实验通常报告无序为部分占据和集体平均行为,而原子模拟和AI工作流程通常需要完全指定的配置。解决这一差距需要能够将平均无序描述转换为代表性构型集合的计算方法,同时平衡成本、偏差和保真度。这一挑战在AI驱动的计算发现中变得更加紧迫,因为忽略无序可能导致AI工作流程错误排名稳定性、错误判断新颖性和误导实验,使用过于理想化的表示。本文综述了经典方法和AI驱动方法如何弥合这一表示差距。我们评估了从平均场理论、簇扩展、准随机近似、蒙特卡洛以及新兴的由通用原子间势能和生成模型驱动的方法的优缺点。我们还强调了AI如何通过降低微状态评估、构型探索和原子到热力学闭合的成本来加速经典计算方案。我们还强调了AI如何使无序原生能力成为可能,包括工作流程优先级、对有序敏感和化学表示、生成模型的无序结构和分布以及对动力学敏感的无序预测。共同,这一框架概述了通往无序原生AI的实用路线图,将化学无序从一个表示障碍转变为现实AI加速材料发现中的可控变量。

英文摘要

Chemical disorder, originating from the mixed occupation of crystallographic sites by multiple elements, is widespread in alloys, ceramics, and compositionally complex materials, where short- and long-range orderings can strongly influence properties. A central obstacle is the representation gap between experiments and simulations: experiments often report disorder as partial occupancies and ensemble-averaged behaviors, whereas atomistic simulations and AI workflows usually require fully specified configurations. Tackling this gap requires computational methods that convert averaged disorder descriptions into representative configurational ensembles while balancing cost, bias, and fidelity. This challenge has become more urgent in AI-driven computational discovery, where ignoring disorder may cause AI workflows to misrank stability, misjudge novelty, and misdirect experiments with too-idealized representations. This Review highlights how classical and AI-driven methods can bridge this representation gap. We assess the strengths and limitations of approaches spanning mean-field theories, cluster expansion, quasi-random approximations, Monte Carlo, and emerging schemes powered by universal interatomic potentials and generative models. We further highlight how AI can accelerate classical computational schemes by lowering the cost of microstate evaluation, configurational exploration, and atomistic-to-thermodynamic closure. We also emphasize how AI can enable disorder-native capabilities, including workflow triage, ordering-sensitive and alchemical representations, generative models of disordered structures and distributions, and kinetics-aware disorder prediction. Together, this framework outlines a practical roadmap toward disorder-native AI, which can transform chemical disorder from a representational obstacle into a controllable variable for realistic AI-accelerated materials discovery.

2605.19122 2026-05-20 stat.ML cs.LG

Dual-Channel Tensor Neural Networks: Finite-Sample Theory and Conformal Structure Selection

双通道张量神经网络:有限样本理论与符合结构选择

Elynn Chen, Jiayu Li, Zheshi Zheng, Jian Pei

AI总结 本文提出双通道张量神经网络(DC-TNN),通过分解张量输入为低秩核心和稀疏细化部分,并通过耦合的神经通道处理两者。该框架结构无关,可容纳CP、Tucker和张量列车核心。在估计方面,建立了DC-TNN估计器的非渐近风险界,并展示了有效维度由核心秩和细化稀疏性共同决定。在推断方面,开发了结构感知符合ROC程序,产生具有有限样本、分布自由覆盖的ROC和AUC置信带。基于此,提出了符合结构选择器,是首个具有有限样本有效性的分布自由候选张量分解选择方法。模拟和蛋白质数据集分析显示了竞争性的预测精度、可靠的不确定性量化和一致的张量结构恢复。

详情
AI中文摘要

张量值数据自然出现在神经影像、基因组学、气候科学和时空网络中,其中多线性依赖关系在模式间携带信息,而向量化会破坏这些信息。现有方法要么施加单一低秩结构,可能遗漏局部信号,要么将张量视为长向量,从而丢弃其多维几何。我们提出双通道张量神经网络(DC-TNN),将每个张量输入分解为低秩核心和稀疏细化,并通过耦合的神经通道处理两个组件。该框架结构无关,可容纳CP、Tucker和张量列车核心于单一架构中。在估计方面,我们建立了DC-TNN估计器的非渐近风险界,将其分解为网络近似、核心估计和细化选择项,并显示有效维度由核心秩和细化稀疏性共同决定,而非由张量环境大小决定。在推断方面,我们开发了结构感知符合ROC程序,校准在核心-细化潜在空间中,并产生具有有限样本、分布自由覆盖的ROC和AUC置信带。基于此,我们提出了符合结构选择器,据我们所知,是首个具有有限样本有效性的分布自由候选张量分解选择方法。模拟和蛋白质数据集分析显示了竞争性的预测精度、可靠的不确定性量化和一致的张量结构恢复。

英文摘要

Tensor-valued data arise naturally in neuroimaging, genomics, climate science, and spatiotemporal networks, where multilinear dependencies across modes carry information that is destroyed under vectorization. Existing approaches either impose a single low-rank structure, which can miss localized signal, or treat the tensor as a long vector, which discards its multiway geometry. We propose a *Dual-Channel Tensor Neural Network* (DC-TNN) that decomposes each tensor input into a low-rank core and a sparse refinement, and processes the two components through coupled neural channels. The framework is structure-agnostic and accommodates CP, Tucker, and tensor-train cores within a single architecture. For estimation, we establish non-asymptotic risk bounds for the DC-TNN estimator that decompose into network approximation, core estimation, and refinement-selection terms, and show that the effective dimension is determined jointly by the core rank and refinement sparsity rather than by the ambient tensor size. For inference, we develop a *structure-aware conformal ROC* procedure that calibrates within the core-refinement latent space and produces ROC and AUC confidence bands with finite-sample, distribution-free coverage. Building on this, we propose a *conformal structure selector* that, to our knowledge, is the *first distribution-free procedure* for choosing among candidate tensor decompositions with finite-sample validity. Simulations and an analysis of a protein dataset demonstrate competitive predictive accuracy, reliable uncertainty quantification, and consistent recovery of the tensor structure.

2605.19119 2026-05-20 cs.NE cs.AI cs.LG

GOAL: Graph-based Objective-Aligned Diffusion Solvers for Dynamic Multi-Objective Optimization

GOAL: 图基基于的目标对齐扩散求解器用于动态多目标优化

Xingyu Li

AI总结 本文提出GOAL,一种基于图的扩散求解器,用于动态多目标优化问题,通过条件化扩散求解器实现可控决策生成,通过人类指定的目标进行条件化,引入异构图编码,允许信息根据约束的本体进行选择性传播,并在三个经典调度基准上实现了100%的解可行性和接近零的MAPE。

详情
AI中文摘要

现有的神经组合优化求解器将解决方案搜索框定为模仿最优决策,本质上限制了其在单目标最小化和静态约束下的用途。我们提出了GOAL,一种基于关系图表示的条件扩散求解器,能够通过在人类指定的目标上进行条件化来实现可控的决策生成。我们引入了一种异构图编码,在其中不同的边类型,对应于不同类别的约束,定义了图神经网络的消息传递结构,这允许信息根据每个约束的本体进行选择性传播。GOAL在三个经典调度基准上进行了实例化和评估,这些基准涵盖了各种约束复杂度:流水作业问题(FSP)、作业调度问题(JSP)和灵活作业调度问题(FJSP)。在不进行架构修改的情况下,通用性在结构上不同的约束领域和问题类型中得到证明。在所有三个基准上,GOAL在20个作业和60个操作的问题规模上实现了100%的解可行性和接近零的MAPE(低于0.20%)在多个目标上,优于NSGA-II和MOEA/D在解质量和推理速度上最多提高了25倍。

英文摘要

Existing neural combinatorial optimization solvers frame solution search as imitation of optimal decisions, inherently limiting their utility to single-objective minimization and static constraints. We propose GOAL, a conditioned diffusion solver over relational graph representations that enables controllable decision generations by conditioning on human-specified objectives. We introduce a heterogeneous graph encoding in which distinct edge types, corresponding to different classes of constraints, define the message passing structure of the graph neural network, which allows information to propagate selectively according to the ontology of each constraint. GOAL is instantiated and evaluated on three canonical scheduling benchmarks of various constraint complexity: the Flow Shop Problem (FSP), the Job Shop Scheduling Problem (JSP), and the Flexible Job Shop Scheduling Problem (FJSP). Generalization is demonstrated across structurally distinct constraint regimes and problem types without architectural modification. On all three benchmarks, GOAL achieves 100% solution feasibility and near-zero MAPE (below 0.20%) on multiple objectives for problem sizes up to 20 jobs and 60 operations, outperforming NSGA-II and MOEA/D in both solution quality and inference speed by up to 25x.

2605.19113 2026-05-20 stat.ME cs.LG stat.ML

Learning Interpretable Point-Based Clinical Risk Scores via Direct Optimization

通过直接优化学习可解释的基于点的临床风险评分

Ying Cui, Albert M Li, Vivek Charu, Yeon-Mi Hwang, Tina Hernandez-Boussard, Lu Tian

AI总结 本文提出了一种新的机器学习算法,通过灵活的贪心优化策略直接学习可解释的基于点的临床风险评分,以在明确的最优性目标下优化加法评分。

Comments 23 pages, 4 figures

详情
AI中文摘要

许多临床风险评分被部署为加法规则,其中相关的二元预测特征被分配非负整数点。这些整数权重不仅使评分在实践中更容易使用,还促进了所得到的预测模型的稀疏性。此类风险评分通常通过首先拟合回归模型,然后经过适当缩放后将估计的系数四舍五入到最近的整数来获得。这种方法计算速度快,但不能保证最终评分的最优性。替代方法是通过遍历所有可能的整数权重,将问题视为整数规划任务,直接优化价值函数。然而,相关计算负担可能相当大,尤其是当价值函数是非凸甚至不连续时。在本文中,我们开发了新的机器学习算法,采用灵活的贪心优化策略,在明确且合理的最优性目标下直接学习此类加法评分。我们应用所提出的方法,利用Epic Cosmos中的大规模电子健康记录(EHR)队列,构建一个整数加权共病评分,用于衡量出院后死亡风险。我们还进行了模拟研究,以考察有限样本的操作特性。

英文摘要

Many clinical risk scores are deployed as additive rules with nonnegative integer points assigned to relevant binary predictive features. These integer weights not only make the score easier to use in practice but also promote sparsity in the resulting prediction model. Such risk scores are often derived by first fitting a regression model and then rounding the estimated coefficients to the nearest integer after appropriate scaling. This approach is computationally fast but does not guarantee optimality of the resulting score. Alternatively, one may search over all possible integer weights to directly optimize a value function by posing the problem as an integer programming task. However, the associated computational burden can be substantial, especially when the value function is nonconcave or even discontinuous. In this paper, we develop new machine learning algorithms that employ a flexible greedy optimization strategy to learn such additive scoring directly under explicit and sensible optimality objectives. We apply the proposed method to a large electronic health record (EHR) cohort in Epic Cosmos to construct an integer-weighted comorbidity score for measuring the risk of post-discharge mortality. We also conduct a simulation study to examine the finite-sample operating characteristics.

2605.19064 2026-05-20 cs.HC cs.AI

Toward an AI-Powered Computational Testbed for Workforce Policy

迈向由人工智能驱动的劳动力政策计算测试平台

Sumer S. Vaid, Ashley V. Whillans

AI总结 本文提出了一种动态员工代理,结合LLM生成代理、管理科学和组织行为研究,以预测员工在组织变革中的心理和行为反应,同时定义了隐私、准确性和代表性保障措施。

详情
AI中文摘要

劳动力转型难以预测且管理不当成本高昂。特别是,人工智能在知识工作中的整合目前影响了全球大量劳动力,但这一转变缺乏工具来预测个体员工的心理和行为反应。我们结合最近的LLM生成代理进展与基础管理科学和组织行为研究,提出动态员工代理。在同意的群体中,这些代理可以利用HR记录、验证心理测量和数字活动数据进行播种,以模拟员工在计划组织变革期间连续工作日中的认知、情感和行为轨迹。本文详细说明了构建此模拟平台所需的计算架构,并定义了负责任部署所需的隐私、准确性和代表性保障措施。我们主张建立这种前瞻性预测基础设施是管理当前全球劳动力围绕人工智能重新调整的关键技术要求。

英文摘要

Workforce transformations are difficult to forecast and costly to mismanage. In particular, the integration of artificial intelligence into knowledge work currently affects a substantial share of the global workforce, yet this transition proceeds without tools to forecast how individual employees will respond psychologically and behaviorally. We combine recent advances in LLM-powered generative agents with foundational management science and organizational behavior research to propose dynamic employee agents. Among consenting populations, these agents can be seeded with HR records, validated psychometric measures, and digital activity data to simulate employees' cognitive, emotional, and behavioral trajectories across successive workdays during planned organizational changes. In this article, we detail the computational architecture required to construct this simulation platform and define the privacy, accuracy, and representativeness safeguards necessary for responsible deployment. We argue that establishing this prospective forecasting infrastructure is a critical technical requirement for managing the current global workforce realignment around AI.

2605.19043 2026-05-20 cs.CY cs.AI cs.HC

Automated Grading of Handwritten Mathematics Using Vision-Capable LLMs

使用具备视觉能力的LLM进行手写数学自动评分

Jacob Levine, Miguel Aenlle, Craig Zilles, Matthew West, Mariana Silva

AI总结 本文研究了使用具备视觉能力的LLM对手写数学作业进行自动评分,通过对比AI评分与人工评分,发现大多数错误源于转录失败而非评分标准应用错误,揭示了LLM在手写数学评分中的潜力与局限。

Comments To be published in the International Conference on AI in Education (AIED), 2026

详情
AI中文摘要

自动评分系统已能对多种响应类型进行大规模评估,但手写数学作业仍是一个障碍,因为其多步骤解决方案的复杂性。具备视觉能力的大语言模型(LLM)在此领域提供了新机会,但其在真实教学环境中的可靠性仍不明确。本文介绍了基于LLM的手写数学作业评分系统,使用教师定义的评分标准进行评估。在扩展先前针对 typed 响应的流程时,我们整合了对照片提交的转录和基于评分标准的评估,通过单次LLM调用完成。在两个大学STEM课程的学生作业上进行了评估。将AI评分决策与人工分配的地面真实值在评分项层面进行比较,我们观察到总体准确率较高,大多数错误——在最佳模型中为87%——归因于转录失败,而非评分标准应用错误。我们分类了常见的错误模式,包括图像质量问题、幻觉内容以及等价表达的处理错误。这些发现突显了LLM在手写数学评分中的潜力和局限,为系统设计、提示优化和教育环境中的部署提供了指导。

英文摘要

Automated grading systems have enabled scalable assessment for many response types, but handwritten mathematics remains a barrier due to the complexity of multi-step solutions. Vision-capable large language models (LLMs) offer new opportunities here, yet their reliability in authentic instructional settings remains poorly understood. We present an empirical evaluation of an LLM-based grader for handwritten mathematical work using instructor-defined rubrics. Extending a prior pipeline for typed responses, we integrate transcription and rubric-based evaluation of photographic submissions within a single LLM call, evaluating on student work from two university STEM courses. Comparing AI grading decisions against human-assigned ground truth at the rubric-item level, we observe high overall accuracy, with most errors -- 87\% in the best model -- attributable to transcription failures rather than rubric misapplication. We categorize common error modes, including image quality issues, hallucinated content, and incorrect handling of equivalent expressions. These findings highlight both the promise and limitations of LLM-based grading for handwritten mathematics, providing guidance for system design, prompt refinement, and deployment in educational settings.

2605.19024 2026-05-20 stat.ML cs.LG stat.ME

Conformal Prediction via Transported Beta Laws

通过运输的贝塔定律进行符合预测

Thiago R. Ramos, Helton Graziadei, Luben M. C. Cabezas

AI总结 本文研究了通过实现的符合阈值诱导的校准-条件覆盖定律,利用贝塔分布作为有限样本参考对象,并通过Wasserstein距离量化偏离,从而提供对边际覆盖差距和坏校准概率的直接界限,并区分不同非i.i.d行为的来源。

详情
AI中文摘要

分割符合预测在交换性下提供有限样本边际覆盖保证,但此保证平均于随机校准样本。我们研究的是由实现的符合阈值诱导的校准-条件覆盖定律。在连续i.i.d情况下,此定律恰好为Beta(k,n+1-k),因此常规的边际保证对应于其均值。我们将此贝塔定律作为有限样本参考对象,并利用Wasserstein距离在[0,1]上量化偏离。该框架提供了对边际覆盖差距和坏校准概率的直接界限,并根据如何变形贝塔参考来区分不同的非i.i.d行为:测试侧偏移通过覆盖尺度上的运输映射作用,而校准依赖性改变顺序统计学定律本身。我们将在尺度-偏移、聚类和稳定混合设置中实例化该框架,其中诱导的变形可以明确表征或通过Berry-Esseen近似表征。在依赖过程上的模拟证实,一阶近似在中等样本大小下能够跟踪经验Wasserstein距离。

英文摘要

Split conformal prediction provides finite-sample marginal coverage under exchangeability, but this guarantee averages over the random calibration sample. We study instead the law of the calibration-conditional coverage induced by a realized conformal threshold. In the continuous i.i.d. setting this law is exactly $Beta(k,n+1-k)$, so the usual marginal guarantee corresponds to its mean. We take this beta law as a finite-sample reference object and quantify departures from it using Wasserstein distances on $[0,1]$. The framework yields direct bounds on marginal coverage gaps and on bad-calibration probabilities, and separates different sources of non-i.i.d. behavior according to how they deform the beta reference: test-side shift acts through a transport map on the coverage scale, while calibration dependence changes the order-statistic law itself. We instantiate the framework in scale-shift, clustered, and stationary mixing settings, where the induced deformations can be characterized explicitly or through Berry-Esseen approximations. Simulations on dependent processes confirm that the first-order approximation tracks the empirical Wasserstein distance even at moderate sample sizes.