arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2256
2603.14515 2026-05-28 cs.LG physics.chem-ph physics.comp-ph quant-ph

Excited Pfaffians: Generalized Neural Wave Functions Across Structure and State

激发Pfaffians:跨结构和状态的广义神经波函数

Nicholas Gao, Till Grutschus, Frank Noé, Stephan Günnemann

AI总结 提出多态重要性采样(MSIS)和激发Pfaffians架构,以近恒定样本量高效计算多态重叠,并在单个神经网络中表示多个激发态,实现更快训练和更多状态建模。

详情
AI中文摘要

变分蒙特卡洛(VMC)中的神经网络波函数在精确表示基态和激发态方面取得了巨大成功。然而,在状态重叠中实现足够的数值精度需要增加蒙特卡洛样本数量,从而增加计算成本,且随状态数增加。我们提出了一种近乎恒定样本量的方法——多态重要性采样(MSIS),利用来自所有状态的样本来估计成对重叠。为了高效评估所有样本的所有状态,我们引入了激发Pfaffians。受Hartree-Fock启发,该架构在单个神经网络内表示多个状态。激发Pfaffians还作为广义波函数,允许单个模型表示多态势能面。在碳二聚体上,我们匹配了$O(N_s^4)$标度的自然激发态,同时训练速度提高了$>200$倍,并建模了多50%的状态。我们有利的标度使我们能够首次使用神经网络找到铍原子的所有不同能级。最后,我们证明了单个波函数可以表示不同分子中的激发态。

英文摘要

Neural-network wave functions in Variational Monte Carlo (VMC) have achieved great success in accurately representing both ground and excited states. However, achieving sufficient numerical accuracy in state overlaps requires increasing the number of Monte Carlo samples, and consequently the computational cost, with the number of states. We present a nearly constant sample-size approach, Multi-State Importance Sampling (MSIS), that leverages samples from all states to estimate pairwise overlap. To efficiently evaluate all states for all samples, we introduce Excited Pfaffians. Inspired by Hartree-Fock, this architecture represents many states within a single neural network. Excited Pfaffians also serve as generalized wave functions, allowing a single model to represent multi-state potential energy surfaces. On the carbon dimer, we match the $O(N_s^4)$-scaling natural excited states while training $>200\times$ faster and modeling 50% more states. Our favorable scaling enables us to be the first to use neural networks to find all distinct energy levels of the beryllium atom. Finally, we demonstrate that a single wave function can represent excited states across various molecules.

2602.20497 2026-05-28 cs.CV cs.AI

LESA: Learnable Stage-Aware Predictors for Diffusion Model Acceleration

LESA: 可学习的阶段感知预测器用于扩散模型加速

Peiliang Cai, Jiacheng Liu, Haowen Xu, Xinyu Wang, Chang Zou, Linfeng Zhang

AI总结 针对扩散模型计算开销大、现有缓存策略难以适应去噪过程阶段动态变化的问题,提出基于两阶段训练的可学习阶段感知预测器框架,利用KAN网络学习时序特征映射并采用多阶段多专家架构,在保持高质量生成的同时实现显著加速。

Comments Accepted to CVPR 2026

详情
AI中文摘要

扩散模型在图像和视频生成任务中取得了显著成功。然而,扩散Transformer(DiTs)的高计算需求对其实际部署构成了重大挑战。虽然特征缓存是一种有前景的加速策略,但现有基于简单重用或无训练预测的方法难以适应扩散过程中复杂的、阶段相关的动态变化,常常导致质量下降,并无法保持与标准去噪过程的一致性。为解决这一问题,我们提出了一种基于两阶段训练的可学习阶段感知(LESA)预测器框架。我们的方法利用Kolmogorov-Arnold网络(KAN)从数据中准确学习时序特征映射。我们进一步引入了一种多阶段、多专家架构,为不同噪声水平阶段分配专门的预测器,从而实现更精确和鲁棒的特征预测。大量实验表明,我们的方法在保持高保真生成的同时实现了显著加速。实验显示,在FLUX.1-dev上实现了5.00倍加速,质量下降极小(1.0%);在Qwen-Image上实现了6.25倍加速,质量比之前的最优方法(TaylorSeer)提升20.2%;在HunyuanVideo上实现了5.00倍加速,PSNR比TaylorSeer提升24.7%。在文本到图像和文本到视频合成任务上的最先进性能验证了我们基于训练框架在不同模型上的有效性和泛化能力。我们的代码可在https://github.com/caipeiliang2004/LESA获取。

英文摘要

Diffusion models have achieved remarkable success in image and video generation tasks. However, the high computational demands of Diffusion Transformers (DiTs) pose a significant challenge to their practical deployment. While feature caching is a promising acceleration strategy, existing methods based on simple reusing or training-free forecasting struggle to adapt to the complex, stage-dependent dynamics of the diffusion process, often resulting in quality degradation and failing to maintain consistency with the standard denoising process. To address this, we propose a LEarnable Stage-Aware (LESA) predictor framework based on two-stage training. Our approach leverages a Kolmogorov-Arnold Network (KAN) to accurately learn temporal feature mappings from data. We further introduce a multi-stage, multi-expert architecture that assigns specialized predictors to different noise-level stages, enabling more precise and robust feature forecasting. Extensive experiments show our method achieves significant acceleration while maintaining high-fidelity generation. Experiments demonstrate 5.00x acceleration on FLUX.1-dev with minimal quality degradation (1.0% drop), 6.25x speedup on Qwen-Image with a 20.2% quality improvement over the previous SOTA (TaylorSeer), and 5.00x acceleration on HunyuanVideo with a 24.7% PSNR improvement over TaylorSeer. State-of-the-art performance on both text-to-image and text-to-video synthesis validates the effectiveness and generalization capability of our training-based framework across different models. Our code is available at https://github.com/caipeiliang2004/LESA.

2602.18982 2026-05-28 cs.LG q-bio.PE

Conditionally Site-Independent Neural Evolution of Antibody Sequences

抗体序列的条件性位点无关神经进化

Stephen Zhewen Lu, Aakarsh Vermani, Kohei Sanno, Jiarui Lu, Frederick A Matsen, Milind Jagota, Yun S. Song

AI总结 提出CoSiNE模型,用深度神经网络参数化的连续时间马尔可夫链桥接系统发育模型与深度学习,实现抗体序列进化建模,在零样本变异效应预测中优于现有语言模型,并引入引导吉莱斯皮采样优化抗体亲和力。

Comments 28 pages, 15 figures. Accepted as a poster at ICML 2026

详情
AI中文摘要

常见的抗体工程深度学习方法侧重于建模序列的边缘分布。然而,这些方法将序列视为独立样本,忽略了亲和力成熟作为抗体探索潜在适应度景观的进化过程中丰富且很大程度上未开发的信息来源。相比之下,经典的系统发育模型明确表示进化动力学,但缺乏捕捉复杂上位相互作用的表达能力。我们通过CoSiNE(一种由深度神经网络参数化的连续时间马尔可夫链)弥合了这一差距。数学上,我们证明CoSiNE提供了难以处理的顺序点突变过程的一阶近似,以分支长度二次方的误差界捕捉上位效应。实验上,CoSiNE通过明确区分选择与上下文依赖的体细胞超突变,在零样本变异效应预测中优于最先进的语言模型。最后,我们引入了引导吉莱斯皮(Guided Gillespie),一种在推理时引导CoSiNE的分类器引导采样方案,从而实现对特定抗原的抗体结合亲和力的高效优化。

英文摘要

Common deep learning approaches for antibody engineering focus on modeling the marginal distribution of sequences. By treating sequences as independent samples, however, these methods overlook affinity maturation as a rich and largely untapped source of information about the evolutionary process by which antibodies explore the underlying fitness landscape. In contrast, classical phylogenetic models explicitly represent evolutionary dynamics but lack the expressivity to capture complex epistatic interactions. We bridge this gap with CoSiNE, a continuous-time Markov chain parameterized by a deep neural network. Mathematically, we prove that CoSiNE provides a first-order approximation to the intractable sequential point mutation process, capturing epistatic effects with an error bound that is quadratic in branch length. Empirically, CoSiNE outperforms state-of-the-art language models in zero-shot variant effect prediction by explicitly disentangling selection from context-dependent somatic hypermutation. Finally, we introduce Guided Gillespie, a classifier-guided sampling scheme that steers CoSiNE at inference time, enabling efficient optimization of antibody binding affinity toward specific antigens.

2603.13003 2026-05-28 cs.RO cs.SY eess.SY

From Passive Monitoring to Active Defence: Resilient Control of Manipulators Under Cyberattacks

从被动监测到主动防御:网络攻击下机械臂的弹性控制

Gabriele Gualandi, Alessandro V. Papadopoulos

AI总结 针对虚假数据注入攻击(FDIA)下冗余机械臂的弹性控制问题,提出一种基于异常分数的主动控制级防御方法,通过单调函数衰减控制输入,显著降低攻击引起的末端执行器偏差,同时保证无攻击时的标称性能。

Comments v2: Accepted at ICRA 2026. Corrected minor typos, grammatical errors, and notation inconsistencies. Corrected the attacker's PD law in Sec. III-C: removed the feedforward acceleration term, viable only when the attacker assumes sufficient tracking precision; the active defence prevents this in our experiments, so only PD terms are used

详情
AI中文摘要

网络物理机器人系统容易受到虚假数据注入攻击(FDIAs),其中攻击者在规避基于残差的被动异常检测器(如卡方检验)的同时破坏传感器信号。这种隐蔽攻击可以在不触发警报的情况下引起显著的末端执行器偏差。本文研究了冗余机械臂对隐蔽FDIAs的弹性,并将架构从被动监测推进到主动防御。我们建立了一个闭环模型,包括反馈线性化机械臂、稳态卡尔曼滤波器和基于卡方的异常检测器。在此被动监测层的基础上,我们提出了一种主动控制级防御,通过一个新颖的驱动投影、无测量状态预测器生成的异常分数的单调函数来衰减控制输入。所提出的设计在标称驱动损失上提供了概率保证,并保持了闭环稳定性。从攻击者角度,我们推导了一个凸QCQP来计算一步最优隐蔽攻击。在六自由度平面机械臂上的仿真表明,所提出的防御显著减少了攻击引起的末端执行器偏差,同时在无攻击时保持了标称任务性能。

英文摘要

Cyber-physical robotic systems are vulnerable to false data injection attacks (FDIAs), in which an adversary corrupts sensor signals while evading residual-based passive anomaly detectors such as the chi-squared test. Such stealthy attacks can induce substantial end-effector deviations without triggering alarms. This paper studies the resilience of redundant manipulators to stealthy FDIAs and advances the architecture from passive monitoring to active defence. We formulate a closed-loop model comprising a feedback-linearized manipulator, a steady-state Kalman filter, and a chi-squared-based anomaly detector. Building on this passive monitoring layer, we propose an active control-level defence that attenuates the control input through a monotone function of an anomaly score generated by a novel actuation-projected, measurement-free state predictor. The proposed design provides probabilistic guarantees on nominal actuation loss and preserves closed-loop stability. From the attacker perspective, we derive a convex QCQP for computing one-step optimal stealthy attacks. Simulations on a 6-DOF planar manipulator show that the proposed defence significantly reduces attack-induced end-effector deviation while preserving nominal task performance in the absence of attacks.

2603.12344 2026-05-28 cs.LG

Can Decision Trees Teach Large Language Models? Distilling Verbalized Knowledge for Molecular Property Prediction

决策树能否教会大语言模型?为分子性质预测提炼语言化知识

Khiem Le, Sreejata Dey, Marcos Martínez Galindo, Vanessa Lopez, Ting Hua, Nitesh V. Chawla, Hoang Thanh Lam

AI总结 提出TreeKD方法,通过将基于决策树/随机森林的专业模型知识语言化并融入提示,训练大语言模型,显著提升其在分子性质预测任务上的性能。

详情
AI中文摘要

分子性质预测(MPP)是药物发现中的一个基本问题,近年来受到越来越多的关注。大语言模型(LLMs)以其跨领域的惊人能力而闻名,有望成为MPP的通用模型。然而,它们目前的性能仍低于实际应用所需的阈值。为了弥补这一差距,我们提出了TreeKD,用于将基于树的专业模型的知识提炼到LLMs中,以补充LLMs的内部知识并提高其预测准确性。对于每个性质,我们使用输入分子中4万个功能基团衍生的特征训练一个专业决策树。然后,将决策树学习到的预测规则(编码了其知识)语言化,并纳入用于训练LLMs的提示中。此外,通过用随机森林替换单个决策树,我们引入了一种称为规则一致性的测试时缩放技术,该技术聚合了从不同规则构建的不同提示生成的预测。使用两个LLM(Gemma-2-2B和Granite-3.3-2B)在包含22个预测任务的TDC基准上进行的大量评估表明,我们的方法显著提高了LLMs的性能,推动了MPP通用模型的发展。

英文摘要

Molecular Property Prediction (MPP) is a fundamental problem in drug discovery that has recently attracted growing attention. Large Language Models (LLMs), known for their impressive proficiency across domains, show promise as generalist models for MPP. However, their current performance remains below the threshold needed for practical adoption. To bridge this gap, we propose TreeKD for distilling the knowledge of tree-based specialist models into LLMs to complement the internal knowledge of LLMs and improve their predictive accuracy. For each property, we train a specialist decision tree using features derived from 40K functional groups in the input molecules. Then, the predictive rule learned by the decision tree, which encodes its knowledge, is verbalized and incorporated into the prompts for training LLMs. In addition, by replacing a single decision tree with a Random Forest, we introduce a test-time scaling technique called rule-consistency, which aggregates predictions generated from different prompts constructed with different rules. An extensive evaluation with two LLMs, Gemma-2-2B and Granite-3.3-2B, on the TDC benchmark with 22 prediction tasks shows that our method substantially enhances the performance of LLMs, advancing the development of generalist models for MPP.

2603.10961 2026-05-28 cs.LG

Bio-Inspired Self-Supervised Learning for Wrist-worn Accelerometer Data

生物启发的自监督学习用于腕戴式加速度计数据

Prithviraj Tarale, Kiet Chu, Abhishek Varghese, Kai-Chun Liu, Maxwell A. Xu, Mohit Iyyer, Sunghoon I. Lee

AI总结 提出基于运动子单元理论的令牌化策略,通过掩码重建预训练Transformer编码器,在六个HAR基准上超越现有自监督方法。

详情
AI中文摘要

可穿戴加速度计能够实现大规模健康监测,但学习鲁棒的人体活动表示受到标记数据稀缺的限制。虽然自监督学习提供了一种解决方案,但现有方法将传感器流视为非结构化时间序列,忽略了人体运动的潜在生物结构,我们认为这一因素对于有效的人类活动识别(HAR)至关重要。我们引入了一种新颖的令牌化策略,该策略基于运动控制的子单元理论,该理论认为连续的手腕运动由称为子单元的基本基函数组成。我们将令牌定义为运动片段,这是一个计算上可处理的运动单元,由有限序列的子单元组成。通过掩码重建这些令牌来预训练Transformer编码器,我们将学习焦点从局部波形形态转移到高层次的结构和时间组织。在NHANES语料库(约28k小时;11k参与者)上预训练后,我们的表示在六个受试者分离的HAR基准上优于强大的可穿戴SSL基线。代码和预训练权重可在https://prithvitarale.github.io/biopm-site/获取。

英文摘要

Wearable accelerometers enable large-scale health monitoring, yet learning robust human-activity representations has been constrained by scarce labeled data. While self-supervised learning offers a remedy, existing methods treat sensor streams as unstructured time series, overlooking the underlying biological structure of human movement, a factor we argue is critical for effective Human Activity Recognition (HAR). We introduce a novel tokenization strategy grounded in the submovement theory of motor control, which posits that continuous wrist motion is composed of elementary basis functions called submovements. We define our token as the movement segment, a computationally tractable unit of motion composed of a finite sequence of submovements. By pretraining a Transformer encoder via masked reconstruction of these tokens, we shift the learning focus from local waveform morphology to high-level structural and temporal organization. Pretrained on the NHANES corpus (approximately 28k hours; 11k participants), our representations outperform strong wearable SSL baselines across six subject-disjoint HAR benchmarks. Code and pretrained weights are available at https://prithvitarale.github.io/biopm-site/.

2603.09882 2026-05-28 cs.RO cs.AI

Emerging Extrinsic Dexterity in Cluttered Scenes via Dynamics-aware Policy Learning

杂乱场景中通过动力学感知策略学习涌现的外在灵巧性

Yixin Zheng, Jiangran Lyu, Yifan Zhang, Jiayi Chen, Mi Yan, Yuntian Deng, Xuesong Shi, Xiaoguang Zhao, Yizhou Wang, Zhizheng Zhang, He Wang

AI总结 提出动力学感知策略学习框架,通过显式世界建模学习接触诱导物体动力学表示并用于强化学习,使杂乱场景中的外在灵巧性无需手工启发式或复杂奖励塑造即可涌现。

Comments Accepted to Robotics: Science and Systems (RSS) 2026. Project page: https://pku-epic.github.io/DAPL/

详情
AI中文摘要

外在灵巧性利用环境接触来克服抓取操作的局限性。然而,在杂乱场景中实现这种灵巧性仍然具有挑战性且未被充分探索,因为它需要选择性地利用多个相互作用的物体之间的接触,而这些物体具有内在耦合的动力学。现有方法缺乏对这种复杂动力学的显式建模,因此在杂乱环境中的非抓取操作方面表现不足,这反过来限制了它们在现实环境中的实际应用。在本文中,我们介绍了一种动力学感知策略学习(DAPL)框架,该框架可以利用在杂乱环境中学习到的接触诱导物体动力学的表示来促进策略学习。这种表示通过显式世界建模学习,并用于条件化强化学习,使得外在灵巧性无需手工制作的接触启发式或复杂的奖励塑造即可涌现。我们在仿真和现实世界中评估了我们的方法。在具有不同密度的未见过的仿真杂乱场景中,我们的方法在成功率上比抓取操作、人类遥操作和基于先前表示的策略高出25%以上。在10个杂乱场景中,现实世界的成功率达到了约50%,而实际杂货部署进一步证明了稳健的仿真到现实迁移和适用性。

英文摘要

Extrinsic dexterity leverages environmental contact to overcome the limitations of prehensile manipulation. However, achieving such dexterity in cluttered scenes remains challenging and underexplored, as it requires selectively exploiting contact among multiple interacting objects with inherently coupled dynamics. Existing approaches lack explicit modeling of such complex dynamics and therefore fall short in non-prehensile manipulation in cluttered environments, which in turn limits their practical applicability in real-world environments. In this paper, we introduce a Dynamics-Aware Policy Learning (DAPL) framework that can facilitate policy learning with a learned representation of contact-induced object dynamics in cluttered environments. This representation is learned through explicit world modeling and used to condition reinforcement learning, enabling extrinsic dexterity to emerge without hand-crafted contact heuristics or complex reward shaping. We evaluate our approach in both simulation and the real world. Our method outperforms prehensile manipulation, human teleoperation, and prior representation-based policies by over 25% in success rate on unseen simulated cluttered scenes with varying densities. The real-world success rate reaches around 50% across 10 cluttered scenes, while a practical grocery deployment further demonstrates robust sim-to-real transfer and applicability.

2603.02702 2026-05-28 cs.AI cs.LG

FinTexTS: Financial Text-Paired Time-Series Dataset via Semantic-Based and Multi-Level Pairing

FinTexTS: 基于语义和多层级配对的金融文本-时间序列数据集

Jaehoon Lee, Suhwan Park, Taeyoon Lim, Seunghan Lee, Jun Seo, Dongwan Kang, Hwanil Choi, Minjae Kim, Sungdong Yoo, Soonyoung Lee, Yongjae Lee, Wonbin Ahn

AI总结 提出基于语义和多层级配对的框架,从SEC文件和新闻中提取并匹配多层级文本信息,构建大规模文本配对的股票价格数据集FinTexTS,提升股价预测性能。

Comments 12 pages, KDD 2026, Datasets and Benchmarks Track

详情
AI中文摘要

金融领域涉及多种重要的时间序列问题。近年来,联合利用文本和数值信息的时间序列分析方法越来越受到关注。因此,人们做出了大量努力来构建金融领域中的文本配对时间序列数据集。然而,金融市场具有复杂的相互依赖性,一家公司的股票价格不仅受公司特定事件的影响,还受其他公司事件和更广泛的宏观经济因素的影响。现有的基于简单关键词匹配的文本与金融时间序列数据配对方法往往无法捕捉这种复杂关系。为了解决这一局限性,我们提出了一种基于语义和多层级的配对框架。具体来说,我们从SEC文件中提取目标公司的特定上下文,并应用基于嵌入的匹配机制,根据该上下文检索语义相关的新闻文章。此外,我们使用大语言模型(LLMs)将新闻文章分为四个层级(宏观层级、行业层级、相关公司层级和目标公司层级),实现新闻文章与目标公司的多层级配对。将该框架应用于公开可用的新闻数据集,我们构建了FinTexTS,这是一个新的大规模文本配对的股票价格数据集。在FinTexTS上的实验结果表明,我们的基于语义和多层级的配对策略在股价预测中是有效的。除了FinTexTS所依赖的公开新闻外,我们还表明,将我们的方法应用于专有但精心策划的新闻源,可以产生更高质量的配对数据,并提高股价预测性能。

英文摘要

The financial domain involves a variety of important time-series problems. Recently, time-series analysis methods that jointly leverage textual and numerical information have gained increasing attention. Accordingly, numerous efforts have been made to construct text-paired time-series datasets in the financial domain. However, financial markets are characterized by complex interdependencies, in which a company's stock price is influenced not only by company-specific events but also by events in other companies and broader macroeconomic factors. Existing approaches that pair text with financial time-series data based on simple keyword matching often fail to capture such complex relationships. To address this limitation, we propose a semantic-based and multi-level pairing framework. Specifically, we extract company-specific context for the target company from SEC filings and apply an embedding-based matching mechanism to retrieve semantically relevant news articles based on this context. Furthermore, we classify news articles into four levels (macro-level, sector-level, related company-level, and target company-level) using large language models (LLMs), enabling multi-level pairing of news articles with the target company. Applying this framework to publicly-available news datasets, we construct FinTexTS, a new large-scale text-paired stock price dataset. Experimental results on FinTexTS demonstrate the effectiveness of our semantic-based and multi-level pairing strategy in stock price forecasting. In addition to publicly-available news underlying FinTexTS, we show that applying our method to proprietary yet carefully curated news sources leads to higher-quality paired data and improved stock price forecasting performance.

2603.08264 2026-05-28 cs.CV

Event-based Motion & Appearance Fusion for 6D Object Pose Tracking

基于事件的运动与外观融合的6D物体姿态跟踪

Zhichao Li, Chiara Bartolozzi, Lorenzo Natale, Arren Glover

AI总结 提出一种结合事件相机高时间分辨率优势的无学习方法,通过事件光流传播姿态并利用模板匹配校正,在高速运动物体上达到或超越现有算法性能。

详情
AI中文摘要

物体姿态跟踪是机器人在家庭和工业环境中执行任务的基本且必要的任务。最常用的传感器是RGB-D相机,但在高动态环境中,由于运动模糊和帧率限制,它们可能达到极限。事件相机具有高时间分辨率和低延迟等显著特性,使其成为高速物体姿态跟踪的理想视觉传感器。尽管如此,目前仅有少数工作涉及事件相机的6D姿态跟踪。在这项工作中,我们利用高时间分辨率的优势,提出了一种结合传播步骤与姿态校正策略的方法。具体而言,我们使用从事件光流中获得的6D物体速度进行姿态传播,然后利用基于模板的局部姿态校正模块进行姿态校正。我们的无学习方法与最先进的算法性能相当,并且在某些情况下对快速移动物体的表现更优。结果表明,在深度网络方法受限于低更新速率的高动态场景中,事件相机具有应用潜力。

英文摘要

Object pose tracking is a fundamental and essential task for robotics to perform tasks in the home and industrial settings. The most commonly used sensors to do so are RGB-D cameras, which can hit limitations in highly dynamic environments due to motion blur and frame-rate constraints. Event cameras have remarkable features such as high temporal resolution and low latency, which make them a potentially ideal vision sensors for object pose tracking at high speed. Even so, there are still only few works on 6D pose tracking with event cameras. In this work, we take advantage of the high temporal resolution and propose a method that uses both a propagation step fused with a pose correction strategy. Specifically, we use 6D object velocity obtained from event-based optical flow for pose propagation, after which, a template-based local pose correction module is utilized for pose correction. Our learning-free method has comparable performance to the state-of-the-art algorithms, and in some cases out performs them for fast-moving objects. The results indicate the potential for using event cameras in highly-dynamic scenarios where the use of deep network approaches are limited by low update rates.

2601.21309 2026-05-28 cs.LG

Transferable Graph Condensation from the Causal Perspective

从因果视角的可迁移图压缩

Huaming Du, Yijie Huang, Su Yao, Yiying Wang, Yueyang Zhou, Jingwen Yang, Jinshi Zhang, Han Ji, Yu Zhao, Guisong Liu, Hegui Zhang, Carl Yang, Gang Kou

AI总结 提出基于因果不变性的可迁移图压缩方法TGCC,通过因果干预提取域不变特征并注入压缩图,实现跨任务和跨域场景下的有效压缩。

详情
AI中文摘要

图数据集的规模日益增大,显著提升了图表示学习方法的性能,但也带来了巨大的训练挑战。图数据集压缩技术应运而生,旨在将大规模数据集压缩为更小但信息丰富的数据集,同时保持相似的测试性能。然而,这些方法严格要求下游应用与原始数据集和任务匹配,在跨任务和跨域场景中往往失效。为解决这些挑战,我们提出了一种新颖的基于因果不变性的可迁移图数据集压缩方法,命名为TGCC,提供有效且可迁移的压缩数据集。具体而言,为保留域不变知识,我们首先通过因果干预从图的空间域提取域因果不变特征。然后,为充分捕捉原始图的结构和特征信息,我们执行增强压缩操作。最后,通过谱域增强对比学习,将因果不变特征注入压缩图,确保压缩图保留原始图的因果信息。在五个公开数据集和我们新构建的FinReport数据集上的实验结果表明,TGCC在跨任务和跨域复杂场景下相比现有方法提升高达13.41%,并在6个数据集中的5个上,在单一数据集和任务场景下达到了最先进性能。

英文摘要

The increasing scale of graph datasets has significantly improved the performance of graph representation learning methods, but it has also introduced substantial training challenges. Graph dataset condensation techniques have emerged to compress large datasets into smaller yet information-rich datasets, while maintaining similar test performance. However, these methods strictly require downstream applications to match the original dataset and task, which often fails in cross-task and cross-domain scenarios. To address these challenges, we propose a novel causal-invariance-based and transferable graph dataset condensation method, named TGCC, providing effective and transferable condensed datasets. Specifically, to preserve domain-invariant knowledge, we first extract domain causal-invariant features from the spatial domain of the graph using causal interventions. Then, to fully capture the structural and feature information of the original graph, we perform enhanced condensation operations. Finally, through spectral-domain enhanced contrastive learning, we inject the causal-invariant features into the condensed graph, ensuring that the compressed graph retains the causal information of the original graph. Experimental results on five public datasets and our novel FinReport dataset demonstrate that TGCC achieves up to a 13.41% improvement in cross-task and cross-domain complex scenarios compared to existing methods, and achieves state-of-the-art performance on 5 out of 6 datasets in the single dataset and task scenario.

2603.05642 2026-05-28 cs.RO cs.AI

Relational Semantic Reasoning on 3D Scene Graphs for Open World Interactive Object Search

基于3D场景图的开放世界交互式物体搜索的关系语义推理

Imen Mahdi, Matteo Cassinelli, Fabien Despinoy, Tim Welschehold, Abhinav Valada

AI总结 提出SCOUT方法,通过从LLM蒸馏的关系探索启发式直接搜索3D场景图,实现高效开放世界交互式物体搜索,性能匹配LLM且计算高效。

详情
AI中文摘要

家庭环境中的开放世界交互式物体搜索需要理解物体与其周围环境之间的语义关系,以有效引导探索。先前的方法要么依赖视觉-语言嵌入相似性,这不能可靠地捕获任务相关的关系语义,要么依赖大型语言模型(LLM),这对于实时部署来说太慢且成本高昂。我们提出SCOUT:基于场景图探索的开放世界交互式物体搜索学习效用,这是一种新颖的方法,通过使用关系探索启发式(如房间-物体包含和物体-物体共现)为房间、前沿和物体分配效用分数,直接搜索3D场景图。为了在不牺牲开放词汇泛化能力的情况下使其实用,我们提出了一种离线程序化蒸馏框架,将LLM中的结构化关系知识提取到轻量级模型中,用于机器人上的推理。此外,我们提出了SymSearch,一个用于评估交互式物体搜索任务中语义推理的可扩展符号基准。在符号和模拟环境中的广泛评估表明,SCOUT优于基于嵌入相似性的方法,并在保持计算效率的同时达到LLM级别的性能。最后,真实世界实验证明了向物理环境的有效迁移,在现实感知和导航约束下实现了开放世界交互式物体搜索。

英文摘要

Open-world interactive object search in household environments requires understanding semantic relationships between objects and their surrounding context to guide exploration efficiently. Prior methods either rely on vision-language embeddings similarity, which does not reliably capture task-relevant relational semantics, or large language models (LLMs), which are too slow and costly for real-time deployment. We introduce SCOUT: Scene Graph-Based Exploration with Learned Utility for Open-World Interactive Object Search, a novel method that searches directly over 3D scene graphs by assigning utility scores to rooms, frontiers, and objects using relational exploration heuristics such as room-object containment and object-object co-occurrence. To make this practical without sacrificing open-vocabulary generalization, we propose an offline procedural distillation framework that extracts structured relational knowledge from LLMs into lightweight models for on-robot inference. Furthermore, we present SymSearch, a scalable symbolic benchmark for evaluating semantic reasoning in interactive object search tasks. Extensive evaluations across symbolic and simulation environments show that SCOUT outperforms embedding similarity-based methods and matches LLM-level performance while remaining computationally efficient. Finally, real-world experiments demonstrate effective transfer to physical environments, enabling open-world interactive object search under realistic sensing and navigation constraints.

2603.05425 2026-05-28 cs.CV cs.AI

RelaxFlow: Text-Driven Amodal 3D Generation

RelaxFlow: 文本驱动的非模态3D生成

Jiayin Zhu, Guoji Fu, Xiaolu Liu, Qiyuan He, Yicong Li, Angela Yao

AI总结 针对遮挡下图像到3D生成的语义歧义问题,提出无训练的双分支框架RelaxFlow,通过多先验共识模块和松弛机制解耦控制粒度,实现文本提示引导下对未观察区域的补全,同时严格保留输入观测。

Comments Accepted as a spotlight presentation at ICML 2026. Code: https://github.com/viridityzhu/RelaxFlow

详情
AI中文摘要

图像到3D生成在遮挡下面临固有的语义歧义,仅凭部分观测通常不足以确定物体类别。在这项工作中,我们形式化了文本驱动的非模态3D生成,其中文本提示引导对未观察区域的补全,同时严格保留输入观测。关键的是,我们识别出这些目标需要不同的控制粒度:对观测的刚性控制与对提示的松弛结构控制。为此,我们提出RelaxFlow,一个无训练的双分支框架,通过多先验共识模块和松弛机制解耦控制粒度。理论上,我们证明我们的松弛等价于在生成向量场上应用低通滤波器,抑制高频实例细节以隔离适应观测的几何结构。为便于评估,我们引入了两个诊断基准:ExtremeOcc-3D和AmbiSem-3D。大量实验表明,RelaxFlow成功引导未观察区域的生成以匹配提示意图,同时不损害视觉保真度。

英文摘要

Image-to-3D generation faces inherent semantic ambiguity under occlusion, where partial observation alone is often insufficient to determine object category. In this work, we formalize text-driven amodal 3D generation, where text prompts steer the completion of unseen regions while strictly preserving input observation. Crucially, we identify that these objectives demand distinct control granularities: rigid control for the observation versus relaxed structural control for the prompt. To this end, we propose RelaxFlow, a training-free dual-branch framework that decouples control granularity via a Multi-Prior Consensus Module and a Relaxation Mechanism. Theoretically, we prove that our relaxation is equivalent to applying a low-pass filter on the generative vector field, which suppresses high-frequency instance details to isolate geometric structure that accommodates the observation. To facilitate evaluation, we introduce two diagnostic benchmarks, ExtremeOcc-3D and AmbiSem-3D. Extensive experiments demonstrate that RelaxFlow successfully steers the generation of unseen regions to match the prompt intent without compromising visual fidelity.

2603.04631 2026-05-28 cs.AI

Towards automated data analysis: A guided framework for LLM-based risk estimation

迈向自动化数据分析:基于LLM的风险评估引导框架

Panteleimon Rodis

AI总结 提出一个在人类指导和监督下利用大语言模型进行数据集风险评估的框架,通过识别模式、生成聚类代码并解释结果,为自动化风险分析奠定基础。

详情
AI中文摘要

大语言模型(LLMs)正越来越多地集成到关键决策流程中,这一趋势引发了对稳健且自动化数据分析的需求。当前数据集风险分析方法局限于手动审计,涉及耗时且复杂的任务,而基于人工智能(AI)的完全自动化分析则存在幻觉和AI对齐问题。为此,本文提出一个在人类指导和监督下集成生成式AI的数据集风险评估框架,旨在为未来的自动化风险分析范式奠定基础。我们的方法利用LLMs识别数据库模式中的语义和结构属性,随后提出聚类技术,为其生成代码,并最终解释产生的结果。人类监督者指导模型进行所需分析,确保过程完整性和与任务目标的一致性。通过概念验证,展示了该框架在风险评估任务中产生有意义结果的可行性。

英文摘要

Large Language Models (LLMs) are increasingly integrated into critical decision-making pipelines, a trend that raises the demand for robust and automated data analysis. Current approaches to dataset risk analysis are limited to manual auditing methods which involve time-consuming and complex tasks, whereas fully automated analysis based on Artificial Intelligence (AI) suffers from hallucinations and issues stemming from AI alignment. To this end, this work proposes a framework for dataset risk estimation that integrates Generative AI under human guidance and supervision, aiming to set the foundations for a future automated risk analysis paradigm. Our approach utilizes LLMs to identify semantic and structural properties in database schemata, subsequently propose clustering techniques, generate the code for them and finally interpret the produced results. The human supervisor guides the model on the desired analysis and ensures process integrity and alignment with the task's objectives. A proof of concept is presented to demonstrate the feasibility of the framework's utility in producing meaningful results in risk assessment tasks.

2602.22769 2026-05-28 cs.AI cs.LG

AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications

AMA-Bench:评估智能体应用的长时记忆

Yujie Zhao, Boqin Yuan, Junbo Huang, Haocheng Yuan, Zhongming Yu, Haozhou Xu, Lanxiang Hu, Abhilash Shankarampeta, Zimeng Huang, Wentao Ni, Yuandong Tian, Jishen Zhao

AI总结 提出AMA-Bench基准,通过真实与合成轨迹评估LLM智能体的长时记忆,并基于因果图与工具增强检索提出AMA-Agent系统,在基准上提升11.16%准确率。

详情
AI中文摘要

大型语言模型(LLM)越来越多地被用作复杂、长时应用中的自主智能体,其中有效的记忆对于持续性能至关重要。然而,现有的记忆基准主要围绕对话,而真实的智能体记忆由连续的智能体-环境交互轨迹组成,包括状态、动作、观察和工具输出。为填补这一空白,我们引入了**AMA-Bench**(任意长度的智能体记忆),一个在现实智能体设置中评估长时记忆的基准。AMA-Bench结合了来自代表性应用的真实智能体轨迹与专家策划的问答,以及可扩展到任意视野的合成轨迹与基于规则的问答。我们的研究表明,现有记忆系统表现不佳,因为它们未能捕获因果和客观信息,并严重依赖有损的基于相似性的检索。我们进一步提出了**AMA-Agent**,一个基于因果图构建和工具增强检索的记忆系统。AMA-Agent在AMA-Bench上达到**57.22%**的准确率,超过最强基线**11.16%**。资源可在[https://ama-bench.github.io/](https://ama-bench.github.io/)获取。

英文摘要

Large Language Models (LLMs) are increasingly used as autonomous agents in complex, long-horizon applications, where effective memory is critical for sustained performance. Yet existing memory benchmarks are largely dialogue-centric, while real agent memory consists of continuous agent-environment interaction trajectories composed of states, actions, observations, and tool outputs. To address this gap, we introduce **AMA-Bench** (**A**gent **M**emory with **A**ny length), a benchmark for evaluating long-horizon memory in realistic agentic settings. AMA-Bench combines real-world agent trajectories from representative applications with expert-curated QA, as well as synthetic trajectories that scale to arbitrary horizons with rule-based QA. Our study shows that existing memory systems underperform because they fail to capture causal and objective information and rely heavily on lossy similarity-based retrieval. We further propose **AMA-Agent**, a memory system based on causality-graph construction and tool-augmented retrieval. AMA-Agent achieves **57.22%** accuracy on AMA-Bench, outperforming the strongest baseline by **11.16%**. Resources are available at: [https://ama-bench.github.io/](https://ama-bench.github.io/).

2603.01766 2026-05-28 cs.RO

Neural Implicit Action Fields: From Discrete Waypoints to Continuous Functions for Vision-Language-Action Models

神经隐式动作场:从离散路点到连续函数的视觉-语言-动作模型

Haoyun Liu, Jianzhuang Zhao, Xinyuan Chang, Tianle Shi, Chuanzhang Meng, Jiayuan Tan, Feng Xiong, Tong Lin, Dongjie Huo, Mu Xu, SongLin Dong, Zhiheng Ma, Yihong Gong, Sheng Zhong

AI总结 针对视觉-语言-动作模型预测离散动作路点与物理运动连续性不匹配的问题,提出神经隐式动作场(NIAF),通过将动作表示从离散路点重构为连续函数,实现任意时间分辨率的连续动作流形合成,支持解析求导和显式速度监督,提升控制平滑性和物理合理性。

Comments Accepted at ICML 2026

详情
AI中文摘要

尽管视觉-语言-动作(VLA)模型取得了快速进展,但将动作块预测为离散路点的普遍做法在结构上与物理运动的内在连续性不一致。这种离散化自然源于固定频率的机器人数据收集和大语言模型的逐词预测范式,但将动作绑定到固定的采样率,不能自然支持解析一致的高阶导数,并引入量化伪影,阻碍精确、柔顺的交互。我们提出神经隐式动作场(NIAF),将块级动作表示从离散路点重构为连续动作函数。通过使用视觉-语言模型作为可学习运动先验上的分层频谱调制器,NIAF 合成具有任意时间分辨率的连续时间动作流形。这种公式支持解析微分,允许显式监督速度和正则化高阶导数信号,以促进数学一致性、物理合理性和控制平滑性。我们的方法在 CALVIN 和 LIBERO 上跨多种骨干网络取得了强劲结果。真实世界实验进一步证实 NIAF 支持稳定的阻抗控制,桥接了策略侧动作生成和执行侧平滑控制。

英文摘要

Despite the rapid progress of vision-language-action (VLA) models, the prevailing practice of predicting action chunks as discrete waypoints remains structurally misaligned with the intrinsic continuity of physical motion. This discretization arises naturally from fixed-rate robot data collection and the token-by-token prediction paradigm of large language models, but ties actions to rigid sampling rates, does not naturally support analytically consistent higher-order derivatives, and introduces quantization artifacts that hinder precise, compliant interaction. We propose Neural Implicit Action Fields (NIAF), which reformulates chunk-level action representation from discrete waypoints to continuous action functions. Using a vision-language model as a hierarchical spectral modulator over a learnable motion prior, NIAF synthesizes continuous-time action manifolds with arbitrary temporal resolution. This formulation enables analytical differentiation, allowing explicit supervision of velocity and regularization of higher-order derivative signals to promote mathematical consistency, physical plausibility, and control smoothness. Our approach achieves strong results on CALVIN and LIBERO across diverse backbones. Real-world experiments further confirm that NIAF supports stable impedance control, bridging policy-side action generation and execution-side smooth control.

2603.00349 2026-05-28 cs.AI cs.MA

COOP$^2$: Defining, Observing, and Repairing Cooperation in LLM Multi-Agent Systems

COOP$^2$: 定义、观察和修复LLM多智能体系统中的合作

Hanqing Yang, Narjes Nourzad, Shiyu Chen, Marie Siew, Jingdi Chen, Carlee Joe-Wong

AI总结 提出COOP$^2$框架,通过将高层合作动态与任务进度关联,定义可验证的合作任务,并开发COOP$^2$-Repair方法预测约束失败并引导修复,提升LLM多智能体系统的任务成功率和约束满足度。

详情
AI中文摘要

许多复杂任务需要超出单个智能体能力的持续努力、多样化能力或协调行动。然而,简单地增加更多智能体并不能保证更好的性能,因为有效的合作取决于智能体之间以及智能体与任务结构之间的交互方式,以满足随时间演变的约束。对于基于LLM的多智能体系统(LLM-MAS),这一挑战被放大:计划、消息和修订以自然语言发生,而任务进展依赖于具体环境中的行动。当前的评估大多将合作视为最终任务成功的隐含因素,使得合作以及多智能体交互对任务动态的影响难以研究。我们引入了COOP$^2$,一个评估框架,将LLM-MAS中的高层智能体合作动态与环境中的任务进展联系起来。COOP$^2$定义了具有可验证合作需求的合作任务,使我们能够分析合作如何随时间相对于任务进展展开,以及合作在何处和为何破裂。基于此框架,我们开发了COOP$^2$-Repair,它从群体计划中预测约束失败,并打开有针对性的修复通道以进行引导修订。在两个环境和三种通信结构下,COOP$^2$-Repair提高了任务成功率和约束满足度,同时暴露了修复所需的额外决策开销和通信负载。项目网页见:https://happyeureka.github.io/coop2。

英文摘要

Many complex tasks require extended effort, diverse capabilities, or coordinated actions beyond what a single agent can provide. However, simply adding more agents does not guarantee better performance, as effective cooperation depends on how agents interact with each other and with task structure to satisfy evolving constraints over time. This challenge is amplified for LLM-based multi-agent systems (LLM-MAS): plans, messages, and revisions occur in natural language, whereas task progress depends on grounded environment actions. Current evaluations mostly treat cooperation as an implicit ingredient of final task success, leaving both cooperation and the effect of multi-agent interaction on task dynamics difficult to study. We introduce COOP$^2$, an evaluation framework that grounds high-level agent cooperation dynamics in LLM-MAS within task progress in the environment. COOP$^2$ then defines cooperative tasks with verifiable cooperative requirements, allowing us to analyze how cooperation unfolds over time with respect to task progress, as well as where and why cooperation breaks down. Building on this framework, we develop COOP$^2$-Repair, which predicts constraint failures from group plans and opens targeted repair channels for guided revisions. Across two environments and three communication structures, COOP$^2$-Repair improves task success and constraint satisfaction while exposing the additional decision overhead and communication load required for repair. The project web page can be found at: https://happyeureka.github.io/coop2.

2603.00309 2026-05-28 cs.AI cs.MA

DIG to Heal: Scaling General-purpose Agent Collaboration via Explainable Dynamic Decision Paths

DIG to Heal: 通过可解释的动态决策路径扩展通用智能体协作

Hanqing Yang, Hyungwoo Lee, Yuhang Yao, Zhiwei Liu, Kay Liu, Jingdi Chen, Carlee Joe-Wong

AI总结 提出动态交互图(DIG)框架,将通用LLM智能体的涌现协作建模为时变因果网络,首次实现协作过程的可观察、可解释与实时纠错。

详情
AI中文摘要

日益流行的智能体AI范式有望利用多个通用大语言模型(LLM)智能体的能力协作完成复杂任务。尽管许多智能体AI系统通过预定义工作流或固定智能体角色来降低复杂性,但理想情况是支持真正自主的智能体,能够在多个交互智能体之间实现涌现协作。然而在实践中,这种非结构化交互常常导致冗余工作和级联故障,难以解释或纠正。在这项工作中,我们研究了由通用LLM智能体组成的多智能体系统,这些智能体通过涌现协作解决问题,而不依赖预定义角色、控制流或通信约束。我们引入了动态交互图(DIG),它将涌现协作捕获为智能体激活和交互的时变因果网络。DIG首次使涌现协作变得可观察和可解释,能够直接从智能体的协作路径中实时识别、解释和纠正协作引发的错误模式。因此,DIG填补了理解通用LLM智能体如何在真正智能体化的多智能体系统中共同解决问题的关键空白。项目网页见:https://happyeureka.github.io/dig。

英文摘要

The increasingly popular agentic AI paradigm promises to harness the power of multiple, general-purpose large language model (LLM) agents to collaboratively complete complex tasks. While many agentic AI systems reduce complexity through predefined workflows or fixed agent roles, the ideal is to support truly autonomous agents capable of emergent collaboration across many interacting agents. Yet in practice, such unstructured interactions often lead to redundant work and cascading failures that are difficult to interpret or correct. In this work, we study multi-agent systems composed of general-purpose LLM agents that solve problems through emergent collaboration, without relying on predefined roles, control flows, or communication constraints. We introduce the Dynamic Interaction Graph (DIG), which captures emergent collaboration as a time-evolving causal network of agent activations and interactions. DIG makes emergent collaboration observable and explainable for the first time, enabling real-time identification, explanation, and correction of collaboration-induced error patterns directly from agents' collaboration paths. Thus, DIG fills a critical gap in understanding how general LLM agents solve problems together in truly agentic multi-agent systems. The project webpage can be found at: https://happyeureka.github.io/dig.

2502.17055 2026-05-28 cs.LG cs.AI

GradientStabilizer:Fix the Norm, Not the Gradient

GradientStabilizer:固定范数,而非梯度

Tianjin Huang, Zhangyang Wang, Haotian Hu, Zhenyu Zhang, Gaojie Jin, Xiang Li, Li Shen, Jiaxing Shang, Tianlong Chen, Ke Li, Lu Liu, Qingsong Wen, Shiwei Liu

AI总结 提出GradientStabilizer,一种轻量级梯度变换方法,通过统计稳定的梯度范数估计替换更新幅度,在不改变梯度方向的前提下抑制极端梯度尖峰,从而提升训练稳定性并减少发散。

Comments Accepted By ICML2026

详情
AI中文摘要

现代深度学习系统中的训练不稳定性通常由罕见但极端的梯度范数尖峰引发,这些尖峰可能导致参数更新过大、破坏优化器状态,并导致缓慢恢复或发散。广泛使用的保护措施如梯度裁剪可以缓解这些故障,但需要调整阈值且不加区分地截断大更新。我们提出GradientStabilizer,一种轻量级、即插即用的梯度变换方法,它在保留瞬时梯度方向的同时,用从运行梯度范数统计中导出的统计稳定估计替换更新幅度。我们证明了在尖峰步骤上,得到的稳定幅度一致有界,与尖峰大小无关,并展示了这种有界性如何控制自适应方法中优化器状态的演化。在LLM预训练(FP16)、量化感知预训练(FP4)、ImageNet分类、强化学习和时间序列预测中,GradientStabilizer一致地提高了训练稳定性,扩大了稳定学习率区域,并相对于基于裁剪的基线减少了发散,甚至显著降低了Adam对权重衰减强度的敏感性。代码即将发布。

英文摘要

Training instability in modern deep learning systems is frequently triggered by rare but extreme gradient-norm spikes, which can induce oversized parameter updates, corrupt optimizer state, and lead to slow recovery or divergence. Widely used safeguards such as gradient clipping mitigate these failures but require threshold tuning and indiscriminately truncate large updates. We propose GradientStabilizer, a lightweight, drop-in gradient transform that preserves the instantaneous gradient direction while replacing the update magnitude with a statistically stabilized estimate derived from running gradient-norm statistics. We prove that the resulting stabilized magnitude is uniformly bounded on spike steps, independent of the spike size, and show how this boundedness controls optimizer state evolution in adaptive methods. Across LLM pre-training (FP16), quantization-aware pre-training (FP4), ImageNet classification, reinforcement learning, and time-series forecasting, GradientStabilizer consistently improves training stability, widens stable learning-rate regions, and reduces divergence relative to clipping-based baselines, even substantially reducing Adam's sensitivity to weight-decay strength. Code will be released soon.

2602.22787 2026-05-28 cs.CL cs.AI

Probing for Knowledge Attribution in Large Language Models

探测大型语言模型中的知识归因

Ivo Brink, Alexander Boer, Dennis Ulmer

AI总结 本文通过线性探针从隐藏表示中分类大型语言模型输出的主导知识来源(记忆或上下文),并引入自监督流水线AttriWiki生成训练数据,在多个模型和数据集上达到高F1分数。

详情
AI中文摘要

大型语言模型(LLM)的幻觉,即流畅但事实不正确的生成,分为两类:忠实性违反,即模型误用提供的上下文;以及事实性违反,即答案反映内部知识中的错误。适当的缓解取决于知道哪个来源驱动每个答案。我们研究贡献性归因,即对每个输出背后的主导知识来源进行分类,并表明在隐藏表示上训练的简单线性探针可以可靠地识别它。我们引入了AttriWiki,一个自监督流水线,通过提示模型从记忆中回忆被隐藏的实体或从上下文中读取它们,而不依赖知识冲突,自动生成标记的训练数据。在AttriWiki上训练的探针在Llama-3.1-8B、Mistral-7B和Qwen-7B上达到高达0.96的Macro-$F_1$,迁移到SQuAD和WebQuestions时达到0.94-0.99的Macro-$F_1$,并零样本泛化到Tighidet等人(2024)的基准,在冲突设置上无需重新训练即优于他们的探针。此外,归因不匹配会使错误率提高高达70%,尽管正确的归因并不能保证正确的答案,这表明需要更广泛的检测框架。

英文摘要

Large language model (LLM) hallucinations, meaning fluent but factually incorrect generations, fall into two types: faithfulness violations, where the model misuses provided context, and factuality violations, where answers reflect errors in internal knowledge. Proper mitigation depends on knowing which source drives each answer. We study contributive attribution, i.e. the classification of the dominant knowledge source behind each output, and show that a simple linear probe trained on hidden representations can reliably identify it. We introduce AttriWiki, a self-supervised pipeline that automatically generates labelled training data by prompting models to recall withheld entities from memory or read them from context without relying on knowledge conflicts. Probes trained on AttriWiki achieve up to 0.96 Macro-$F_1$ on Llama-3.1-8B, Mistral-7B, and Qwen-7B, transfer to SQuAD and WebQuestions with 0.94-0.99 Macro-$F_1$, and generalise zero-shot to Tighidet et al. (2024)'s benchmark, outperforming their probe on conflicting settings without retraining. Furthermore, attribution mismatches raise error rates by up to 70%, though correct attribution does not guarantee correct answers, pointing to the need for broader detection frameworks.

2602.22096 2026-05-28 cs.CV

WeatherCity: Urban Scene Reconstruction with Controllable Multi-Weather Transformation

WeatherCity: 可控多天气变换的城市场景重建

Wenhua Wu, Huai Guan, Zhe Liu, Hesheng Wang

AI总结 提出WeatherCity框架,利用文本引导的图像编辑、天气高斯表示和物理驱动模型,实现高保真、时间一致的4D城市场景重建与多天气编辑。

详情
AI中文摘要

可编辑的高保真4D场景对于自动驾驶至关重要,因为它们可以应用于端到端训练和闭环仿真。然而,现有的重建方法主要局限于复制观察到的场景,缺乏多样化的天气模拟能力。而图像级别的天气编辑方法往往引入场景伪影,并且对天气效果的可控性较差。为了解决这些限制,我们提出了 extbf{WeatherCity},一个用于4D城市场景重建和天气编辑的新框架。具体来说,我们利用文本引导的图像编辑模型来实现图像天气背景的灵活编辑。为了应对多天气建模的挑战,我们引入了一种基于共享场景特征和专用天气解码器的新型天气高斯表示。这种表示进一步通过内容一致性优化得到增强,确保不同天气条件下的连贯建模。此外,我们设计了一个物理驱动模型,通过粒子和运动模式模拟动态天气效果。在多个数据集和各种场景上的大量实验表明,WeatherCity在4D重建和天气编辑中实现了灵活的可控性、高保真度和时间一致性。我们的框架不仅能够对天气条件(例如小雨和大雪)进行细粒度控制,还支持场景内的物体级操作。代码已发布在https://github.com/IRMVLab/WeatherCity。

英文摘要

Editable high-fidelity 4D scenes are crucial for autonomous driving, as they can be applied to end-to-end training and closed-loop simulation. However, existing reconstruction methods are primarily limited to replicating observed scenes and lack the capability for diverse weather simulation. While image-level weather editing methods tend to introduce scene artifacts and offer poor controllability over the weather effects. To address these limitations, we propose \textbf{WeatherCity}, a novel framework for 4D urban scene reconstruction and weather editing. Specifically, we leverage a text-guided image editing model to achieve flexible editing of image weather backgrounds. To tackle the challenge of multi-weather modeling, we introduce a novel weather Gaussian representation based on shared scene features and dedicated weather-specific decoders. This representation is further enhanced with a content consistency optimization, ensuring coherent modeling across different weather conditions. Additionally, we design a physics-driven model that simulates dynamic weather effects through particles and motion patterns. Extensive experiments on multiple datasets and various scenes demonstrate that WeatherCity achieves flexible controllability, high fidelity, and temporal consistency in 4D reconstruction and weather editing. Our framework not only enables fine-grained control over weather conditions (e.g., light rain and heavy snow) but also supports object-level manipulation within the scene. Codes are released at https://github.com/IRMVLab/WeatherCity.

2602.20020 2026-05-28 cs.CL

CodeGENCAT: Generative Computerized Adaptive Testing for Open-ended Coding Problems

CodeGENCAT:面向开放式编程问题的生成式计算机自适应测试

Wanyong Feng, Alexander Scarlatos, Ruochen Sun, Andrew Lan

AI总结 提出CodeGENCAT框架,通过生成式项目反应理论模型预测学生代码响应,并设计三种选题算法,在编程教育数据集上优于现有CAT基线。

Comments 23 pages, 2 figures

详情
AI中文摘要

现有的计算机自适应测试(CAT)框架通常根据预测的学生正确回答概率来选题。这种设计忽略了学生开放式回答中包含的信息,尤其是在编程教育等领域,代码结构和错误蕴含丰富的学生知识信息。在这项工作中,我们提出了 extbf{Code} extbf{GEN}erative extbf{CAT}( extbf{CodeGENCAT}),一种使用预测的学生代码响应来选题的生成式CAT框架。首先,我们开发了一个生成式项目反应理论(GIRT)模型,该模型根据估计的学生知识生成代码响应,通过监督微调和直接偏好优化进行知识-响应对齐训练。其次,我们引入了三种选题算法,分别衡量不确定性、编码风格多样性以及从预测的学生代码响应中提取的信息。在两个真实世界的编程教育数据集上的实验表明,CodeGENCAT优于所有CAT基线,在自适应测试早期阶段,AUC比最强基线提高了4.32%。

英文摘要

Existing Computerized Adaptive Testing (CAT) frameworks typically select questions based on the predicted likelihood that the student will answer correctly. This design ignores information contained in students' open-ended responses, especially in domains such as programming education, where code structures and bugs contain rich information on student knowledge. In this work, we propose \textbf{Code} \textbf{GEN}erative \textbf{CAT} (\textbf{CodeGENCAT}), a generative CAT framework that selects questions using predicted student code responses. First, we develop a Generative Item Response Theory (GIRT) model that generates code responses conditioned on estimated student knowledge, trained with supervised fine-tuning followed by direct preference optimization for knowledge-response alignment. Second, we introduce three question-selection algorithms that measure uncertainty, coding style diversity, and information from predicted student code responses. Experiments on two real-world programming education datasets show that CodeGENCAT outperforms all CAT baselines, achieving an AUC improvement of up to 4.32\% over the strongest baseline in the early stages of adaptive testing.

2602.18647 2026-05-28 cs.LG cs.AI cs.CV cs.IT math.IT

Noise Scheduling as Information-Guided Allocation in Diffusion Training

噪声调度作为扩散训练中的信息引导分配

Gabriel Raya, Bac Nguyen, Georgios Batzolis, Yuhta Takida, Dejan Stancevic, Naoki Murata, Chieh-Hsin Lai, Yuki Mitsufuji, Luca Ambrogioni

AI总结 提出InfoNoise,一种在线自适应噪声调度方法,通过估计条件熵率剖面动态调整训练噪声分布,以优化去噪任务中的信息增益,在图像、DNA和语言生成等任务中达到或超越基线,并节省高达3倍训练计算量。

详情
AI中文摘要

我们引入了InfoNoise,一种用于扩散训练的在线自适应噪声调度,它将优化努力重新分配到去噪最具信息量的噪声水平上。与损失加权一起,噪声调度在去噪问题之间诱导出有效的分配,而这种分配通常在知道信息性噪声水平之前就已固定。InfoNoise通过从训练期间的去噪损失中估计条件熵率剖面,使这种分配具有数据自适应性,无需辅助模型或离线搜索。通过I--MMSE,该剖面识别出噪声观测在何处能快速减少关于干净样本的不确定性,并指导训练噪声分布的适应。它只改变这个分布,保持目标、加权和参数化不变。在图像基准测试中,调度已被广泛调整,InfoNoise匹配或略微超过强基线,并且可以用更少的更新达到相同的质量。在表示、序列和模态转换(包括DNA和语言生成)上,InfoNoise优于固定和自适应基线,并且达到目标质量所需的训练计算量最多减少3倍。这些结果确立了条件熵率剖面作为噪声调度设计的数据依赖目标,并使在线自适应成为手动调度搜索的实用替代方案。

英文摘要

We introduce InfoNoise, an online adaptive noise schedule for diffusion training that reallocates optimization effort toward noise levels where denoising is most informative. Together with loss weighting, a noise schedule induces an effective allocation across denoising problems, often fixed before informative noise levels are known. InfoNoise makes this allocation data-adaptive by estimating a conditional-entropy-rate profile from denoising losses during training, without auxiliary models or offline search. Through I--MMSE, this profile identifies where noisy observations rapidly reduce uncertainty about the clean sample and guides adaptation of the training noise distribution. It changes only this distribution, keeping the objective, weighting, and parameterization fixed. On image benchmarks, where schedules have been extensively tuned, InfoNoise matches or slightly exceeds strong baselines and can reach the same quality with fewer updates. On representation, sequence, and modality shifts, including DNA and language generation, InfoNoise improves over fixed and adaptive baselines and reaches target quality with up to $3\times$ less training compute. These results establish the conditional-entropy-rate profile as the data-dependent target for noise schedule design and make online adaptation a practical alternative to manual schedule search.

2602.17003 2026-05-28 cs.CL cs.AI

Persona2Web: Benchmarking Personalized Web Agents for Contextual Reasoning with User History

Persona2Web: 基于用户历史进行上下文推理的个性化Web智能体基准

Serin Kim, Sangam Lee, Dongha Lee

AI总结 提出Persona2Web基准,通过澄清-个性化原则评估Web智能体在真实开放网络中利用用户历史解决模糊查询的个性化能力,并引入推理感知评估框架。

Comments Accepted to ICML 2026

详情
AI中文摘要

大型语言模型推动了Web智能体的发展,但当前的智能体缺乏个性化能力。由于用户很少明确说明其意图的每个细节,实用的Web智能体必须能够通过推断用户偏好和上下文来解释模糊查询。为应对这一挑战,我们提出了Persona2Web,这是首个在真实开放网络上评估个性化Web智能体的基准,基于澄清-个性化原则构建,要求智能体根据用户历史而非依赖显式指令来解决歧义。Persona2Web包括:(1) 在长时间跨度内隐含揭示偏好的用户历史,(2) 需要智能体推断隐含用户偏好的模糊查询,以及(3) 一个推理感知评估框架,能够对个性化进行细粒度评估。我们针对各种智能体架构、骨干模型、历史访问方案和不同模糊程度的查询进行了广泛实验,揭示了个性化Web智能体行为中的关键挑战。为便于复现,我们的代码和数据集公开在 https://serin-kimm.github.io/Persona2Web/。

英文摘要

Large language models have advanced web agents, yet current agents lack personalization capabilities. Since users rarely specify every detail of their intent, practical web agents must be able to interpret ambiguous queries by inferring user preferences and contexts. To address this challenge, we present Persona2Web, the first benchmark for evaluating personalized web agents on the real open web, built upon the clarify-to-personalize principle, which requires agents to resolve ambiguity based on user history rather than relying on explicit instructions. Persona2Web consists of: (1) user histories that reveal preferences implicitly over long time spans, (2) ambiguous queries that require agents to infer implicit user preferences, and (3) a reasoning-aware evaluation framework that enables fine-grained assessment of personalization. We conduct extensive experiments across various agent architectures, backbone models, history access schemes, and queries with varying ambiguity levels, revealing key challenges in personalized web agent behavior. For reproducibility, our codes and datasets are publicly available at https://serin-kimm.github.io/Persona2Web/.

2602.16872 2026-05-28 cs.CV

DODO: Discrete OCR Diffusion Models

DODO: 离散OCR扩散模型

Sean Man, Gilad Deutch, Roy Ganz, Roi Ronen, Shahar Tsiper, Shai Mazor, Niv Nayman

AI总结 针对OCR任务中自回归解码速度慢的问题,提出首个利用块离散扩散的VLM模型DODO,在保持高精度的同时实现高达5倍的推理加速。

详情
AI中文摘要

光学字符识别(OCR)是数字化信息的基础任务,是视觉数据与文本理解之间的关键桥梁。虽然现代视觉语言模型(VLM)在该领域取得了高精度,但它们主要依赖自回归解码,这需要为每个生成的token进行顺序前向传播,因此在处理长文档时计算成本高且速度慢。我们发现了一个克服这一瓶颈的关键机会:与开放式生成不同,OCR是一个高度确定性的任务,视觉输入严格决定了唯一的输出序列,理论上可以通过扩散模型实现高效的并行解码。然而,我们表明现有的掩码扩散模型未能利用这一潜力;它们引入了结构不稳定性,这在灵活任务(如字幕生成)中无害,但对于OCR的刚性精确匹配要求则是灾难性的。为了弥合这一差距,我们引入了DODO,这是首个利用块离散扩散并释放其OCR加速潜力的VLM。通过将生成分解为块,DODO减轻了全局扩散的同步误差。实验上,我们的方法在实现接近最先进精度的同时,与自回归基线相比,推理速度提高了5倍。

英文摘要

Optical Character Recognition (OCR) is a fundamental task for digitizing information, serving as a critical bridge between visual data and textual understanding. While modern Vision-Language Models (VLM) have achieved high accuracy in this domain, they predominantly rely on autoregressive decoding, which becomes computationally expensive and slow for long documents as it requires a sequential forward pass for every generated token. We identify a key opportunity to overcome this bottleneck: unlike open-ended generation, OCR is a highly deterministic task where the visual input strictly dictates a unique output sequence, theoretically enabling efficient, parallel decoding via diffusion models. However, we show that existing masked diffusion models fail to harness this potential; those introduce structural instabilities that are benign in flexible tasks, like captioning, but catastrophic for the rigid, exact-match requirements of OCR. To bridge this gap, we introduce DODO, the first VLM to utilize block discrete diffusion and unlock its speedup potential for OCR. By decomposing generation into blocks, DODO mitigates the synchronization errors of global diffusion. Empirically, our method achieves near state-of-the-art accuracy while enabling up to 5x faster inference compared to autoregressive baselines.

2602.16837 2026-05-28 cs.LG

A Structural Theory of Position Bias in Transformers

Transformer中位置偏差的结构理论

Hanna Herasimchyk, Robin Labryga, Tomislav Prusina, Sören Laue

AI总结 本文通过残差感知累积注意力展开,提出一种结构理论解释因果Transformer中位置偏差的起源,并揭示残差连接如何改变无限深度下的注意力动力学,从而解释Lost-in-the-Middle现象。

Comments Revised version with improved presentation

详情
AI中文摘要

Transformer模型系统性地偏好某些token位置,但这一位置偏差的架构起源仍知之甚少。这种偏差与“中间丢失”现象密切相关,即模型未能充分利用上下文中间位置的信息。我们证明,中间丢失类型的行为可能源于因果Transformer本身的架构。为此,我们基于残差感知累积注意力展开,发展了一种位置偏差的结构理论。在有限深度下,因果掩码和残差连接导致广泛的、通常是U形的影响分布。在无限深度下,我们的框架解决了先前仅注意力的坍缩理论与实际Transformer行为之间的差异:残差连接从根本上改变了累积注意力动力学。实验上,预测的影响分布与预训练语言模型中测量的输入token影响高度吻合。

英文摘要

Transformer models systematically favor certain token positions, yet the architectural origins of this position bias remain poorly understood. This bias is closely connected to the Lost-in-the-Middle phenomenon, where models underutilize information placed in the middle of the context. We show that Lost-in-the-Middle-type behavior can arise from the architecture of causal Transformers itself. To do so, we develop a structural theory of position bias based on residual-aware cumulative attention rollout. At finite depth, causal masking and residual connections induce broad, often U-shaped, influence profiles. At infinite depth, our framework resolves a discrepancy between prior attention-only collapse theory and practical Transformer behavior: residual connections fundamentally change cumulative attention dynamics. Empirically, the predicted profiles closely match measured input-token influence in pretrained language models.

2602.16284 2026-05-28 cs.LG

Fast KV Compaction via Attention Matching

通过注意力匹配实现快速KV压缩

Adam Zweiger, Xinghong Fu, Han Guo, Yoon Kim

AI总结 提出通过注意力匹配在潜在空间快速压缩键值缓存的方法,以保持注意力输出并实现高达50倍压缩且质量损失小。

详情
AI中文摘要

将语言模型扩展到长上下文通常受到键值(KV)缓存大小的瓶颈限制。在实际部署中,长上下文通常通过摘要化在令牌空间中进行压缩管理。然而,摘要化可能高度有损,严重损害下游性能。最近关于Cartridges的工作表明,可以在潜在空间中训练高度紧凑的KV缓存,以紧密匹配全上下文性能,但代价是缓慢且昂贵的端到端优化。本文描述了一种通过注意力匹配在潜在空间中快速上下文压缩的方法,该方法构建紧凑的键和值以再现注意力输出并在每个KV头级别保持注意力质量。我们表明,该公式自然分解为简单的子问题,其中一些子问题允许高效的闭式解。在此框架内,我们开发了一系列方法,显著推动了压缩时间与质量的帕累托前沿,在某些数据集上实现了高达50倍的压缩,且几乎没有质量损失。

英文摘要

Scaling language models to long contexts is often bottlenecked by the size of the key-value (KV) cache. In deployed settings, long contexts are typically managed through compaction in token space via summarization. However, summarization can be highly lossy, substantially harming downstream performance. Recent work on Cartridges has shown that it is possible to train highly compact KV caches in latent space that closely match full-context performance, but at the cost of slow and expensive end-to-end optimization. This work describes an approach for fast context compaction in latent space through Attention Matching, which constructs compact keys and values to reproduce attention outputs and preserve attention mass at a per-KV-head level. We show that this formulation naturally decomposes into simple subproblems, some of which admit efficient closed-form solutions. Within this framework, we develop a family of methods that significantly push the Pareto frontier of compaction time versus quality, achieving up to 50x compaction in seconds on some datasets with little quality loss.

2602.15894 2026-05-28 cs.CL cs.LG

Quality-constrained Entropy Maximization Policy Optimization for LLM Diversity

质量约束的熵最大化策略优化用于LLM多样性

Haihui Pan, Yuzhong Hong, Kaichen Zhang, Shaoke Lv, Junwei Bao, Hongfei Jiang, Yang Song

AI总结 提出QEMPO框架,通过理论推导的闭式解在保证输出质量的同时最大化熵以提升LLM多样性,实验证明其在不牺牲质量的情况下提升多样性。

详情
AI中文摘要

在许多大语言模型(LLM)对齐应用中,用户不仅期望高质量输出,还希望有显著的多样性。然而,现有方法通常面临这些目标之间的根本权衡:提高输出质量的方法往往会降低多样性,而增加多样性的方法往往以牺牲质量为代价。在这项工作中,我们提出了质量约束的熵最大化策略优化(QEMPO),这是一个新颖的框架,在明确保持输出质量的同时增强LLM输出的多样性。QEMPO建立在坚实的理论基础之上:我们推导出一个闭式解析解,该解在质量约束下可证明地最大化熵(多样性的原则性度量),并在定义的目标下保证最优性。利用这一解,QEMPO自然支持在线和离线训练设置。实验结果表明,QEMPO在不牺牲质量的情况下持续提高输出多样性,并且在许多情况下,与现有基线相比,在质量和多样性两个维度上都取得了提升,与我们的理论保证一致。

英文摘要

In many large language model (LLM) alignment applications, users expect not only high-quality outputs but also substantial diversity. However, existing methods often face a fundamental trade-off between these objectives: approaches that improve output quality tend to reduce diversity, while methods that increase diversity often do so at the expense of quality. In this work, we propose Quality-constrained Entropy Maximization Policy Optimization (QEMPO), a novel framework that enhances the diversity of LLM outputs while explicitly preserving output quality. QEMPO is grounded in a strong theoretical foundation: we derive a closed-form analytical solution that provably maximizes entropy-a principled measure of diversity-subject to a quality constraint, with guarantees on optimality under the defined objective. Leveraging this solution, QEMPO naturally supports both online and offline training settings. Empirical results demonstrate that QEMPO consistently improves output diversity without sacrificing quality, and in many cases yields gains in both dimensions compared to existing baselines, aligning with our theoretical guarantees.

2601.16800 2026-05-28 cs.CL

Large Language Models as Automatic Annotators and Annotation Adjudicators for Fine-Grained Opinion Analysis

大语言模型作为细粒度意见分析的自动标注者和标注裁决者

Gaurav Negi, MA Waskow, John McCrae, Omnia Zayed, Paul Buitelaar

AI总结 本文探索使用大语言模型作为自动标注者进行细粒度意见分析,提出声明式标注流水线和LLM裁决方法,实验表明LLM在跨度级别可靠但难以再现关系结构,更适合作为标注助手而非完全替代人类。

详情
AI中文摘要

文本的细粒度意见分析提供了对表达情感的详细理解,包括所涉及的实体。尽管这种详细程度很有价值,但在数据集中标注意见以训练模型需要大量人力投入和成本,尤其是在不同领域和实际应用中。为了解决领域特定标注数据集的短缺,我们探索了LLM作为自动标注者进行细粒度意见分析的可行性。我们使用声明式标注流水线,这种方法减少了在使用LLM识别文本中细粒度意见跨度时手动提示工程的可变性。我们还提出了一种专门的方法,让LLM裁决多个标签并产生最终标注。我们使用不同大小的模型在方面情感三元组提取(ASTE)和方面-类别-意见-情感(ACOS)分析任务上试用了该流水线。在这项工作中,我们试图开发完全自主的基于LLM的标注者,但我们的结果揭示了一个不均衡的画面,其特点是关键的性能分叉:LLM在跨度级别可靠,但难以忠实地再现连接这些跨度的关系结构。这表明LLM更适合作为高保真标注助手和数据增强工具,以扩展细粒度意见标注数据集,而不是完全取代人类标注者。

英文摘要

Fine-grained opinion analysis of text provides a detailed understanding of expressed sentiments, including the addressed entity. Although this level of detail is valuable, annotating opinions in datasets for model training requires considerable human effort and substantial cost, especially across diverse domains and real-world applications. To address this shortage of domain-specific labelled datasets, we explore the feasibility of LLMs as automatic annotators for fine-grained opinion analysis. We use a declarative annotation pipeline, an approach that reduces the variability of manual prompt engineering when using LLMs to identify fine-grained opinion spans in text. We also present a dedicated methodology for an LLM to adjudicate multiple labels and produce final annotations. We trial the pipeline with models of different sizes for the Aspect Sentiment Triplet Extraction (ASTE) and Aspect-Category-Opinion-Sentiment (ACOS) analysis tasks. In this work, we attempt to develop fully autonomous LLM-based annotators, but our results reveal an uneven picture characterised by a critical performance bifurcation: LLMs are reliable at the span level yet struggle to faithfully reproduce the relational structures that connect those spans. This suggests that LLMs are better positioned as high-fidelity annotation assistants and data augmentation tools to expand fine-grained opinion-annotated datasets, rather than replacing human annotators entirely.

2602.15515 2026-05-28 cs.LG cs.AI

The Obfuscation Atlas: Mapping Where Honesty Emerges in RLVR with Deception Probes

混淆图谱:使用欺骗探针映射RLVR中诚实出现的位置

Mohammad Taufeeque, Stefan Heimersheim, Adam Gleave, Chris Cundy

AI总结 本文通过构建一个自然产生奖励黑客行为的编码环境,研究在对抗白盒欺骗检测器训练时模型出现的混淆策略,并引入分类法分析诚实、混淆激活和混淆策略三种结果。

Comments Accepted at ICML 2026 (Oral presentation). 30 pages, 14 figures

详情
AI中文摘要

针对白盒欺骗检测器的训练已被提议作为使AI系统诚实的一种方法。然而,这种训练存在模型学习混淆其欺骗行为以逃避检测器的风险。先前的工作仅在人为设置中研究混淆,其中模型因有害输出而直接获得奖励。我们构建了一个真实的编码环境,其中通过硬编码测试用例的奖励黑客行为自然发生,并表明混淆在此环境中出现。我们引入了在对抗欺骗检测器训练时可能结果的分类法。模型要么保持诚实,要么通过两种可能的混淆策略变得欺骗。(i)混淆激活:模型输出欺骗性文本,同时修改其内部表示以不再触发检测器。(ii)混淆策略:模型输出逃避检测器的欺骗性文本,通常包括对奖励黑客行为的理由。实验上,混淆激活源于RL期间的表示漂移,无论是否有检测器惩罚。检测器惩罚仅激励混淆策略;我们从理论上表明,对于策略梯度方法,这是预期的。足够高的KL正则化和检测器惩罚可以产生诚实策略,从而确立白盒欺骗检测器作为易受奖励黑客行为任务的有效训练信号。

英文摘要

Training against white-box deception detectors has been proposed as a way to make AI systems honest. However, such training risks models learning to obfuscate their deception to evade the detector. Prior work has studied obfuscation only in artificial settings where models were directly rewarded for harmful output. We construct a realistic coding environment where reward hacking via hardcoding test cases naturally occurs, and show that obfuscation emerges in this setting. We introduce a taxonomy of possible outcomes when training against a deception detector. The model either remains honest, or becomes deceptive via two possible obfuscation strategies. (i) Obfuscated activations: the model outputs deceptive text while modifying its internal representations to no longer trigger the detector. (ii) Obfuscated policy: the model outputs deceptive text that evades the detector, typically by including a justification for the reward hack. Empirically, obfuscated activations arise from representation drift during RL, with or without a detector penalty. The detector penalty only incentivizes obfuscated policies; we theoretically show this is expected for policy gradient methods. Sufficiently high KL regularization and detector penalty can yield honest policies, establishing white-box deception detectors as viable training signals for tasks prone to reward hacking.

2602.13748 2026-05-28 cs.CL cs.CV

RMPL: Relation-aware Multi-task Progressive Learning with Stage-wise Training for Multimedia Event Extraction

RMPL:基于关系感知的多任务渐进学习与分阶段训练的多媒体事件抽取

Yongkang Jin, Jianwen Luo, Jingjing Wang, Jianmin Yao, Yu Hong

AI总结 提出RMPL框架,通过分阶段训练结合单模态事件抽取和多模态关系抽取的异构监督,在低资源条件下实现多媒体事件抽取,并在M2E2基准上取得一致改进。

Comments Accepted by ACM ICMR 2026

详情
AI中文摘要

多媒体事件抽取(MEE)旨在从包含文本和图像的文档中识别事件及其论元。它需要跨不同模态对事件语义进行 grounding。MEE 的进展受到缺乏标注训练数据的限制。M2E2 是唯一已建立的基准,但它仅提供评估用的标注。这使得直接监督训练不切实际。现有方法主要依赖于跨模态对齐或使用视觉-语言模型(VLM)进行推理时提示。这些方法没有显式学习结构化的事件表示,并且通常在多模态设置中产生较弱的论元 grounding。为解决这些限制,我们提出了 RMPL,一种用于低资源条件下 MEE 的基于关系感知的多任务渐进学习框架。RMPL 通过分阶段训练整合了来自单模态事件抽取和多模态关系抽取的异构监督。模型首先使用统一模式进行训练,以学习跨模态的共享事件中心表示。然后,使用混合文本和视觉数据对模型进行微调,以进行事件提及识别和论元角色抽取。在 M2E2 基准上使用多个 VLM 进行的实验表明,在不同模态设置下均取得了一致的改进。

英文摘要

Multimedia Event Extraction (MEE) aims to identify events and their arguments from documents that contain both text and images. It requires grounding event semantics across different modalities. Progress in MEE is limited by the lack of annotated training data. M2E2 is the only established benchmark, but it provides annotations only for evaluation. This makes direct supervised training impractical. Existing methods mainly rely on cross-modal alignment or inference-time prompting with Vision--Language Models (VLMs). These approaches do not explicitly learn structured event representations and often produce weak argument grounding in multimodal settings. To address these limitations, we propose RMPL, a Relation-aware Multi-task Progressive Learning framework for MEE under low-resource conditions. RMPL incorporates heterogeneous supervision from unimodal event extraction and multimedia relation extraction with stage-wise training. The model is first trained with a unified schema to learn shared event-centric representations across modalities. It is then fine-tuned for event mention identification and argument role extraction using mixed textual and visual data. Experiments on the M2E2 benchmark with multiple VLMs show consistent improvements across different modality settings.