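arXivDaily: a daily academic digest synced with the full arXiv feed, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems, and more.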

2507.22554 2026-05-28 cs.LG

DeepC4: Deep Conditional Census-Constrained Clustering for Large-scale Multitask Spatial Disaggregation of Urban Morphology

Joshua Dimasaka, Christian Geiß, Emily So

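AI summary: Proposes DeepC4, a multitask deep learning method that uses local census statistics as clustering constraints while jointly learning satellite imagery patterns, for coarse-to-fine spatial disaggregation of urban morphology; it outperforms existing methods on Rwandan data.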

Comments Major Revised Preprint Submitted to ISPRS Journal of Photogrammetry and Remote Sensing (in review) | Keywords: urban morphology, building exposure, physical vulnerability, spatial disaggregation, deep clustering | Data: https://doi.org/10.5281/zenodo.13119552 | Code: https://github.com/riskaudit/DeepC4

Abstract

To understand our global progress for sustainable development and disaster risk reduction in many developing economies, two recent major initiatives - the Uniform African Exposure Dataset of the Global Earthquake Model (GEM) Foundation and the Modelling Exposure through Earth Observation Routines (METEOR) Project - implemented classical spatial disaggregation techniques to generate large-scale mapping of urban morphology using the information from various satellite imagery and its derivatives, geospatial datasets of the built environment, and subnational census statistics. However, the local discrepancy with well-validated census statistics and the propagated model uncertainties remain a challenge in such coarse-to-fine-grained mapping problems, specifically constrained by weak and conditional label supervision. Therefore, we present Deep Conditional Census-Constrained Clustering (DeepC4), a novel deep learning-based spatial disaggregation approach that incorporates local census statistics as cluster-level constraints while considering multiple conditional label relationships in a joint multitask learning of the patterns of satellite imagery. As a demonstration using Rwandan urban morphology, DeepC4 achieves macro-F1 scores of 0.63, 0.78, and 0.45 and macro-mIoU of 0.57, 0.71, and 0.42 for roof, wall, and height prediction respectively, estimates national dwelling and occupant counts within 1.13% and 1.11% error compared to census records, outperforming GEM (2.03% and 3.29%), and occupies 32%-49% more 500-meter grid pixels than METEOR across provinces. As the world approaches the conclusion of many global frameworks in 2030, our work offers a new deep learning-based mapping technique that explicitly encodes well-validated census and experts' belief systems to achieve an explainable and interpretable auditing of existing coarse-grained derived information at large scales.

2602.13524 2026-05-28 cs.LG cs.AI

Singular Vectors of Attention Heads Align with Features

Gabriel Franco, Carson Loughridge, Mark Crovella

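AI summary: Through theoretical analysis and experiments, this paper explains why and under what conditions the singular vectors of attention heads align with feature representations, and proposes sparse attention decomposition as a testable prediction of alignment.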

Comments To be published in ICML 2026

Abstract

Identifying feature representations in language models is a central task in mechanistic interpretability. Several recent studies have made the observation that feature representations can be inferred in some cases from singular vectors of attention matrices. However, sound justification for this phenomenon is lacking. In this paper we address that question, asking: why and when do singular vectors align with features? First, we demonstrate that singular vectors robustly align with features in a model where features can be directly observed. We then show theoretically that such alignment is expected under a range of conditions. We close by asking how, operationally, alignment may be recognized in real models where feature representations are not directly observable. We identify sparse attention decomposition as a testable prediction of alignment, and show evidence that it emerges in real models in a manner consistent with predictions. Together these results suggest that alignment of singular vectors with features can be a sound and theoretically justified basis for feature identification in language models.
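
To make the object of study concrete, here is a minimal sketch of how the singular vectors of one attention head can be computed and how the pre-softmax score splits into per-component terms. The weights `W_Q` and `W_K`, the dimensions, and the random inputs are all hypothetical; reading "sparse attention decomposition" as "only a few of these terms dominate" is an assumption and is not spelled out in the abstract.

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, d_head = 16, 4

# Hypothetical query/key projection weights of a single attention head.
W_Q = rng.normal(size=(d_model, d_head))
W_K = rng.normal(size=(d_model, d_head))

# The pre-softmax score between residual vectors x (destination) and
# y (source) is the bilinear form x^T (W_Q W_K^T) y.
W_QK = W_Q @ W_K.T
U, S, Vt = np.linalg.svd(W_QK)

x = rng.normal(size=d_model)
y = rng.normal(size=d_model)

score = x @ W_QK @ y
# One term per singular component: sigma_k * (x . u_k) * (v_k . y).
contributions = S * (x @ U) * (Vt @ y)
```

The terms in `contributions` sum to `score`, and at most `d_head` of them can be nonzero because `W_QK` has rank at most `d_head`.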

2602.13075 2026-05-28 cs.LG

Unified Multi-Domain Graph Pre-training for Homogeneous and Heterogeneous Graphs via Domain-Specific Expert Encoding

Chundong Liang, Yongqi Huang, Dongxiao He, Peiyuan Li, Yawen Li, Di Jin, Weixiong Zhang

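AI summary: Proposes GPH², a unified multi-domain graph pre-training method that uses domain-specific expert encoding and a task-oriented expert fusion strategy to address cross-domain distribution shift in mixed homogeneous and heterogeneous graph settings.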

Comments 12 pages, 7 figures

Abstract

Graph pre-training has achieved remarkable success in recent years, delivering transferable representations for downstream adaptation. However, most existing methods are designed for either homogeneous or heterogeneous graphs, thereby hindering unified graph modeling across diverse graph types. This separation contradicts real-world applications, where mixed homogeneous and heterogeneous graphs are ubiquitous, and distribution shifts between upstream pre-training and downstream deployment are common. In this paper, we empirically demonstrate that a balanced mixture of homogeneous and heterogeneous graph pre-training benefits downstream tasks and propose a unified multi-domain Graph Pre-training method across Homogeneous and Heterogeneous graphs (GPH²). To address the lack of a unified encoder for homogeneous and heterogeneous graphs, we propose a Unified Multi-View Graph Construction that simultaneously encodes both without explicit graph-type-specific designs. To cope with the increased cross-domain distribution discrepancies arising from mixed graphs, we introduce domain-specific expert encoding. Each expert is independently pre-trained on a single graph to capture domain-specific knowledge, thereby shielding the pre-training encoder from the adverse effects of cross-domain discrepancies. For downstream tasks, we further design a Task-oriented Expert Fusion Strategy that adaptively integrates multiple experts based on their discriminative strengths. Extensive experiments on mixed graphs demonstrate that GPH² enables stable transfer across graph types and domains, significantly outperforming existing graph pre-training methods.

2602.12843 2026-05-28 cs.CV

MMRad-22K: A Structured Multimodal Evidence Dataset for Chest X-ray Report Generation

Yichen Zhao, Zelin Peng, Fenghe Tang, Piao Yang, Yu Huang, Wei Shen

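AI summary: To address the fragmented supervision signals in existing resources for chest X-ray report generation, proposes MMRad-22K, a structured multimodal evidence dataset, and adapts a unified LVLM backbone with it, showing that structured multimodal evidence outperforms text-only or bounding-box evidence on language and clinical metrics.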

Abstract

Chest X-ray (CXR) reporting follows a region-based clinical workflow in which radiologists inspect anatomical regions and integrate localized findings into a final report. However, existing resources for CXR report generation provide these supervision signals in fragmented forms. We introduce MMRad-22K, a dataset that organizes regional textual observations, anatomical grounding coordinates, localized image evidence, and report targets into structured multimodal evidence units for CXR report generation. To motivate this formulation, we first compare different evidence formats for report generation and find that structured multimodal evidence is generally more useful than text-only or bounding box-based evidence. We then adapt a unified LVLM backbone using MMRad-22K and show that adaptation with multimodal evidence outperforms both textual-evidence adaptation and end-to-end adaptation on language and clinically oriented metrics. Under the same evaluation protocol, the adapted model also reaches a performance level comparable to several open-source LVLM references. Together, these results support MMRad-22K as a practical structured multimodal resource for training and evaluating CXR report generation aligned with clinical reading workflows.

2602.12586 2026-05-28 cs.AI

Can I Have Your Order? Monte-Carlo Tree Search for Slot Filling Ordering in Diffusion Language Models

Joshua Ong Jun Leang, Yu Zhao, Mihaela Cătălina Stoian, Wenda Li, Shay B. Cohen, Eleonora Giunchiglia

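AI summary: To address the sensitivity of plan-and-infill decoding in masked diffusion models (MDMs) to slot infilling order, proposes the McDiffuSE framework, which uses Monte Carlo Tree Search (MCTS) to optimize generation order, with an average improvement of 3.2% over autoregressive baselines and gains of 19.5% on MBPP and 4.9% on MATH500.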

Comments 8 pages, ICML2026

Abstract

While plan-and-infill decoding in Masked Diffusion Models (MDMs) shows promise for mathematical and code reasoning, performance remains highly sensitive to slot infilling order, often yielding substantial output variance. We introduce McDiffuSE, a framework that formulates slot selection as decision making and optimises infilling orders through Monte Carlo Tree Search (MCTS). McDiffuSE uses look-ahead simulations to evaluate partial completions before commitment, systematically exploring the combinatorial space of generation orders. Experiments show an average improvement of 3.2% over autoregressive baselines and 8.0% over baseline plan-and-infill, with notable gains of 19.5% on MBPP and 4.9% on MATH500. Our analysis reveals that while McDiffuSE predominantly follows sequential ordering, incorporating non-sequential generation is essential for maximising performance. We observe that larger exploration constants, rather than increased simulations, are necessary to overcome model confidence biases and discover effective orderings. These findings establish MCTS-based planning as an effective approach for enhancing generation quality in MDMs.
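
The role of the exploration constant can be illustrated with the standard UCT selection rule used in many MCTS implementations. This is a generic sketch, not McDiffuSE's code: whether the paper uses exactly this rule is an assumption, and the slot statistics below are invented for illustration.

```python
import math

def uct_score(total_value, visits, parent_visits, c):
    """Standard UCT: mean value plus an exploration bonus scaled by c."""
    if visits == 0:
        return math.inf  # unvisited children are tried first
    mean = total_value / visits
    return mean + c * math.sqrt(math.log(parent_visits) / visits)

def select_slot(children, parent_visits, c):
    """children maps a slot index to (total_value, visits)."""
    return max(children, key=lambda s: uct_score(*children[s], parent_visits, c))

# Hypothetical statistics: slot 0 looks good and is well explored,
# slot 1 has a lower mean but has been tried only once.
children = {0: (9.0, 10), 1: (0.5, 1)}
greedy = select_slot(children, parent_visits=11, c=0.1)
exploratory = select_slot(children, parent_visits=11, c=2.0)
```

With a small constant the search keeps choosing the slot it is already confident about; with a larger constant it switches to the rarely tried slot, which is the kind of behaviour the abstract associates with overcoming model confidence biases.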

2602.12468 2026-05-28 cs.LG cs.FL

Continuous Diffusion Models Can Obey Formal Syntax

Jinwoo Kim, Taylor Berg-Kirkpatrick, Loris D'Antoni

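AI summary: Proposes a training-free guidance method that uses regular expressions to constrain the generation process of continuous diffusion language models so that outputs satisfy formal syntax, achieving 68-96% constraint satisfaction on JSON and natural-language benchmarks.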

Abstract

Diffusion language models offer a promising alternative to autoregressive models due to their global, non-causal generation process, but their continuous latent dynamics make discrete constraints -- e.g., the output should be a JSON file that matches a given schema -- difficult to impose. We introduce a training-free guidance method for steering continuous diffusion language models to satisfy formal syntactic constraints expressed using regular expressions. Our approach constructs an analytic score estimating the probability that a latent state decodes to a valid string accepted by a given regular expression, and uses its gradient to guide sampling, without training auxiliary classifiers. The denoising process targets the base model conditioned on syntactic validity. We implement our method in Diffinity on top of the PLAID diffusion model and evaluate it on 180 regular-expression constraints over JSON and natural-language benchmarks. Diffinity achieves 68-96% constraint satisfaction while incurring only a small perplexity cost relative to unconstrained sampling, outperforming autoregressive constrained decoding in both constraint satisfaction and output quality. Diffinity is open-sourced at github.com/large-loris-models/Diffinity.
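
One way to picture an analytic acceptance score is a forward pass over a DFA for the regular expression. The sketch below assumes, purely for illustration, that a latent state decodes to independent per-position token distributions and that the regex has already been compiled to a transition table; neither assumption comes from the abstract, and the function name is hypothetical. It computes only the score, whereas the paper also uses its gradient.

```python
import numpy as np

def acceptance_probability(probs, transitions, start, accepting):
    """Probability that a string sampled position by position is accepted.

    probs: (length, vocab) array, each row a categorical distribution.
    transitions: (n_states, vocab) integer array, the DFA transition table.
    """
    n_states = transitions.shape[0]
    state_mass = np.zeros(n_states)
    state_mass[start] = 1.0
    for p in probs:
        nxt = np.zeros(n_states)
        for s in range(n_states):
            for tok, q in enumerate(p):
                # Move the mass sitting in state s along the edge for tok.
                nxt[transitions[s, tok]] += state_mass[s] * q
        state_mass = nxt
    return float(state_mass[list(accepting)].sum())

# DFA for (ab)* over the alphabet {a=0, b=1}; state 2 is a dead state.
transitions = np.array([[1, 2], [2, 0], [2, 2]])
uniform = np.full((2, 2), 0.5)
p_uniform = acceptance_probability(uniform, transitions, start=0, accepting={0})
```

For two uniformly random tokens only "ab" is accepted, so `p_uniform` is 0.25. Because the computation is a chain of sums and products, it would be differentiable in `probs` if written in an autodiff framework.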

2602.11564 2026-05-28 cs.CV

LUVE: Latent-Cascaded Ultra-High-Resolution Video Generation with Dual Frequency Experts

Chen Zhao, Jiawei Chen, Hongyu Li, Zhuoliang Kang, Shilin Lu, Xiaoming Wei, Kai Zhang, Jian Yang, Ying Tai

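AI summary: Proposes the LUVE framework, a three-stage latent-cascaded architecture (low-resolution motion generation, latent-space upsampling, high-resolution content refinement) combined with dual frequency experts, to tackle motion modeling, semantic planning, and detail synthesis in ultra-high-resolution video generation.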

Comments ICML 2026

Abstract

Recent advances in video diffusion models have significantly improved visual quality, yet ultra-high-resolution (UHR) video generation remains a formidable challenge due to the compounded difficulties of motion modeling, semantic planning, and detail synthesis. To address these limitations, we propose LUVE, a Latent-cascaded UHR Video generation framework built upon dual frequency Experts. LUVE employs a three-stage architecture comprising low-resolution motion generation for motion-consistent latent synthesis, video latent upsampling that performs resolution upsampling directly in the latent space to mitigate memory and computational overhead, and high-resolution content refinement that integrates low-frequency and high-frequency experts to jointly enhance semantic coherence and fine-grained detail generation. Extensive experiments demonstrate that our LUVE achieves superior photorealism and content fidelity in UHR video generation, and comprehensive ablation studies further validate the effectiveness of each component. The project is available at https://unicornanrocinu.github.io/LUVE_web/.

2602.10820 2026-05-28 cs.LG

Adaptive Sampling and Clipping for Private Worst-Case Group Optimization

Max Cairney-Leeming, Amartya Sanyal, Christoph H. Lampert

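AI summary: Proposes the ASC algorithm, which adaptively controls the sampling rate and clipping threshold of each group's gradient contributions to optimize worst-case group accuracy under differential privacy while preserving model utility.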

Comments 10 pages, 3 figures

Abstract

A central requirement for the acceptance of machine learning methods for human-centric tasks is that they should be fair, in the sense that they should work comparably well for individuals from different societal groups. A second, equally important, requirement is that they should respect the privacy of user data. While techniques exist to address each aspect in isolation, such as worst-case group optimization for the former and differentially private SGD for the latter, these are often at odds with each other, and no practical method currently exists to enforce both requirements simultaneously. In this work, we overcome this problem and propose an algorithm for optimizing the worst-case group accuracy in a differentially private way. Our main contribution is ASC (Adaptively Sampled and Clipped Worst-case Group Optimization), which adaptively controls both the sampling rate and the clipping threshold of each group's gradient contributions. Thereby, it is able to reweight the training objective in favor of harder-to-learn groups, while keeping the noise required to enforce privacy low enough to preserve model utility. Our experiments show that ASC achieves substantially higher worst-case group accuracy than prior work, without sacrificing overall average accuracy.
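
The idea of a per-group clipping threshold can be sketched on top of the usual DP-SGD recipe of clipping per-example gradients and adding Gaussian noise. This is not the ASC algorithm: the function name, the fixed thresholds, and the choice to scale noise by the largest threshold are all assumptions made for illustration, adaptive sampling is omitted, and no privacy guarantee is claimed for the sketch.

```python
import numpy as np

def private_group_step(per_example_grads, groups, clip_by_group,
                       noise_multiplier, rng):
    """Clip each example's gradient to its group's threshold, then add noise."""
    clipped = np.zeros_like(per_example_grads)
    for i, (g, grp) in enumerate(zip(per_example_grads, groups)):
        c = clip_by_group[grp]
        scale = min(1.0, c / (np.linalg.norm(g) + 1e-12))
        clipped[i] = g * scale
    # Assumption: noise is scaled by the largest threshold in use.
    sensitivity = max(clip_by_group.values())
    noise = rng.normal(scale=noise_multiplier * sensitivity,
                       size=per_example_grads.shape[1])
    noisy_mean = (clipped.sum(axis=0) + noise) / len(per_example_grads)
    return clipped, noisy_mean

rng = np.random.default_rng(0)
grads = rng.normal(size=(6, 5)) * 3.0      # hypothetical per-example gradients
groups = [0, 0, 0, 1, 1, 1]                # hypothetical group labels
clip_by_group = {0: 1.0, 1: 0.5}           # hypothetical thresholds
clipped, noisy_mean = private_group_step(grads, groups, clip_by_group,
                                         noise_multiplier=1.0, rng=rng)
```

A larger threshold lets a group contribute more to the summed gradient, which is one way the training objective can be tilted toward harder groups; the trade-off is that the noise scale follows the largest threshold.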

2602.06054 2026-05-28 cs.CL

Are We Truly Innovating? A Qualitative and Quantitative Study of Originality in AI Research Papers

Abeer Mostafa, Thi Huyen Nguyen, Zahra Ahmadi

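AI summary: Based on more than 100,000 peer-review reports, analyzes the dimensions along which originality in AI research papers is perceived, using qualitative and quantitative methods, and evaluates the reliability of large language models in assessing originality.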

Abstract

Assessing originality in AI research is arguably the most consequential yet least reliable step in peer review. Reviewer judgments of originality remain opaque, inconsistent, and dependent on comparisons to prior work that are often incomplete. In this paper, we present a large-scale, data-driven qualitative and quantitative analysis of research originality based on over 100,000 peer-review reports from leading AI venues, spanning a period of rapid growth in the field. Leveraging structured, semantically retrieved prior work and signals embedded in expert reviewer assessments, we systematically characterize how originality is perceived in practice and identify the key dimensions that most strongly influence novelty judgments. Our analysis yields a fine-grained, evidence-based framework that equips both authors and reviewers with actionable insights into how originality is evaluated. In addition, we evaluate the reliability of current large language model (LLM) agents in assessing originality. We find that these models tend to systematically overestimate novelty and struggle to detect conceptual plagiarism, particularly in the presence of paraphrasing. We release our dataset, trained models, and code at: https://anonymous.4open.science/r/Novelty-Reviewer-365C/.

2602.01992 2026-05-28 cs.AI

Emergent Analogical Reasoning in Transformers

Gouki Minegishi, Jingyuan Feng, Hiroki Furuta, Takeshi Kojima, Yusuke Iwasawa, Yutaka Matsuo

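AI summary: Formalizes analogical reasoning through the category-theoretic notion of functors, designs synthetic tasks to study how analogical reasoning emerges in Transformers, finds that emergence depends on data characteristics, optimization choices, and model scale, and through mechanistic analysis identifies two key components: geometric alignment and functor application.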

Comments Accepted to ICML2026 (spotlight)

Abstract

Analogy is a central faculty of human intelligence, enabling abstract patterns discovered in one domain to be applied to another. Despite its central role in cognition, the mechanisms by which Transformers acquire and implement analogical reasoning remain poorly understood. In this work, inspired by the notion of functors in category theory, we formalize analogical reasoning as the inference of correspondences between entities across categories. Based on this formulation, we introduce synthetic tasks that evaluate the emergence of analogical reasoning under controlled settings. We find that the emergence of analogical reasoning is highly sensitive to data characteristics, optimization choices, and model scale. Through mechanistic analysis, we show that analogical reasoning in Transformers decomposes into two key components: (1) geometric alignment of relational structure in the embedding space, and (2) the application of a functor within the Transformer. These mechanisms enable models to transfer relational structure from one category to another, realizing analogy. Finally, we quantify these effects and find that the same trends are observed in pretrained LLMs. In doing so, we move analogy from an abstract cognitive notion to a concrete, mechanistically grounded phenomenon in modern neural networks.

2511.18894 2026-05-28 cs.CV cs.AI

Not All Pixels Are Equal: Pixel-wise Meta-Learning for Medical Segmentation with Noisy Labels

Chenyu Mu, Guihai Chen, Xun Yang, Erkun Yang, Cheng Deng

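AI summary: Proposes the MetaDCSeg framework, which dynamically learns pixel-wise weights and introduces a Dynamic Center Distance mechanism to model boundary uncertainty, suppressing the influence of noisy labels and improving boundary segmentation.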

Abstract

Medical image segmentation is crucial for clinical applications, but it is frequently disrupted by noisy annotations and ambiguous anatomical boundaries, limiting its application in real-world scenarios. Existing methods often directly adapt noisy label learning techniques designed for instance classification, overlooking the pixel-wise heterogeneity in medical segmentation with its spatially and anatomically varying difficulties. Consequently, global assumptions or simple confidence metrics fail to address these local variations, leaving boundary ambiguities unresolved. To address this issue, we propose MetaDCSeg, a robust framework that dynamically learns optimal pixel-wise weights to suppress the influence of noisy labels while preserving reliable annotations. By explicitly modeling boundary uncertainty through a Dynamic Center Distance (DCD) mechanism, our approach utilizes weighted feature distances for foreground, background, and boundary centers, directing the model's attention toward hard-to-segment pixels near ambiguous boundaries. This strategy enables more precise handling of structural boundaries, which are often overlooked by existing methods, and significantly enhances segmentation performance. Extensive experiments across four benchmark datasets with varying noise levels demonstrate that MetaDCSeg outperforms existing state-of-the-art methods.

2403.11852 2026-05-28 cs.RO cs.AI

Delay-Aware Reinforcement Learning for Highway On-Ramp Merging under Stochastic Communication Latency

Amin Tabrizian, Zhitong Huang, Arsyi Aziz, Peng Wei

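AI summary: To address delayed state observations caused by stochastic V2I communication latency, proposes the DAROM framework, which models the problem as a random delay MDP, recovers a Markovian representation with a delay-aware encoder, and adds a physics-based safety controller for robust control.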

Abstract

Delayed and partially observable state information poses significant challenges for reinforcement learning (RL)-based control in real-world autonomous driving. In highway on-ramp merging, a roadside unit (RSU) can sense nearby traffic, perform edge perception, and transmit state estimates to the ego vehicle over vehicle-to-infrastructure (V2I) links. With recent advancements in intelligent transportation infrastructure and edge computing, such RSU-assisted perception is increasingly realistic and already deployed in modern connected roadway systems. However, edge processing time and wireless transmission can introduce stochastic V2I communication delays, violating the Markov assumption and substantially degrading control performance. In this work, we propose DAROM, a Delay-Aware Reinforcement Learning framework for On-ramp Merging that is robust to stochastic delays. We model the problem as a random delay Markov decision process (RDMDP) and develop a unified RL agent for joint longitudinal and lateral control. To recover a Markovian representation under delayed observations, we introduce a Delay-Aware Encoder that conditions on delayed observations, masked action histories, and observed delay magnitude to infer the current latent state. We further integrate a physics-based safety controller to reduce collision risk during merging. Experiments in the Simulation of Urban MObility (SUMO) simulator using real-world traffic data from the Next Generation Simulation (NGSIM) dataset demonstrate that DAROM consistently outperforms standard RL baselines across traffic densities. In particular, the gated recurrent unit (GRU)-based encoder achieves over 99% success in high-density traffic with random V2I delays of up to 2.0 seconds.

2602.07574 2026-05-28 cs.CV cs.CL

ViCA: Efficient Multimodal LLMs with Vision-Only Cross-Attention

Wenjie Liu, Hao Wu, Xin Qiu, Xudong Wang, Yingqi Fan, Yihan Zhang, Anhao Zhao, Yunpu Ma, Xiaoyu Shen

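AI summary: Proposes the ViCA architecture, which uses vision-only cross-attention to cut visual-token computation, preserving 98% of baseline accuracy while reducing visual-side computation to 4% and yielding substantial speedups.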

Abstract

Modern multimodal large language models (MLLMs) adopt a unified self-attention design that processes visual and textual tokens at every Transformer layer, incurring substantial computational overhead. In this work, we revisit the necessity of such dense visual processing and show that projected visual embeddings are already well-aligned with the language space, while effective vision-language interaction occurs in only a small subset of layers. Based on these insights, we propose ViCA (Vision-only Cross-Attention), a minimal MLLM architecture in which visual tokens bypass all self-attention and feed-forward layers, interacting with text solely through sparse cross-attention at selected layers. Extensive evaluations across three MLLM backbones, nine multimodal benchmarks, and 26 pruning-based baselines show that ViCA preserves 98% of baseline accuracy while reducing visual-side computation to 4%, consistently achieving superior performance-efficiency trade-offs. Moreover, ViCA provides a regular, hardware-friendly inference pipeline that yields over 3.5x speedup in single-batch inference and over 10x speedup in multi-batch inference, reducing visual grounding to near-zero overhead compared with text-only LLMs. It is also orthogonal to token pruning methods and can be seamlessly combined for further efficiency gains. Our code is available at https://github.com/EIT-NLP/ViCA.
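
The interaction pattern described above, where text tokens query visual tokens that are themselves left unchanged, can be sketched as a single-head cross-attention step. This is a simplified illustration rather than the ViCA implementation: the projection matrices, dimensions, and the absence of normalization layers and multiple heads are all assumptions.

```python
import numpy as np

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def vision_cross_attention(text, vision, W_q, W_k, W_v):
    """Text tokens attend to visual tokens; visual tokens are not updated."""
    q = text @ W_q                      # (n_text, d_head)
    k = vision @ W_k                    # (n_vis, d_head)
    v = vision @ W_v                    # (n_vis, d_model)
    attn = softmax(q @ k.T / np.sqrt(q.shape[-1]))
    return text + attn @ v, attn        # residual update of text only

rng = np.random.default_rng(0)
d_model, d_head = 8, 4
text = rng.normal(size=(3, d_model))    # hypothetical text hidden states
vision = rng.normal(size=(7, d_model))  # hypothetical projected visual embeddings
W_q = rng.normal(size=(d_model, d_head))
W_k = rng.normal(size=(d_model, d_head))
W_v = rng.normal(size=(d_model, d_model))
out, attn = vision_cross_attention(text, vision, W_q, W_k, W_v)
```

The attention matrix here has shape (text tokens, visual tokens), so its cost grows with their product rather than with the square of the combined sequence length.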

2602.06880 2026-05-28 cs.LG

Decoupling Variance and Scale-Invariant Updates in Adaptive Gradient Descent for Unified Vector and Matrix Optimization

Zitao Song, Cedar Site Bai, Zhe Zhang, Brian Bullins, David F. Gleich

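AI summary: Proposes the DeVA framework, which decouples the AdaGrad update into a variance adaptation term and a scale-invariant term to unify vector-based adaptive methods with matrix spectral optimization; it outperforms Muon and SOAP on language modeling and image classification, reducing token usage by about 6.6%.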

Abstract

Adaptive methods like Adam have become the de facto standard for large-scale vector and Euclidean optimization due to their coordinate-wise adaptation with a second-order nature. More recently, matrix-based spectral optimizers like Muon (Jordan et al., 2024b) show the power of treating weight matrices as matrices rather than long vectors. Linking these is hard because many natural generalizations are not feasible to implement, and we also cannot simply move the Adam adaptation to the matrix spectrum. To address this, we reformulate the AdaGrad update and decompose it into a variance adaptation term and a scale-invariant term. This decoupling produces DeVA (Decoupled Variance Adaptation), a framework that bridges between vector-based variance adaptation and matrix spectral optimization, enabling a seamless transition from Adam to adaptive spectral descent. Extensive experiments across language modeling and image classification demonstrate that DeVA consistently outperforms state-of-the-art methods such as Muon and SOAP (Vyas et al., 2024), reducing token usage by around 6.6%. Theoretically, we show that the variance adaptation term effectively improves the blockwise smoothness, facilitating faster convergence. Our implementation is available at https://github.com/Tsedao/Decoupled-Variance-Adaptation.

2412.01004 2026-05-28 cs.CV

Take Only What You Need: Rank Minimization as an Implicit Forgetting Regularizer in Continual Learning

Haodong Lu, Chongyang Zhao, Jason Xue, Lina Yao, Kristen Moore, Dong Gong

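AI summary: Proposes CoDyRA, which uses rank minimization as an implicit forgetting regularizer to balance plasticity and stability in continual learning, outperforming existing methods on multiple benchmarks.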

Comments Preprint

Abstract

The central tension in continual learning (CL) is the trade-off between plasticity (acquiring new knowledge) and stability (retaining prior knowledge). We study how a pre-trained backbone can be continually updated to absorb new knowledge while preserving existing capabilities, via capacity control: regulating the effective rank of each parameter update, a per-step quantity directly controllable inside a LoRA update. A controlled probe of LoRA rank and placement across modules and tasks reveals a consistent trade-off, with a moderate-rank sweet spot that varies by placement and task, leaving no universally optimal fixed rank; a formal bound shows forgetting grows with rank. Building on these findings, we propose Continual Dynamic Rank-Selective LoRA (CoDyRA), which jointly trains each LoRA update with rank minimization via sparsity-promoting regularization on per-component importance weights. The supervised objective drives plasticity; rank minimization regularizes forgetting. We show that rank minimization serves as an implicit forgetting regularizer in the CL regime, protecting general capability and prior-task knowledge simultaneously by controlling forgetting against the current model state. Across MTIL, X-TAIL, and TRACE (CLIP, LLaMA, Gemma), CoDyRA outperforms prior CL methods on new knowledge learning and forgetting, achieving a strong plasticity-stability balance. Code is available at https://github.com/jeff024/codyra.
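
How sparsity on per-component importance weights lowers the rank of a LoRA update can be shown with a small numerical sketch. It assumes the update has the form B diag(w) A and uses soft-thresholding, the proximal step of an L1 penalty, as the sparsity mechanism; both the parameterization details and the choice of soft-thresholding are assumptions, and the weight values are invented.

```python
import numpy as np

def soft_threshold(w, lam):
    """Proximal step for an L1 penalty: shrink toward zero, clip at zero."""
    return np.sign(w) * np.maximum(np.abs(w) - lam, 0.0)

rng = np.random.default_rng(0)
d_out, d_in, r = 8, 6, 4
B = rng.normal(size=(d_out, r))          # hypothetical LoRA factors
A = rng.normal(size=(r, d_in))
w = np.array([0.9, 0.05, 0.4, 0.02])     # hypothetical importance weights

# Components with small importance are driven exactly to zero.
w_sparse = soft_threshold(w, lam=0.1)
delta_W = B @ np.diag(w_sparse) @ A
effective_rank = np.linalg.matrix_rank(delta_W)
```

Two of the four weights fall below the threshold and vanish, so the update keeps only two rank-one components even though the LoRA factors were allocated with rank four.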


2505.18647 2026-05-28 cs.LG cs.AI

STFlow: Data-Coupled Flow Matching for Geometric Trajectory Simulation

STFlow: 用于几何轨迹模拟的数据耦合流匹配

Kiet Bennema ten Brinke, Koen Minartz, Vlado Menkovski

AI总结提出STFlow，一种基于图神经网络和层次卷积的生成模型，通过数据依赖耦合的流匹配框架，从条件随机游走而非高斯噪声去噪，降低传输成本，提高训练和推理效率，在N体系统、分子动力学和人类轨迹预测中实现最低预测误差。

Comments Proceedings of the 43rd International Conference on Machine Learning (ICML), Seoul, South Korea. PMLR 306, 2026, 18 pages, 12 figures

详情

AI中文摘要

模拟动力系统的轨迹是分子动力学、生物化学和行人动力学等广泛领域中的基本问题。机器学习已成为扩展基于物理的模拟器和直接从实验数据开发模型的宝贵工具。特别是，深度生成建模和几何深度学习的最新进展通过学习复杂的轨迹分布，同时尊重固有的置换和时间平移对称性，实现了概率模拟。然而，N体系统的轨迹通常具有对导致分岔的扰动的高敏感性，以及多尺度的时间和空间相关性。为了应对这些挑战，我们引入了STFlow（时空流），一种基于图神经网络和层次卷积的生成模型。通过在流匹配框架中引入数据依赖的耦合，STFlow从条件随机游走而非高斯噪声开始去噪。这种新颖的信息先验通过降低传输成本简化了学习任务，提高了训练和推理效率。我们在N体系统、分子动力学和人类轨迹预测上验证了我们的方法。在这些基准测试中，STFlow以更少的模拟步骤实现了最低的预测误差，并提高了可扩展性。

英文摘要

Simulating trajectories of dynamical systems is a fundamental problem in a wide range of fields such as molecular dynamics, biochemistry, and pedestrian dynamics. Machine learning has become an invaluable tool for scaling physics-based simulators and developing models directly from experimental data. In particular, recent advances in deep generative modeling and geometric deep learning enable probabilistic simulation by learning complex trajectory distributions while respecting intrinsic permutation and time-shift symmetries. However, trajectories of N-body systems are commonly characterized by high sensitivity to perturbations leading to bifurcations, as well as multi-scale temporal and spatial correlations. To address these challenges, we introduce STFlow (Spatio-Temporal Flow), a generative model based on graph neural networks and hierarchical convolutions. By incorporating data-dependent couplings within the Flow Matching framework, STFlow denoises starting from conditioned random-walks instead of Gaussian noise. This novel informed prior simplifies the learning task by reducing transport cost, increasing training and inference efficiency. We validate our approach on N-body systems, molecular dynamics, and human trajectory forecasting. Across these benchmarks, STFlow achieves the lowest prediction errors with fewer simulation steps and improved scalability.
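The data-dependent coupling described above can be sketched for a one-dimensional trajectory. The sketch assumes a linear interpolant between the source and target samples, which is a common flow matching choice but is not confirmed by the abstract. The random-walk parameters and all function names are hypothetical.

```python
import random

# Hypothetical sketch: the linear interpolant and all names are assumptions.

def conditioned_random_walk(last_position, num_steps, step_std, rng):
    """Source sample: a Gaussian random walk that starts at the last observed position."""
    walk, position = [], last_position
    for _ in range(num_steps):
        position += rng.gauss(0.0, step_std)
        walk.append(position)
    return walk


def flow_matching_pair(source, target, t):
    """Return the interpolated point x_t and the regression target x1 - x0."""
    x_t = [(1.0 - t) * x0 + t * x1 for x0, x1 in zip(source, target)]
    velocity = [x1 - x0 for x0, x1 in zip(source, target)]
    return x_t, velocity


rng = random.Random(0)
future = [1.0, 2.0, 3.0]  # ground-truth future positions (toy values)
source = conditioned_random_walk(
    last_position=0.5, num_steps=len(future), step_std=0.1, rng=rng
)
x_t, velocity = flow_matching_pair(source, future, t=0.25)
```

Because the source sample already starts near the observed state, the displacement the model has to learn is typically smaller than when starting from unconditioned Gaussian noise, which is the transport-cost argument made in the abstract.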


2602.05897 2026-05-28 cs.CL

Stop Rewarding Hallucinated Steps: Faithfulness-Aware Step-Level Reinforcement Learning for Small Reasoning Models

停止奖励幻觉步骤：面向小型推理模型的忠实感知步骤级强化学习

Shuo Nie, Hexuan Deng, Chao Wang, Ruiyu Fang, Xuebo Liu, Shuangyong Song, Yu Li, Min Zhang, Xuelong Li

AI总结针对小型推理模型在中间推理步骤中容易产生忠实性幻觉的问题，提出忠实感知步骤级强化学习（FaithRL），通过过程奖励模型提供步骤级监督和隐式截断重采样策略，减少幻觉并提高推理可靠性。

详情

AI中文摘要

随着大型语言模型变得更小更高效，小型推理模型（SRM）在资源受限环境中实现思维链（CoT）推理至关重要。然而，它们容易产生忠实性幻觉，尤其是在中间推理步骤中。现有的基于在线强化学习的缓解方法依赖于结果奖励或粗粒度的CoT评估，这可能在最终答案正确时无意中强化不忠实的推理。为了解决这些局限性，我们提出了忠实感知步骤级强化学习（FaithRL），通过来自过程奖励模型的显式忠实奖励引入步骤级监督，以及一种隐式截断重采样策略，该策略从忠实前缀生成对比信号，同时减轻步骤级奖励的奖励黑客攻击。在多个SRM和开放书籍QA基准上的实验表明，FaithRL持续减少CoT和最终答案中的幻觉，从而实现更忠实和可靠的推理。代码可在 https://github.com/Easy195/FaithRL 获取。

英文摘要

As large language models become smaller and more efficient, small reasoning models (SRMs) are crucial for enabling chain-of-thought (CoT) reasoning in resource-constrained settings. However, they are prone to faithfulness hallucinations, especially in intermediate reasoning steps. Existing mitigation methods based on online reinforcement learning rely on outcome-based rewards or coarse-grained CoT evaluation, which can inadvertently reinforce unfaithful reasoning when the final answer is correct. To address these limitations, we propose Faithfulness-Aware Step-Level Reinforcement Learning (FaithRL), introducing step-level supervision via explicit faithfulness rewards from a process reward model, together with an implicit truncated resampling strategy that generates contrastive signals from faithful prefixes, while also mitigating reward hacking from step-level rewards. Experiments across multiple SRMs and Open-Book QA benchmarks demonstrate that FaithRL consistently reduces hallucinations in both the CoT and final answers, leading to more faithful and reliable reasoning. Code is available at https://github.com/Easy195/FaithRL.


2509.25582 2026-05-28 cs.LG

Safe In-Context Reinforcement Learning

安全的上下文强化学习

Amir Moeini, Minjae Kwon, Alper Kamil Bozkurt, Yuichi Motai, Rohan Chandra, Lu Feng, Shangtong Zhang

AI总结提出SCARED方法，在无参数更新的上下文强化学习适应过程中，通过精确惩罚对偶法在约束马尔可夫决策过程框架下保证安全，实现奖励最大化与成本约束。

Comments ICML 2026

详情

AI中文摘要

上下文强化学习（ICRL）是一种新兴的强化学习范式，其中智能体在预训练后，无需任何参数更新，仅依赖不断扩展的交互历史上下文即可适应分布外测试任务。尽管ICRL展现出令人印象深刻的泛化能力，但适应过程中的安全性尚未被探索，这限制了其在测试时行为需安全的实际部署中的应用。本文提出SCARED：基于精确惩罚对偶的安全上下文自适应强化，这是首个在约束马尔可夫决策过程框架下促进ICRL安全适应的方法。在无需参数更新的适应过程中，我们的智能体不仅最大化奖励，还将累积成本控制在用户指定的安全预算内。我们还证明智能体对安全预算有主动反应：安全预算越高，智能体行为越激进；安全预算越低，智能体行为越保守。在具有挑战性的基准测试中，SCARED始终实现安全且鲁棒的上下文适应，优于现有的ICRL和安全元强化学习基线。

英文摘要

In-context reinforcement learning (ICRL) is an emerging RL paradigm where an agent, after pretraining, can adapt to out-of-distribution test tasks without any parameter updates, instead relying on an expanding context of interaction history. While ICRL has shown impressive generalization, safety during this adaptation process remains unexplored, limiting its applicability in real-world deployments where test-time behavior is expected to be safe. In this work, we propose SCARED: Safe Contextual Adaptive Reinforcement via Exact-penalty Dual, the first method that promotes safe adaptation of ICRL under the constrained Markov decision process framework. During the parameter-update-free adaptation process, our agent not only maximizes the reward but also keeps the accumulated cost within a user-specified safety budget. We also demonstrate that the agent actively reacts to the safety budget; with a higher safety budget, the agent behaves more aggressively, and with a lower safety budget the agent behaves more conservatively. Across challenging benchmarks, SCARED consistently enables safe and robust in-context adaptation, outperforming existing ICRL and safe meta-RL baselines.


2503.01829 2026-05-28 cs.CL cs.AI cs.LG cs.MA

Persuade Me if You Can: A Framework for Evaluating Persuasion Effectiveness and Susceptibility Among Large Language Models

如果你能说服我：评估大型语言模型说服效果与易受影响性的框架

Nimet Beyza Bozdag, Shuhaib Mehri, Gokhan Tur, Dilek Hakkani-Tür

AI总结提出PMIYC框架，通过多智能体对话自动评估LLM的说服效果与易受影响性，发现不同模型在说服力和抗说服性上存在显著差异。

Comments Paper published at the ACM Conference on AI and Agentic Systems 2026

详情

DOI: 10.1145/3786335.3813181

AI中文摘要

大型语言模型（LLM）展现出与人类水平相当的说服能力。虽然这些能力可用于社会公益，但也存在被滥用的风险。除了关注LLM如何说服他人外，它们自身对说服的易受影响性也构成了关键的对齐挑战，引发了关于鲁棒性、安全性和伦理原则遵守的问题。为了研究这些动态，我们引入了“如果你能说服我”（PMIYC），一个用于评估多智能体交互中说服力和易受影响性的自动化框架。我们的框架提供了一种可扩展的替代方案，替代了通常用于研究LLM说服的昂贵且耗时的人工标注过程。PMIYC自动进行说服者和被说服者智能体之间的多轮对话，同时衡量说服的有效性和易受影响性。我们的综合评估涵盖了多种LLM和说服场景（例如，主观和错误信息场景）。我们通过人工评估验证了框架的有效性，并展示了与先前研究中人工评估的一致性。通过PMIYC，我们发现Llama-3.3-70B和GPT-4o表现出相似的说服效果，比Claude 3 Haiku高出30%。然而，GPT-4o在对抗错误信息方面的抵抗力比Llama-3.3-70B高出50%以上。值得注意的是，o4-mini既是有效的说服者，也是抵抗的被说服者。这些发现为LLM的说服动态提供了实证见解，并有助于开发更安全的AI系统。

英文摘要

Large Language Models (LLMs) demonstrate persuasive capabilities that rival human-level persuasion. While these capabilities can be used for social good, they also present risks of potential misuse. Beyond the concern of how LLMs persuade others, their own susceptibility to persuasion poses a critical alignment challenge, raising questions about robustness, safety, and adherence to ethical principles. To study these dynamics, we introduce Persuade Me If You Can (PMIYC), an automated framework for evaluating persuasiveness and susceptibility to persuasion in multi-agent interactions. Our framework offers a scalable alternative to the costly and time-intensive human annotation process typically used to study persuasion in LLMs. PMIYC automatically conducts multi-turn conversations between Persuader and Persuadee agents, measuring both the effectiveness of and susceptibility to persuasion. Our comprehensive evaluation spans a diverse set of LLMs and persuasion settings (e.g., subjective and misinformation scenarios). We validate the efficacy of our framework through human evaluations and demonstrate alignment with human assessments from prior studies. Through PMIYC, we find that Llama-3.3-70B and GPT-4o exhibit similar persuasive effectiveness, outperforming Claude 3 Haiku by 30%. However, GPT-4o demonstrates over 50% greater resistance to persuasion for misinformation compared to Llama-3.3-70B. Notably, o4-mini emerges as both an effective persuader, and a resistant persuadee. These findings provide empirical insights into the persuasive dynamics of LLMs and contribute to the development of safer AI systems.


2602.03668 2026-05-28 cs.RO cs.CV

MVP-LAM: Learning Action-Centric Latent Action via Cross-Viewpoint Reconstruction

MVP-LAM：通过跨视角重建学习以动作为中心的潜在动作

Jung Min Lee, Dohyeok Lee, Seokhun Ju, Taehyun Cho, Jin Woo Koo, Li Zhao, Sangwoo Hong, Jungwoo Lee

AI总结提出MVP-LAM模型，利用多视角视频通过跨视角重建目标学习与真实动作高度相关的潜在动作，提升动作预测和下游操作性能。

详情

AI中文摘要

从多样化人类视频中学习的潜在动作作为视觉-语言-动作（VLA）预训练的伪标签，但只有当它们对底层真实动作保持信息量时才能提供有效监督。为了有效监督，潜在动作应包含关于底层动作的信息，尽管这些信息不可直接获取。我们提出多视角潜在动作模型（MVP-LAM），该模型从多视角视频中学习与真实动作高度相关的潜在动作。MVP-LAM通过跨视角重建目标训练潜在动作，使得一个视角的潜在动作必须解释另一个视角的未来，从而减少对视角特定线索的依赖。在Bridge V2上，MVP-LAM生成更以动作为中心的潜在动作，与真实动作的互信息更高，动作预测性能提升，包括在分布外评估下。最后，使用MVP-LAM潜在动作预训练VLA模型提高了各种基准上的下游操作性能。代码和训练好的检查点可在https://jmsnu.github.io获取。

英文摘要

Latent actions learned from diverse human videos serve as pseudo-labels for vision-language-action (VLA) pretraining, but provide effective supervision only if they remain informative about the underlying ground-truth actions. For effective supervision, latent actions should contain information about the underlying actions even though they are inaccessible. We propose Multi-ViewPoint Latent Action Model (MVP-LAM), which learns latent actions that are highly informative about ground-truth actions from multi-view videos. MVP-LAM trains latent actions with a cross-viewpoint reconstruction objective, so that a latent action from one view must explain the future in another view, reducing reliance on viewpoint-specific cues. On Bridge V2, MVP-LAM produces more action-centric latent actions, achieving higher mutual information with ground-truth actions and improved action prediction, including under out-of-distribution evaluation. Finally, pretraining VLAs with MVP-LAM latent actions improves downstream manipulation performance on various benchmarks. The code and trained checkpoints are available at https://jmsnu.github.io.


2602.03515 2026-05-28 cs.LG cs.AI cs.DC

Mitigating Staleness in Asynchronous Pipeline Parallelism via Basis Rotation

通过基旋转缓解异步流水线并行中的陈旧性问题

Hyunji Jung, Sungbin Shin, Namhoon Lee

AI总结针对异步流水线并行中梯度陈旧性随流水线深度线性增长的问题，提出基旋转框架，通过将优化器坐标系与Hessian特征基对齐来保持延迟更新的有效性，理论证明最小化基失配并实证在3B参数LLM训练中减少81.7%迭代次数。

Comments ICML 2026

详情

AI中文摘要

异步流水线并行通过消除同步执行中固有的流水线气泡来最大化硬件利用率，为高效大规模分布式训练提供了一条途径。然而，这种效率提升可能会被梯度陈旧性所削弱，其中使用延迟梯度的即时模型更新会在优化过程中引入噪声。关键的是，我们发现了一个常被忽视的严重问题：这种延迟随流水线深度线性增长，从根本上破坏了该方法原本意图提供的可扩展性。我们将此问题归因于优化景观的一个特定性质：Hessian特征基与标准坐标基之间的失配，这触发了坐标自适应优化器更新轨迹中的振荡。我们识别出这些振荡导致延迟更新偏离其真实对应项，使其无法用于当前迭代。这一见解通过理论分析（包括一个表明基失配放大延迟惩罚的收敛界）和实证评估得到证实。为了解决这个问题，我们提出了基旋转，一个将优化器坐标系旋转以与Hessian特征基对齐的框架，使延迟更新保持有用。我们从理论上证明基旋转最小化基失配，从而抵消放大延迟惩罚的条件。在训练高达3B参数的LLM的实证中，与性能最佳的异步基线相比，基旋转减少了81.7%所需的迭代次数。

英文摘要

Asynchronous pipeline parallelism maximizes hardware utilization by eliminating the pipeline bubbles inherent in synchronous execution, offering a path toward efficient large-scale distributed training. However, this efficiency gain can be compromised by gradient staleness, where the immediate model updates with delayed gradients introduce noise into the optimization process. Crucially, we identify a critical, yet often overlooked, pathology: this delay scales linearly with pipeline depth, fundamentally undermining the very scalability that the method originally intends to provide. We trace this pathology to a specific property of the optimization landscape: the misalignment between the Hessian eigenbasis and the standard coordinate basis, which triggers oscillations in the update trajectories of coordinate-wise adaptive optimizers. We identify that these oscillations cause delayed updates to diverge from their true counterparts, invalidating their use for current iterations. This insight is formalized through theoretical analysis, including a convergence bound showing that basis misalignment amplifies the delay penalty, and substantiated with empirical evaluation. To address this, we propose basis rotation, a framework that rotates the optimizer's coordinate system to align with the Hessian eigenbasis, keeping delayed updates useful. We theoretically demonstrate that basis rotation minimizes basis misalignment, thereby counteracting the conditions that amplify delay penalties. Empirically, in training up to a 3B-parameter LLM, basis rotation reduces the required iterations by 81.7% compared to the best-performing asynchronous baseline.


2602.03491 2026-05-28 cs.CV cs.CL

Decoupling Skeleton and Flesh: Efficient Multimodal Table Reasoning with Disentangled Alignment and Structure-aware Guidance

解耦骨架与血肉：基于解缠对齐和结构感知引导的高效多模态表格推理

Yingjie Zhu, Xuefeng Bai, Kehai Chen, Yang Xiang, Youcheng Pan, Xiaoqiang Zhou, Min Zhang

AI总结提出DiSCo解缠结构-内容对齐框架和Table-GLS全局到局部结构引导推理框架，高效增强LVLM的表格理解与推理能力，无需昂贵监督或外部工具。

Comments Accepted as a Spotlight Paper at ICML 2026

详情

AI中文摘要

由于复杂的布局和紧密耦合的结构-内容信息，对表格图像进行推理对于大型视觉语言模型（LVLM）仍然具有挑战性。现有解决方案通常依赖于昂贵的监督训练、强化学习或外部工具，限制了效率和可扩展性。这项工作解决了一个关键问题：如何以最少的标注且无需外部工具来使LVLM适应表格推理？具体来说，我们首先引入了DiSCo，一种解缠结构-内容对齐框架，在多模态对齐期间明确分离结构抽象和语义基础，高效地将LVLM适应于表格结构。在DiSCo的基础上，我们进一步提出了Table-GLS，一种全局到局部结构引导推理框架，通过结构化探索和基于证据的推理来执行表格推理。跨多个基准的大量实验表明，我们的框架高效地增强了LVLM的表格理解和推理能力，特别是泛化到未见过的表格结构。我们的数据和代码可在https://github.com/AAAndy-Zhu/TableVLM获取。

英文摘要

Reasoning over table images remains challenging for Large Vision-Language Models (LVLMs) due to complex layouts and tightly coupled structure-content information. Existing solutions often depend on expensive supervised training, reinforcement learning, or external tools, limiting efficiency and scalability. This work addresses a key question: how to adapt LVLMs to table reasoning with minimal annotation and no external tools? Specifically, we first introduce DiSCo, a Disentangled Structure-Content alignment framework that explicitly separates structural abstraction from semantic grounding during multimodal alignment, efficiently adapting LVLMs to table structures. Building on DiSCo, we further present Table-GLS, a Global-to-Local Structure-guided reasoning framework that performs table reasoning via structured exploration and evidence-grounded inference. Extensive experiments across diverse benchmarks demonstrate that our framework efficiently enhances LVLMs' table understanding and reasoning capabilities, particularly generalizing to unseen table structures. Our data and code are available at https://github.com/AAAndy-Zhu/TableVLM.


2602.02898 2026-05-28 cs.AI cs.CL

Aligning Language Model Benchmarks with Pairwise Preferences

将语言模型基准与成对偏好对齐

Marco Gutierrez, Xinyi Leng, Hannah Cyberey, Jonathan Richard Schwarz, Ahmed Alaa, Thomas Hartvigsen

AI总结提出BenchAlign方法，通过利用语言模型在问题级别的性能与模型成对排名，自动调整离线基准权重，使新基准能根据偏好准确排序未见模型。

详情

AI中文摘要

语言模型基准是广泛使用的、计算高效的现实性能代理。然而，许多近期工作发现基准常常无法预测实际效用。为弥合这一差距，我们引入基准对齐，即利用有限的模型性能信息自动更新离线基准，旨在生成新的静态基准，以预测给定测试设置中的模型成对偏好。然后我们提出BenchAlign，这是该问题的首个解决方案，它利用语言模型在问题级别的性能以及可能在部署期间收集的模型成对排名，学习基准问题的偏好对齐权重，生成新的基准，根据这些偏好对先前未见过的模型进行排序。我们的实验表明，我们的对齐基准能够根据人类偏好模型准确地对未见模型进行排序，即使模型大小不同，同时保持可解释性。总体而言，我们的工作为将基准与实际人类偏好对齐的局限性提供了见解，这有助于加速模型开发以追求实际效用。

英文摘要

Language model benchmarks are pervasive and computationally efficient proxies for real-world performance. However, many recent works find that benchmarks often fail to predict real utility. Towards bridging this gap, we introduce benchmark alignment, where we use limited amounts of information about model performance to automatically update offline benchmarks, aiming to produce new static benchmarks that predict model pairwise preferences in given test settings. We then propose BenchAlign, the first solution to this problem, which learns preference-aligned weightings for benchmark questions using the question-level performance of language models alongside ranked pairs of models that could be collected during deployment, producing new benchmarks that rank previously unseen models according to these preferences. Our experiments show that our aligned benchmarks can accurately rank unseen models according to models of human preferences, even across different sizes, while remaining interpretable. Overall, our work provides insights into the limits of aligning benchmarks with practical human preferences, which stands to accelerate model development towards real utility.
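Learning question weightings from ranked model pairs can be sketched as follows. The sketch assumes a Bradley-Terry-style logistic objective on the difference of weighted benchmark scores, fitted by plain gradient ascent. This objective, the toy data, and every name in the block are illustrative assumptions. The abstract does not specify BenchAlign's actual loss.

```python
import math

# Hypothetical sketch: the logistic pairwise objective is an assumption,
# not the objective confirmed for BenchAlign.

def weighted_score(weights, results):
    """Benchmark score of one model: weighted sum of per-question correctness."""
    return sum(w * r for w, r in zip(weights, results))


def fit_question_weights(results_by_model, preferred_pairs, steps=200, lr=0.5):
    """Fit one weight per question so that preferred models score higher."""
    num_questions = len(next(iter(results_by_model.values())))
    weights = [0.0] * num_questions
    for _ in range(steps):
        grad = [0.0] * num_questions
        for winner, loser in preferred_pairs:
            diff = [a - b for a, b in
                    zip(results_by_model[winner], results_by_model[loser])]
            margin = sum(w * d for w, d in zip(weights, diff))
            p_win = 1.0 / (1.0 + math.exp(-margin))
            # Gradient of log sigmoid(margin) with respect to each weight.
            for q in range(num_questions):
                grad[q] += (1.0 - p_win) * diff[q]
        weights = [w + lr * g / len(preferred_pairs)
                   for w, g in zip(weights, grad)]
    return weights


# Toy question-level results (1 = correct, 0 = wrong) and ranked pairs.
results = {
    "model_a": [1, 0, 1],
    "model_b": [0, 1, 1],
    "model_c": [0, 1, 0],
}
pairs = [("model_a", "model_b"), ("model_b", "model_c")]
weights = fit_question_weights(results, pairs)
```

The learned weights stay attached to individual questions, so one can inspect which questions drive the ranking, in line with the interpretability claim in the abstract.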


2602.02855 2026-05-28 cs.LG cond-mat.dis-nn math.ST stat.TH

When pre-training hurts LoRA fine-tuning: a dynamical analysis via single-index models

当预训练损害LoRA微调：基于单指标模型的动力学分析

Gibbs Nwemadji, Bruno Loureiro, Jean Barbier

AI总结本文通过单指标模型下的动力学分析，数学证明了过度预训练会降低LoRA微调的收敛速度，并刻画了收敛率与初始对齐及目标任务非线性的关系。

Comments 38 pages, 14 figures

详情

AI中文摘要

在源任务上的预训练通常被认为有助于类似下游问题的微调。本文从数学上表明，这种朴素直觉并不总是成立：过度预训练会在计算上减慢微调优化。我们研究了在单次SGD训练的单指标模型上进行低秩适应（LoRA）微调的现象。利用微调动力学的汇总统计描述，我们精确刻画了收敛率如何依赖于初始微调对齐和目标任务的非线性程度。关键结论是，即使预训练和下游任务高度对齐，强预训练也会导致搜索阶段延长并阻碍收敛。因此，我们的理论提供了一个统一图景，说明预训练强度与任务难度如何在非平凡的可处理模型中共同塑造LoRA微调的动力学和局限性。在实践方面，我们通过实验表明，我们的理论发现超越了玩具模型，在真实数据上训练的视觉变换器模型中仍然相关。

英文摘要

Pre-training on a source task is usually expected to facilitate fine-tuning on similar downstream problems. In this work, we mathematically show that this naive intuition is not always true: excessive pre-training can computationally slow down fine-tuning optimization. We study this phenomenon for low-rank adaptation (LoRA) fine-tuning on single-index models trained under one-pass SGD. Leveraging a summary statistics description of the fine-tuning dynamics, we precisely characterize how the convergence rate depends on the initial fine-tuning alignment and the degree of non-linearity of the target task. The key takeaway is that even when the pre-training and downstream tasks are well aligned, strong pre-training can induce a prolonged search phase and hinder convergence. Our theory thus provides a unified picture of how pre-training strength and task difficulty jointly shape the dynamics and limitations of LoRA fine-tuning in a nontrivial tractable model. On the practical side, we empirically show that our theoretical findings extend beyond our toy model and remain relevant in the context of a vision-transformer model trained on real data.


2602.01807 2026-05-28 cs.CL cs.LG

Sentence Curve Language Models

句子曲线语言模型

DongNyeong Heo, Taehwan Kim, Heeyoul Choi

AI总结提出句子曲线表示，将扩散语言模型扩展为预测句子曲线而非静态词嵌入，以增强全局结构建模，并在IWSLT14和WMT14上取得最优性能。

详情

AI中文摘要

语言模型（LM）是现代AI系统的核心组成部分，扩散语言模型（DLM）最近已成为一种有竞争力的替代方案。这两种范式都依赖词嵌入来表示输入句子，以及骨干模型训练预测的目标句子。我们认为，这种目标词的静态嵌入对相邻词不敏感，鼓励局部准确的词预测，而全局句子结构则较少被强调。为了解决这个问题，我们提出了一种连续的句子表示，称为句子曲线，定义为一条样条曲线，其控制点影响句子中的多个词。基于这种表示，我们引入了句子曲线语言模型（SCLM），它将DLM扩展为预测句子曲线而非静态词嵌入。我们从理论上证明，句子曲线预测会引入正则化效应，促进全局结构建模，并刻画了不同句子曲线类型如何影响这种行为。实验上，SCLM在IWSLT14和WMT14上取得了DLM中的最优性能，训练稳定且无需繁重的知识蒸馏，并在LM1B上展现出与离散DLM相比有潜力的前景。

英文摘要

Language models (LMs) are a central component of modern AI systems, and diffusion language models (DLMs) have recently emerged as a competitive alternative. Both paradigms rely on word embeddings not only to represent the input sentence, but also to represent the target sentence that backbone models are trained to predict. We argue that such static embedding of the target word is insensitive to neighboring words, encouraging locally accurate word prediction while global sentence structure is less emphasized. To address this, we propose a continuous sentence representation, termed sentence curve, defined as a spline curve whose control points affect multiple words in the sentence. Based on this representation, we introduce sentence curve language model (SCLM), which extends DLMs to predict sentence curves instead of the static word embeddings. We theoretically show that sentence curve prediction induces a regularization effect that promotes global structure modeling, and characterize how different sentence curve types affect this behavior. Empirically, SCLM achieves state-of-the-art performance among DLMs on IWSLT14 and WMT14, shows stable training without burdensome knowledge distillation, and demonstrates promising potential compared to discrete DLMs on LM1B.


2602.02417 2026-05-28 cs.LG

Trust Region Continual Learning as an Implicit Meta-Learner

信任区域持续学习作为隐式元学习器

Zekun Wang, Anant Gupta, Christopher J. MacLellan

AI总结本文提出信任区域持续学习，通过结合生成重放和Fisher度量信任区域约束，实现隐式元学习效果，在任务增量扩散图像生成和持续扩散策略控制中取得最佳性能。

Comments 21 pages, 21 tables

详情

AI中文摘要

持续学习旨在顺序获取任务而不发生灾难性遗忘，但标准策略面临核心权衡：基于正则化的方法（如EWC）在任务最优值弱重叠时可能过度约束更新，而基于重放的方法可以保持性能但因不完美重放而漂移。我们研究了一种混合视角：信任区域持续学习，它将生成重放与Fisher度量信任区域约束相结合。我们证明，在局部近似下，得到的更新具有MAML风格的解释，包含一个隐式内步：重放提供旧任务梯度信号（类似查询），而Fisher加权惩罚提供高效的离线曲率塑造（类似支持）。这产生了持续学习中的涌现元学习特性：模型成为初始化，在每次任务转换后快速重新收敛到先前任务最优值，而无需显式优化双层目标。实验上，在任务增量扩散图像生成和持续扩散策略控制中，信任区域持续学习实现了最佳最终性能和保留，并且比EWC、重放和持续元学习基线更快地恢复早期任务性能。

英文摘要

Continual learning aims to acquire tasks sequentially without catastrophic forgetting, yet standard strategies face a core tradeoff: regularization-based methods (e.g., EWC) can overconstrain updates when task optima are weakly overlapping, while replay-based methods can retain performance but drift due to imperfect replay. We study a hybrid perspective: trust region continual learning that combines generative replay with a Fisher-metric trust region constraint. We show that, under local approximations, the resulting update admits a MAML-style interpretation with a single implicit inner step: replay supplies an old-task gradient signal (query-like), while the Fisher-weighted penalty provides an efficient offline curvature shaping (support-like). This yields an emergent meta-learning property in continual learning: the model becomes an initialization that rapidly re-converges to prior task optima after each task transition, without explicitly optimizing a bilevel objective. Empirically, on task-incremental diffusion image generation and continual diffusion-policy control, trust region continual learning achieves the best final performance and retention, and consistently recovers early-task performance faster than EWC, replay, and continual meta-learning baselines.
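The combination of a replay gradient with a Fisher-weighted penalty can be sketched as a single parameter update. The sketch assumes a diagonal Fisher approximation and a quadratic penalty around the previous-task parameters, as in EWC. The function names, the step size, and the way the three gradient terms are summed are illustrative assumptions rather than the paper's exact update.

```python
# Hypothetical sketch: diagonal Fisher and a simple gradient sum are assumptions.

def fisher_penalty(theta, theta_anchor, fisher_diag, strength):
    """Quadratic penalty 0.5 * strength * sum_i F_i * (theta_i - anchor_i)^2."""
    return 0.5 * strength * sum(
        f * (t - a) ** 2 for t, a, f in zip(theta, theta_anchor, fisher_diag)
    )


def fisher_penalty_grad(theta, theta_anchor, fisher_diag, strength):
    """Gradient of the penalty with respect to each parameter."""
    return [strength * f * (t - a)
            for t, a, f in zip(theta, theta_anchor, fisher_diag)]


def penalized_step(theta, task_grad, replay_grad, theta_anchor,
                   fisher_diag, strength, lr):
    """One gradient step on new-task loss + replay loss + Fisher penalty."""
    penalty_grad = fisher_penalty_grad(theta, theta_anchor, fisher_diag, strength)
    return [t - lr * (g + r + p)
            for t, g, r, p in zip(theta, task_grad, replay_grad, penalty_grad)]


theta = [1.0, 2.0]
anchor = [0.0, 2.0]   # parameters after the previous task
fisher = [2.0, 5.0]   # diagonal Fisher estimates (toy values)
new_theta = penalized_step(theta, [0.0, 0.0], [0.0, 0.0],
                           anchor, fisher, strength=1.0, lr=0.25)
```

With zero task and replay gradients, the step only pulls the first parameter back toward its anchor, scaled by its Fisher weight, which is the trust-region behaviour the penalty is meant to provide.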


2602.02259 2026-05-28 cs.LG cs.CV

Segment to Focus: Guiding Latent Action Models in the Presence of Distractors

聚焦分割：在干扰物存在下引导潜在动作模型

Marcus Fechner, Hamza Adnan, Constantin C. Lüth, Matthew T. Jackson, Alexey Zakharov, J. Marius Zöllner

AI总结针对动作相关视觉干扰导致潜在动作模型失效的问题，提出MaskLAM方法，利用分割基础模型（如SAM）零样本获取智能体掩码，限制重建目标于智能体像素，迫使潜在动作编码内源动态，显著提升下游策略性能。

详情

AI中文摘要

潜在动作模型（LAMs）为在大规模无动作视频上预训练具身智能体提供了一条有前景的路径。它们推断连续观测之间的潜在动作，之后可以使用少量标签解码为真实动作。然而，近期工作表明，在真实世界视频中常见的动作相关视觉干扰物（如动态背景、相机抖动或其他移动物体）存在时，这一方法会失败。在这些场景中，标准重建目标会驱使潜在动作编码外源运动而非智能体控制的动态，导致微调后的策略性能不佳。然而，我们观察到内源和外源因素通常在像素空间中是空间分离的：控制相关的变化集中在智能体上，而干扰物运动发生在别处。我们利用这一观察，将重建目标限制在智能体像素上，迫使潜在动作解释智能体控制的动态而非外源动态。我们将该方法称为MaskLAM；它从现成的分割基础模型（如SAM）中零样本获取智能体掩码，并且在预训练期间不需要架构更改、辅助损失或动作标签。在两个连续控制基准（Distracting Control Suite、Distracting Meta-World）上，MaskLAM将归一化线性探针MSE降低了最多$3.51\times$，并将归一化回报提高了最多$4.97\times$，相比LAPO，同时缩小了与依赖真实动作监督的LAOM-Labels之间的差距。

英文摘要

Latent action models (LAMs) offer a promising path to pre-training embodied agents on large amounts of action-free video. They infer latent actions between consecutive observations that can later be decoded to ground-truth actions using a small number of labels. However, recent work has shown that this recipe fails in the presence of action-correlated visual distractors common in real-world video, such as dynamic backgrounds, camera shake, or other moving objects. In these scenarios, the standard reconstruction objective drives latent actions to encode exogenous motion instead of agent-controlled dynamics, resulting in policies that underperform when fine-tuned. We observe, however, that endogenous and exogenous factors are typically spatially separated in pixel space: control-relevant change is concentrated on the agent, while distractor motion occurs elsewhere. We exploit this observation by restricting the reconstruction objective to agent pixels, forcing latent actions to explain agent-controlled dynamics rather than exogenous ones. We call this method MaskLAM; it obtains the agent mask zero-shot from off-the-shelf segmentation foundation models (e.g., SAM) and requires no architectural changes, auxiliary losses, or action labels during pre-training. Across two continuous-control benchmarks (Distracting Control Suite, Distracting Meta-World), MaskLAM reduces normalized linear-probe MSE by up to $3.51\times$ and improves normalized return by up to $4.97\times$ over LAPO, while narrowing the gap to LAOM-Labels, which relies on ground-truth action supervision.
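Restricting the reconstruction objective to agent pixels can be sketched as a masked mean squared error. The sketch assumes a binary agent mask is already available (in the paper it comes from a segmentation foundation model; here it is a hand-written toy array). The function name and the toy frames are hypothetical.

```python
# Hypothetical sketch: a masked MSE over toy 2x2 frames; the mask would come
# from a segmentation model in practice, which is not reproduced here.

def masked_reconstruction_loss(predicted, target, agent_mask):
    """Mean squared error computed only over pixels where agent_mask is truthy."""
    total, count = 0.0, 0
    for pred_row, tgt_row, mask_row in zip(predicted, target, agent_mask):
        for p, t, m in zip(pred_row, tgt_row, mask_row):
            if m:
                total += (p - t) ** 2
                count += 1
    return total / count if count else 0.0


predicted = [[0.0, 1.0], [1.0, 1.0]]
target = [[0.0, 0.0], [1.0, 3.0]]      # right column changes due to a distractor
agent_mask = [[1, 0], [1, 0]]          # agent occupies the left column
full_mask = [[1, 1], [1, 1]]

agent_only = masked_reconstruction_loss(predicted, target, agent_mask)
all_pixels = masked_reconstruction_loss(predicted, target, full_mask)
```

In this toy case all reconstruction error sits on distractor pixels, so the masked loss gives the latent action no incentive to encode that motion.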


2602.02150 2026-05-28 cs.LG cs.AI

ECHO: Entropy-Confidence Hybrid Optimization for Test-Time Reinforcement Learning

ECHO: 测试时强化学习的熵-置信度混合优化

Chu Zhao, Enneng Yang, Yuting Liu, Jianzhe Zhao, Guibing Guo

AI总结针对测试时强化学习中高熵分支导致rollout崩溃和早期伪标签噪声引发过拟合的问题，提出熵-置信度混合组相对策略优化（ECHO），通过自适应分支控制和置信度剪枝缓解崩溃，并采用置信度自适应裁剪和优势塑造增强训练鲁棒性。

Comments 19 pages

详情

AI中文摘要

测试时强化学习通过重复rollout生成多个候选答案，并利用多数投票构建的伪标签进行在线更新。为了减少开销并改进探索，先前的工作引入了树结构rollout，共享推理前缀并在关键节点分支以提高采样效率。然而，这种范式仍然面临两个挑战：(1) 高熵分支可能触发rollout崩溃，即分支预算集中在少数具有连续高熵片段的轨迹上，迅速减少有效分支数量；(2) 早期伪标签存在噪声和偏差，可能引发自我强化的过拟合，导致策略过早锐化并抑制探索。为了解决这些问题，我们提出了熵-置信度混合组相对策略优化（ECHO）。在rollout过程中，ECHO联合利用局部熵和组级置信度自适应控制分支宽度，并进一步引入在线置信度剪枝以终止持续低置信度的分支，避免高熵陷阱并缓解崩溃。在策略更新过程中，ECHO采用置信度自适应裁剪和熵-置信度混合优势塑造方法，以增强训练鲁棒性并减轻早期偏差。实验表明，ECHO在多个数学和视觉推理基准上取得了一致的性能提升，并在有限的rollout预算下更有效地泛化。

英文摘要

Test-time reinforcement learning generates multiple candidate answers via repeated rollouts and performs online updates using pseudo-labels constructed by majority voting. To reduce overhead and improve exploration, prior work introduces tree-structured rollouts, which share reasoning prefixes and branch at key nodes to improve sampling efficiency. However, this paradigm still faces two challenges: (1) high-entropy branching can trigger rollout collapse, where the branching budget concentrates on a few trajectories with consecutive high-entropy segments, rapidly reducing the number of effective branches; (2) early pseudo-labels are noisy and biased, which can induce self-reinforcing overfitting, causing the policy to sharpen prematurely and suppress exploration. To address these issues, we propose Entropy-Confidence Hybrid Group Relative Policy Optimization (ECHO). During rollout, ECHO jointly leverages local entropy and group-level confidence to adaptively control branch width, and further introduces online confidence-based pruning to terminate persistently low-confidence branches, avoiding high-entropy traps and mitigating collapse. During policy updates, ECHO employs confidence-adaptive clipping and an entropy-confidence hybrid advantage shaping approach to enhance training robustness and mitigate early-stage bias. Experiments demonstrate that ECHO achieves consistent gains on multiple mathematical and visual reasoning benchmarks, and generalizes more effectively under a limited rollout budget.


2602.01990 2026-05-28 cs.LG cs.AI

SAME: Stabilized Mixture-of-Experts for Multimodal Continual Instruction Tuning

SAME: 用于多模态持续指令调优的稳定混合专家模型

Zhen-Hao Xie, Jun-Tao Tang, Yu-Cheng Shi, Han-Jia Ye, De-Chuan Zhan, Da-Wei Zhou

AI总结针对多模态持续指令调优中专家路由漂移和专家漂移问题，提出稳定混合专家模型（SAME），通过正交子空间分解路由动态和曲率感知缩放更新专家，实现无重放的最先进（SOTA）性能。

Comments Accepted to ICML 2026. Code is available at https://github.com/LAMDA-CL/Prism

详情

AI中文摘要

多模态大语言模型（MLLMs）通过指令调优实现了强大的性能，但实际部署需要它们持续扩展能力，这使得多模态持续指令调优（MCIT）变得至关重要。最近的方法利用稀疏专家路由来促进任务专业化，但我们发现专家路由过程会随着数据分布的演变而发生漂移。例如，之前激活定位专家的接地查询在学习OCR任务后可能被路由到不相关的专家。同时，与接地相关的专家可能被新任务覆盖而失去原有功能。这种失败反映了两个问题：路由器漂移（专家选择随时间变得不一致）和专家漂移（共享专家跨任务被覆盖）。因此，我们提出了用于MCIT的稳定混合专家模型（SAME）。为了解决路由器漂移，SAME通过将路由动态分解为正交子空间并仅更新任务相关方向来稳定专家选择。为了缓解专家漂移，我们通过使用历史输入协方差进行曲率感知缩放来调节专家更新，无需重放。SAME还引入了自适应专家激活，在训练期间冻结选中的专家，减少冗余计算和跨任务干扰。我们还引入了一个新的基准来评估长任务序列的MCIT，大量实验证明了SAME的最优性能。代码可在 https://github.com/LAMDA-CL/Prism 获取。

英文摘要

Multimodal Large Language Models (MLLMs) achieve strong performance through instruction tuning, but real-world deployment requires them to continually expand their capabilities, making Multimodal Continual Instruction Tuning (MCIT) essential. Recent methods leverage sparse expert routing to promote task specialization, but we find that the expert routing process suffers from drift as the data distribution evolves. For example, a grounding query that previously activated localization experts may instead be routed to irrelevant experts after learning OCR tasks. Meanwhile, the grounding-related experts can be overwritten by new tasks and lose their original functionality. Such failure reflects two problems: router drift, where expert selection becomes inconsistent over time, and expert drift, where shared experts are overwritten across tasks. Therefore, we propose StAbilized Mixture-of-Experts (SAME) for MCIT. To address router drift, SAME stabilizes expert selection by decomposing routing dynamics into orthogonal subspaces and updating only task-relevant directions. To mitigate expert drift, we regulate expert updates via curvature-aware scaling using historical input covariance in a rehearsal-free manner. SAME also introduces adaptive expert activation to freeze selected experts during training, reducing redundant computation and cross-task interference. We also introduce a new benchmark to evaluate MCIT with long task sequences, and extensive experiments demonstrate SAME's SOTA performance. Code is available at https://github.com/LAMDA-CL/Prism.


2602.01745 2026-05-28 cs.LG cs.AI

Probability-Entropy Calibration: An Elastic Indicator for Adaptive Fine-tuning

概率-熵校准：一种用于自适应微调的弹性指标

Wenhao Yu, Shaohang Wei, Jiahong Liu, Yifan Li, Minda Hu, Aiwei Liu, Hao Zhang, Irwin King

AI总结提出概率-熵校准信号（相对排名指标）进行token级重加权，以平衡预训练先验与下游对齐，在数学推理、分布外推理和代码生成任务上优于仅基于概率或熵的方法。

Comments Accepted by ICML 2026

详情

AI中文摘要

Token级重加权是一种简单但有效的控制监督微调的机制，但常见的指标很大程度上是单维的：真实概率反映下游对齐，而token熵反映预训练先验引起的内在不确定性。忽略熵可能会将噪声或易替换的token误识别为学习关键，而忽略概率则无法反映目标特定的对齐。RankTuner引入了一种概率-熵校准信号，即相对排名指标，它比较真实token的排名与其在预测分布下的预期排名。逆指标作为token级的相对尺度用于重加权微调目标，将更新集中在真正未学习充分的token上，而不过度惩罚内在不确定的位置。在多个骨干网络上的实验表明，在数学推理基准上持续改进，在分布外推理上获得迁移增益，并且在代码生成性能上优于仅基于概率或熵的重加权基线。

英文摘要

Token-level reweighting is a simple yet effective mechanism for controlling supervised fine-tuning, but common indicators are largely one-dimensional: the ground-truth probability reflects downstream alignment, while token entropy reflects intrinsic uncertainty induced by the pre-training prior. Ignoring entropy can misidentify noisy or easily replaceable tokens as learning-critical, while ignoring probability fails to reflect target-specific alignment. RankTuner introduces a probability-entropy calibration signal, the Relative Rank Indicator, which compares the rank of the ground-truth token with its expected rank under the prediction distribution. The inverse indicator is used as a token-wise Relative Scale to reweight the fine-tuning objective, focusing updates on truly under-learned tokens without over-penalizing intrinsically uncertain positions. Experiments on multiple backbones show consistent improvements on mathematical reasoning benchmarks, transfer gains on out-of-distribution reasoning, and better code generation performance over probability-only or entropy-only reweighting baselines.
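The two quantities that the Relative Rank Indicator compares can be computed as in the sketch below. The abstract only states that the ground-truth token's rank is compared with its expected rank under the prediction distribution. Expressing that comparison as a ratio, the tie-handling rule, and all function names are assumptions made for illustration.

```python
# Hypothetical sketch: the ratio form and the tie handling are assumptions;
# the abstract does not give the exact formula.

def token_ranks(probs):
    """Rank of each token: 1 + number of tokens with strictly higher probability."""
    return [1 + sum(1 for other in probs if other > p) for p in probs]


def expected_rank(probs):
    """Expected rank of a token sampled from the prediction distribution."""
    ranks = token_ranks(probs)
    return sum(p * r for p, r in zip(probs, ranks))


def rank_ratio(probs, ground_truth_index):
    """Ground-truth rank divided by the expected rank (illustrative comparison)."""
    return token_ranks(probs)[ground_truth_index] / expected_rank(probs)


confident = [0.7, 0.2, 0.1]            # peaked prediction, low entropy
uncertain = [0.25, 0.25, 0.25, 0.25]   # flat prediction, high entropy

peaked_miss = rank_ratio(confident, ground_truth_index=2)
flat_case = rank_ratio(uncertain, ground_truth_index=0)
```

Under this reading, a ground-truth token ranked far below what a confident distribution expects yields a large ratio, while a token at an intrinsically uncertain position stays near one and is not singled out.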
