arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2080
2605.12702 2026-05-14 cs.AI cs.HC

DisaBench: A Participatory Evaluation Framework for Disability Harms in Language Models

Eugenia Kim, Ioana Tanase, Christina Mallon

AI总结 本文提出 DisaBench,一个用于评估语言模型中与残疾相关危害的参与式评价框架。该框架通过与残疾人士和红队专家共同创建的十二类残疾危害分类,结合七类生活场景中的良性与对抗性提示,构建了一个包含175个提示和525对标注响应的数据集。研究发现,残疾相关危害因类型不同而差异显著,并在非文本模态中叠加出现,且其评估具有文化与时间依赖性,常规安全评估难以识别细微危害。该框架强调残疾危害的个人性、交叉性和社区定义特征,现有通用安全基准难以全面捕捉此类问题。

详情
英文摘要

General-purpose safety benchmarks for large language models do not adequately evaluate disability-related harms. We introduce DisaBench: a taxonomy of twelve disability harm categories co-created with people with disabilities and red teaming experts, a taxonomy-driven evaluation methodology that pairs benign and adversarial prompts across seven life domains, and a dataset of 175 prompts with human-annotated labels on 525 prompt-response pairs. Annotation by four evaluators with lived disability experience reveals three findings: harm rates vary sharply by disability type and will compound in non-text modalities, terminology-driven harm is culturally and temporally bound rather than universally assessable, and standard safety evaluation catches overt failures while missing the subtle harms that only domain expertise can recognize. Disability harm is simultaneously personal, intersectional, and community-defined: it cannot be isolated from the full context of who a person is, and general-purpose benchmarks systematically miss it. We will release the dataset, taxonomy, and methodology via Hugging Face and an open-source red teaming framework for direct integration into existing safety pipelines with no additional infrastructure.

2605.12701 2026-05-14 cs.LG cs.AI cs.CE cs.CY

Do Fair Models Reason Fairly? Counterfactual Explanation Consistency for Procedural Fairness in Credit Decisions

Gideon Popoola, John Sheppard

AI总结 在信用决策等社会敏感领域,现有公平机器学习模型虽然能够实现预测结果的公平性,但仍可能在推理过程中对不同群体采用不同的逻辑,形成“隐藏的过程性偏差”。本文提出一种名为反事实解释一致性(CEC)的框架,通过对齐个体与其反事实样本的特征归因,检测并缓解这种偏差,并引入新的过程性公平度量与训练损失函数。实验表明,CEC能有效减少模型的隐藏偏差,且对模型性能的影响较小。

详情
英文摘要

Machine learning algorithms in socially sensitive domains (e.g., credit decisions) often focus on equalizing predictive outcomes. However, satisfying these metrics does not guarantee that models use the same reasoning for different groups. We show that existing outcome-fair models can still apply fundamentally different reasoning to individuals, a ``hidden procedural bias'' missed by standard fairness metrics and algorithms. We propose Counterfactual Explanation Consistency (CEC), a framework that detects and mitigates this bias by aligning feature attributions between individuals and their counterfactual counterparts. Key contributions include a nearest-neighbor counterfactual generation method, a modified baseline for integrated gradient comparisons, an individual-level procedural fairness metric, and a corresponding training loss. We introduce a taxonomy identifying ``Regime B'' (same outcome, different reasoning) as a critical blind spot. Experiments on synthetic data, German Credit, Adult Income, and HMDA mortgage data demonstrate that outcome-fair baselines exhibit substantial hidden bias, while CEC substantially reduces it with modest utility cost.

2605.12700 2026-05-14 cs.LG cs.NA math.NA

UFO: A Domain-Unification-Free Operator Framework for Generalized Operator Learning

Hanli Qiao, George Em Karniadakis, Muhammad Muniruzzaman

AI总结 本文提出了一种名为UFO的跨域神经算子框架,能够在不同表示域之间进行自适应的联合条件交互,无需统一域表示即可实现算子学习。该框架支持输入与输出的离散化解耦,允许在训练时未使用的分辨率或位置进行预测,提升了模型的灵活性和泛化能力。实验表明,UFO在多个具有不连续输入、谱不匹配、非线性动力学和随机高频场等挑战的基准任务中,均能提供准确、鲁棒且物理一致的预测结果。

详情
英文摘要

Neural operators have become an effective framework for learning mappings between function spaces, yet most existing architectures realize operators within a single representational domain, such as physical, spectral, or latent space. In this work, we introduce UFO (Domain-Unification-Free Operator), a cross-domain neural operator framework that realizes operators through adaptive, jointly conditioned interactions among representations defined on distinct domains. UFO enables discretization decoupling: the input function can be observed at resolutions or locations different from those used during training, while the solution can be queried at arbitrary output resolutions. Across four complementary benchmarks covering discontinuous inputs, irregular sampling with spectral mismatch, nonlinear dynamics, and stochastic high-frequency fields, UFO delivers accurate, robust, and physically coherent predictions under distribution shifts. These results establish cross-domain, phase-modulated realization as a powerful framework for discretization-decoupled neural operator learning.

2605.12699 2026-05-14 cs.LG cs.AI

Modeling Heterophily in Multiplex Graphs: An Adaptive Approach for Node Classification

Kamel Abdous, Nairouz Mrabah, Mohamed Bouguessa

AI总结 该论文研究了在多层图中建模异质性(heterophily)的问题,即相连节点可能属于不同类别且属性差异较大的情况。现有方法多假设同质性(homophily),难以处理多层图中同时存在的同质与异质交互。为此,作者提出了一种名为\methodname的新方法,通过引入维度特定的兼容性矩阵和可训练的低通与高通滤波器,动态适应不同维度的异质特性,从而更有效地进行节点分类。实验表明,该方法在合成和真实数据集上均取得了优于现有方法的分类性能。

Comments 38 pages, 7 figures, 4 tables, 1 algorithm. Published in Expert Systems with Applications

详情
Journal ref
Expert Systems with Applications, Volume 323, 2026, Article 132374
英文摘要

Existing multiplex graph models often assume homophily, where connected nodes tend to belong to the same class or share similar attributes. Consequently, these models may struggle with graphs exhibiting heterophily, where connected nodes typically belong to different classes and have dissimilar attributes. While recent methods have been developed to learn reliable node representations from unidimensional graphs with heterophily, they do not fully address the complexities of multiplex graphs. In a multiplex graph, nodes are linked through multiple types of edges (referred to as dimensions), which can simultaneously exhibit homophilic and heterophilic interactions. To address this gap, we propose \methodname, a novel method for node classification in multiplex graphs that adapts to both homophilic and heterophilic dimensions. \methodname introduces dimension-specific compatibility matrices to model varying degrees of homophily and heterophily across dimensions. A key innovation is its use of a product of trainable low-pass and high-pass filters, approximated via Chebyshev polynomials, to capture both smooth and abrupt changes in the graph signal. By composing these filters and optimizing label predictions using a proximal-gradient method, \methodname dynamically adjusts to the heterophilic characteristics of each dimension. Extensive experiments on synthetic and real-world datasets provide evidence that \methodname captures the complex interplay of homophilic and heterophilic interactions in multiplex graphs, and tends to yield improved node classification performance compared to state-of-the-art methods.

2605.12693 2026-05-14 cs.LG

IGT-OMD: Implicit Gradient Transport for Decision-Focused Learning under Delayed Feedback

Benjamin Amoh, Geoffrey G. Parker, Wesley Marrero

AI总结 该研究针对延迟反馈环境下决策导向学习中的挑战,提出了一种新的算法IGT-OMD,用于解决双层优化中的梯度陈旧问题。通过隐式梯度传输技术,该方法在在线镜像下降中重新评估存储的内部解,从而将运输误差从延迟的二次依赖降低到线性依赖,并首次实现了具有自适应步长的延迟双层优化的次线性遗憾界。实验表明,该方法在多个任务中显著降低了决策损失,验证了理论分析的有效性。

Comments 9 pages, 4 figures, NeurIPS 2026 conference

详情
英文摘要

Decision-focused learning trains predictive models end-to-end against downstream decision loss, but online settings suffer delayed feedback: outcomes may not arrive for many environment interactions. We identify \emph{staleness amplification}, a failure mode unique to bilevel optimization under delay, in which gradient staleness couples with inner-solver sensitivity to inflate regret beyond single-level delay theory. We prove that any black-box delayed optimizer incurs an irreducible regret cost from inner-solver approximation error, and that gradient staleness contributes a quadratically growing transport error without bilevel-aware correction. Our algorithm, \textbf{IGT-OMD}, applies Implicit Gradient Transport to hypergradients within Online Mirror Descent, re-evaluating stale gradients at the current parameters using stored inner solutions. This method reduces transport error from a quadratic to a linear dependence on delay and achieves the first sublinear regret bound for delayed bilevel optimization with queue-length-adaptive step sizes. Controlled experiments provide a \emph{mechanistic fingerprint}: transport benefit is exactly $0.0\%$ ($p=1.00$) at unit delay and grows monotonically to $9.5\%$ at fifty rounds ($p<0.001$), isolating the correction's effect. On Linear Quadratic Regulator, Warcraft shortest-path, and Sinkhorn optimal transport, IGT-OMD reduces decision loss by $17$--$55\%$ relative to single-level baselines, with phase transitions matching the theory.

2605.12691 2026-05-14 cs.AI

On the Size Complexity and Decidability of First-Order Progression

Jens Classen, Daxin Liu

AI总结 本文研究了在一阶逻辑框架下动作进展(progression)的规模复杂性与可判定性问题。作者在情境演算(Situation Calculus)框架下,分析了具有局部效应、正常和无环等特性的动作类别的进展规模,证明在合理假设下其规模仅呈多项式增长。此外,当知识库属于可判定的逻辑片段(如二元一阶逻辑或带有常量的全称理论)时,进展仍保持在相同片段内,从而保证了可判定性和实际应用价值。

Comments This is an extended version of an identically-titled paper accepted for publication at IJCAI 2026. This version contains an appendix with further proofs

详情
英文摘要

Progression, the task of updating a knowledge base to reflect action effects, generally requires second-order logic. Identifying first-order special cases, by restricting either the knowledge base or action effects, has long been a central topic in reasoning about actions. It is known that local-effect, normal, and acyclic actions, three increasingly expressive classes, admit first-order progression. However, a systematic analysis of the size of such progressions, crucial for practical applications, has been missing. In this paper, using the framework of Situation Calculus, we show that under reasonable assumptions, first-order progression for these action classes grows only polynomially. Moreover, we show that when the KB belongs to decidable fragments such as two-variable first-order logic or universal theories with constants, the progression remains within the same fragment, ensuring decidability and practical applicability.

2605.12685 2026-05-14 cs.LG cs.AI

A Unified Perspective for Learning Graph Representations Across Multi-Level Abstractions

Mohamed Mahmoud Amar, Nairouz Mrabah, Mohamed Bouguessa, Abdoulaye Baniré Diallo

AI总结 该论文提出了一种统一的对比学习框架,用于从节点级、邻近级、聚类级和图级等多个抽象层次学习图结构数据的表示。为了解决现有方法大多只关注单一抽象层次的问题,该方法通过相似度与不相似度分数的线性组合整合多级信息,并引入一种无需参数的细粒度自适应加权机制,以增强优化灵活性并提升模型收敛性。实验表明,该方法在多个下游任务中优于现有先进方法,适用于单层次和多层次场景。

Comments Accepted for publication in IEEE Transactions on Knowledge and Data Engineering (TKDE). 18 pages, 8 figures

详情
英文摘要

Graph Self-Supervised Learning (GSSL) has emerged as a powerful paradigm for generating high-quality representations for graph-structured data. While multi-scale graph contrastive learning has received increasing attention, many existing methods still predominantly focus on a single graph abstraction level. To address this limitation, we propose a unified contrastive framework that can target node-level, proximity-level, cluster-level, and graph-level information and integrate them through a linear combination of similarity scores on positive pairs and dissimilarity scores (i.e., similarity scores on negative pairs). Furthermore, current approaches typically assign uniform penalty strengths to all examples, which reduces optimization flexibility and leads to ambiguous convergence status. To overcome this, we introduce a novel parameter-free fine-grained self-weighting mechanism that adaptively assigns weights to individual similarity and dissimilarity scores. The proposed mechanism emphasizes the scores that deviate significantly from their target values. Our approach not only enhances optimization flexibility but also eliminates the computational overhead of hyperparameter tuning in conventional multi-task GSSL methods. Comprehensive experiments on real-world datasets show that our methods consistently outperform state-of-the-art approaches across downstream tasks, including classification, clustering, and link prediction, in both single-level and multi-level scenarios.

2605.12684 2026-05-14 cs.CV cs.AI cs.HC

Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty?

Yichen Feng, Yuetai Li, Chunjiang Liu, Yuanyuan Chen, Fengqing Jiang, Yue Huang, Hang Hua, Zhengqing Yuan, Kaiyuan Zheng, Luyao Niu, Bhaskar Ramasubramanian, Basel Alomair, Xiangliang Zhang, Misha Sra, Zichen Chen, Radha Poovendran, Zhangchen Xu

AI总结 该研究探讨了前沿多模态大语言模型在视觉审美判断方面的能力,指出当前模型在判断图像美感时存在显著不足。研究引入了“视觉审美基准”(VAB),通过专家标注的对比任务评估模型表现,发现即使是最好的模型在识别最佳和最差图像时也远不如人类专家。研究还表明,通过少量专家示例对模型进行微调,可以显著提升其性能,凸显了VAB在推动审美判断模型发展中的重要价值。

Comments Project page: https://vab.bakelab.ai. Code: https://github.com/BakeLab/Visual-Aesthetic-Benchmark. Dataset: https://huggingface.co/datasets/BakeLab/Visual-Aesthetic-Benchmark

详情
英文摘要

Multimodal large language models (MLLMs) are now routinely deployed for visual understanding, generation, and curation. A substantial fraction of these applications require an explicit aesthetic judgment. Most existing solutions reduce this judgment to predicting a scalar score for a single image. We first ask whether such scores faithfully capture comparative preference: in a controlled study with eight expert annotators, score-derived rankings align poorly with the same annotators' direct comparisons, while direct ranking yields substantially higher inter-annotator agreement on best- and worst-image labels. Motivated by this finding, we introduce the Visual Aesthetic Benchmark (VAB), which casts aesthetic evaluation as comparative selection over candidate sets with matched subject matter. VAB contains 400 tasks and 1,195 images across fine art, photography, and illustration, with labels derived from the consensus of 10 independent expert judges per task. Evaluating 20 frontier MLLMs and six dedicated visual-quality reward models, we find that the strongest system identifies both the best and the worst image correctly across three random permutations of the candidate order in only 26.5% of tasks, far below the 68.9% achieved by human experts. Fine-tuning a 35B-parameter model on 2,000 expert examples brings its accuracy close to that of a 397B-parameter open-weight model, suggesting that the comparative signal in VAB is transferable. Together, these results expose a clear and measurable gap between current multimodal models and expert aesthetic judgment, and VAB provides the first set-based, expert-grounded testbed on which that gap can be tracked and closed.

2605.12683 2026-05-14 cs.LG cs.AI cs.DC physics.comp-ph

Parallel-in-Time Training of Recurrent Neural Networks for Dynamical Systems Reconstruction

Florian Hess, Florian Götz, Daniel Durstewitz

AI总结 本文研究了如何通过时间并行化方法提高递归神经网络在动态系统重建任务中的训练效率。作者提出了两种基于并行关联扫描的算法,分别适用于线性非自主动力学模型和通用非线性模型,并发现前者在训练时存在限制,难以准确学习非线性动力学。为此,作者将广义教师强制(GTF)引入DEER框架,有效提升了模型在长序列上的学习能力,实验表明长轨迹数据对具有长时程特征的动态系统重建具有显著提升作用。

Comments 29 pages, 6 figures, preprint

详情
英文摘要

Reconstructing nonlinear dynamical systems (DS) from data (DSR) is a fundamental challenge in science and engineering, but it inherently relies on sequential models. Recent breakthroughs for sequential models have produced algorithms that parallelize computation along sequence length $T$, achieving logarithmic time complexity, $\mathcal{O}(\log T)$. Since sequence lengths have been practically limited due to the linear runtime complexity $\mathcal{O}(T)$ of classical backpropagation through time, this opens new avenues for DSR. This paper studies two prominent classes of parallel-in-time algorithms for this task, both of which leverage parallel associative scans as their core computational primitive. The first class comprises models with linear yet non-autonomous dynamics and a nonlinear readout, such as modern State Space Models (SSMs), while the second consists of general nonlinear models which can be parallelized using the DEER framework. We find that the linear training-time recurrence of the first class of models imposes limitations that often hinder learning of accurate nonlinear dynamics. To address this, we augment DEER with Generalized Teacher Forcing (GTF), a novel variant within the more general nonlinear framework that ensures stable and effective learning of nonlinear dynamics across arbitrary sequence lengths. Using GTF-DEER, we investigate the benefits of training on extremely long sequences ($T>10^4$) for DSR. Our results show that access to such long trajectories significantly improves DSR if the data features long time scales. This work establishes GTF-DEER as a robust tool for data-driven discovery and underscores the largely untapped potential of long-sequence learning in modeling complex DS.

2605.12682 2026-05-14 cs.AI

Learning Transferable Latent User Preferences for Human-Aligned Decision Making

Alina Hyk, Sandhya Saisubramanian

AI总结 该研究旨在解决大语言模型在生成人类对齐决策时面临的挑战,即如何从有限的交互中学习可迁移的潜在用户偏好。为此,作者提出了CLIPR框架,通过少量对话输入学习可操作的自然语言规则,以表示用户的潜在偏好,并通过自适应反馈不断优化这些规则。实验表明,CLIPR在多个任务和环境中均能有效提升决策对齐度并降低推理成本。

详情
英文摘要

Large language models (LLMs) are increasingly used as reasoning modules in many applications. While they are efficient in certain tasks, LLMs often struggle to produce human-aligned solutions. Human-aligned decision making requires accounting for both explicitly stated goals and latent user preferences that shape how ambiguous situations should be resolved. Existing approaches to incorporating such preferences either rely on extensive and repeated user interactions or fail to generalize latent preferences across tasks and contexts, limiting their practical applicability. We consider a setting in which an LLM is used for high-level reasoning and is responsible for inferring latent user preferences from limited interactions, which guides downstream decision making. We introduce CLIPR (Conversational Learning for Inferring Preferences and Reasoning), a framework that learns actionable, transferable natural language rules that represent latent user preferences from minimal conversational input. These rules are iteratively refined through adaptive feedback and applied to both in-distribution and out-of-distribution ambiguous tasks across multiple environments. Evaluations on three datasets and a user study show that CLIPR consistently outperforms existing methods in improving alignment and reducing inference costs.

2605.12674 2026-05-14 cs.AI cs.LG cs.RO

Revealing Interpretable Failure Modes of VLMs

Isha Chaudhary, Vedaant V Jain, Kavya Sachdeva, Sayan Ranu, Gagandeep Singh

AI总结 该论文提出了一种名为REVELIO的框架,用于系统性地揭示视觉-语言模型(VLMs)中可解释的失效模式。研究通过结合多样性感知的束搜索和高斯过程汤普森采样策略,高效探索VLM在特定场景下的失效组合空间。实验表明,该方法在自动驾驶和室内机器人任务中发现了现有VLM的潜在漏洞,为提升模型安全性提供了结构化且可解释的改进方向。

详情
英文摘要

Vision-Language Models (VLMs) are increasingly used in safety-critical applications because of their broad reasoning capabilities and ability to generalize with minimal task-specific engineering. Despite these advantages, they can exhibit catastrophic failures in specific real-world situations, constituting failure modes. We introduce REVELIO, a framework for systematically uncovering interpretable failure modes in VLMs. We define a failure mode as a composition of interpretable, domain-relevant concepts-such as pedestrian proximity or adverse weather conditions-under which a target VLM consistently behaves incorrectly. Identifying such failures requires searching over an exponentially large discrete combinatorial space. To address this challenge, REVELIO combines two search procedures: a diversity-aware beam search that efficiently maps the failure landscape, and a Gaussian-process Thompson Sampling strategy that enables broader exploration of complex failure modes. We apply REVELIO to autonomous driving and indoor robotics domains, uncovering previously unreported vulnerabilities in state-of-the-art VLMs. In driving environments, the models often demonstrate weak spatial grounding and fail to account for major obstructions, leading to recommendations that would result in simulated crashes. In indoor robotics tasks, VLMs either miss safety hazards or behave excessively conservatively, producing false alarms and reducing operational efficiency. By identifying structured and interpretable failure modes, REVELIO offers actionable insights that can support targeted VLM safety improvements.

2605.12673 2026-05-14 cs.AI cs.CR

Do Androids Dream of Breaking the Game? Systematically Auditing AI Agent Benchmarks with BenchJack

Hao Wang, Hanchen Li, Qiuyang Mang, Alvin Cheung, Koushik Sen, Dawn Song

AI总结 该论文研究了人工智能代理基准测试中的奖励黑客问题,即代理通过非预期方式最大化得分而非完成任务的现象。为此,作者提出了 BenchJack 系统,通过自动化红队测试方法系统性地审计基准测试,识别潜在的奖励黑客漏洞。研究还构建了一个迭代生成对抗流程,不断发现并修复新漏洞,显著提升了基准测试的安全性。实验表明,BenchJack 能在多个主流基准中发现大量漏洞,并有效降低了可被攻击的任务比例。

详情
英文摘要

Agent benchmarks have become the de facto measure of frontier AI competence, guiding model selection, investment, and deployment. However, reward hacking, where agents maximize a score without performing the intended task, emerges spontaneously in frontier models without overfitting. We argue that benchmarks must be secure by design. From past incidents of reward hacks, we derive a taxonomy of eight recurring flaw patterns and compile them into the Agent-Eval Checklist for benchmark designers. We condense the insights into BenchJack, an automated red-teaming system that drives coding agents to audit benchmarks and identify possible reward-hacking exploits in a clairvoyant manner. Moreover, we extend BenchJack to an iterative generative-adversarial pipeline that discovers new flaws and patches them iteratively to improve benchmark robustness. We apply BenchJack to 10 popular agent benchmarks spanning software engineering, web navigation, desktop computing, and terminal operations. BenchJack synthesizes reward-hacking exploits that achieve near-perfect scores on most of the benchmarks without solving a single task, surfacing 219 distinct flaws across the eight classes. Moreover, BenchJack's extended pipeline reduces the hackable-task ratio from near 100% to under 10% on four benchmarks without fatal design flaws, fully patching WebArena and OSWorld within three iterations. Our results show that evaluation pipelines have not internalized an adversarial mindset, and that proactive auditing could help close the security gap for the fast-paced benchmarking space.

2605.12671 2026-05-14 cs.CL

All Circuits Lead to Rome: Rethinking Functional Anisotropy in Circuit and Sheaf Discovery for LLMs

Xi Chen, Mingyu Jin, Jingcheng Niu, Yutong Yin, Jinman Zhao, Bangwei Guo, Dimitris N. Metaxas, Zhaoran Wang, Yutao Yue, Gerald Penn

AI总结 本文挑战了大型语言模型(LLMs)中电路与sheaf发现(CSD)领域的一个核心假设——功能各向异性假设,即认为模型功能由单一或近似唯一的内部机制实现。研究通过实证和理论分析表明,同一任务可由多个结构不同的电路或sheaf同时完成,且它们均具备稀疏性、完整性和任务表现力。为此,作者提出了一种结构重叠感知的sheaf排斥方法,有效揭示了具有高性能但结构差异显著的替代机制,并提出了分布式稠密电路假设,解释了在高维叠加下非唯一、低重叠的电路解释为何自然出现。

Comments ICML 2026

详情
英文摘要

In this paper, we present empirical and theoretical evidence against a central but largely implicit assumption in circuit and sheaf discovery (CSD), which we term the Functional Anisotropy Hypothesis: the idea that functions in large language models (LLMs) are localised to a unique or near-unique internal mechanism. We show that a single LLM task can instead be supported by multiple, structurally distinct circuits or sheaves that are simultaneously faithful, sparse, and complete. To systematically uncover such competing mechanisms, we introduce Overlap-Aware Sheaf Repulsion, a method that augments the CSD objective with an explicit penalty on structural overlap across multiple discovery runs, enabling the discovery of circuits or sheaves with strong task performance but minimal shared structure across a plethora of common CSD benchmarks. We find that this phenomenon becomes increasingly pronounced as the number of discovered sheaves grows and persists robustly across major CSD methods. We further identify an ultra-sparse three-edge sheaf and show that none of its edges is individually indispensable, undermining even weakened notions of canonical or essential components. To explain these findings, we propose a Distributive Dense Circuit Hypothesis and provide a theoretical analysis demonstrating that non-unique, low-overlap circuit explanations arise naturally from high-dimensional superposition under mild assumptions. Together, our results suggest that mechanistic explanations in LLMs are inherently non-canonical and call for a rethinking of how CSD results should be interpreted and evaluated.

2605.12662 2026-05-14 cs.LG q-bio.GN

scShapeBench: Discovering geometry from high dimensional scRNAseq data

Andrew J Steindl, João Felipe Rocha, Brian Tshilengi Di Bassinga, Zachary Warren, Matthew Scicluna, César Miguel Valdez Córdova, Shabarni Gupta, Leire Torices, Daniel Neumann, Timothy J. Mann, Ihuan Gunawan, Dhananjay Bhaskar, John G Lock, Christine L Chaffer, Guy Wolf, Smita Krishnaswamy

AI总结 scShapeBench 是一个用于单细胞转录组数据形状检测的基准数据集,旨在自动识别数据中的几何结构,如聚类、轨迹和典型模式,从而辅助选择合适的下游分析流程。该研究引入了 scReebTower 方法,基于扩散几何提取 Reeb 图,实现了可视化与分析流程的自动匹配,并提供了拓扑感知的评估指标。实验表明,scReebTower 在合成和真实数据上均优于现有方法,为单细胞数据的自动化分析提供了重要工具。

详情
英文摘要

High-dimensional point cloud data arise across many scientific domains, especially single-cell biology. The shapes or topologies of these datasets determine the types of information that can be extracted. For example, clustered data supports cell-type identification, trajectory structures support transition analysis, and archetypal structures capture continua of cellular behaviors. Existing analysis pipelines often assume a specific shape. The standard Seurat pipeline combines UMAP visualization with Louvain clustering and therefore assumes clustered data, while tools such as Monocle and SPADE assume tree-like structures, and flow-based models such as MIOFlow and Conditional Flow Matching target trajectories. Choosing which pipeline to apply is therefore often left to bioinformaticians who visually inspect datasets before selecting an analysis strategy. With the rise of agentic AI scientists, automating shape detection is increasingly important for selecting downstream analysis pipelines. To address this problem, we introduce scShapeBench, a benchmark dataset for shape detection containing both synthetic and expert-annotated single-cell datasets. Synthetic datasets are sampled from ground-truth skeleton graphs with controlled variance. Real single-cell datasets are curated from diverse sources and annotated by experts into four categories: clusters, single trajectory, multi-branching, and archetypal. We additionally introduce scReebTower, a baseline method that uses diffusion geometry to extract Reeb graphs and connect visualization with pipeline selection. We provide topology-aware evaluation metrics and compare scReebTower against PAGA and Mapper on synthetic and real data. Our results indicate that scReebTower outperforms existing baselines. Overall, our contributions span benchmarks, evaluation metrics, and a baseline for automated shape detection in single-cell data.

2605.12654 2026-05-14 cs.RO

COSMIC: Concurrent Optimization of Structure, Material, and Integrated Control for robotic systems

Qinsong Guo, Liwei Wang

AI总结 本文提出了一种基于梯度的协同设计框架COSMIC,用于同时优化机器人的结构、材料和控制策略,以实现超越传统分步设计的性能。该框架通过将混合类型的拓扑和材料变量嵌入连续设计空间,并结合可微分模拟器中的神经网络控制器,实现了对结构、材料与控制策略之间交互关系的高效建模与梯度计算。研究展示了该方法在多样化的运动策略优化和适应不同功能需求方面的有效性,并揭示了各设计要素对机器人性能的独立与协同影响。

详情
英文摘要

Replicating and surpassing the autonomy of natural organisms remains a long-standing goal in robotics. Yet most robotic systems have their structure, materials, and control designed separately, in sharp contrast to the co-evolution in nature. This separation often leads to suboptimal designs, and we still have a limited understanding of the individual and collective contributions of these design entities. In this work, we propose a gradient-based co-design framework that simultaneously optimizes the topology, material distribution, and control policy of a truss-lattice robot. The framework embeds mixed-type topological and material variables into a continuous design space and integrates a neural network controller within a differentiable simulator, capturing their interactions and enabling efficient gradient calculation via automatic differentiation. Furthermore, we develop a constrained optimization to navigate the highly non-convex design landscape and jointly optimize all design entities. Case studies demonstrate that the proposed framework consistently discovers diverse locomotion strategies that outperform baselines obtained through separated design. The framework is also flexible to accommodate different functional requirements and boundary conditions. Using this framework, we further extract design insights that reveal the individual and collective effects of different entities on robotic performance. The proposed framework provides a computational foundation for the autonomous co-design of robotic systems, capable of reconfiguration, locomotion, and other complex autonomous behaviors.

2605.12653 2026-05-14 cs.LG cs.AI stat.ML

Plan Before You Trade: Inference-Time Optimization for RL Trading Agents

Eun Go, Rohan Deb, Arindam Banerjee

AI总结 本文提出了一种名为FPILOT的推理时优化框架,用于改进强化学习在投资组合管理中的应用。该方法受模型预测控制启发,利用价格预测信息在推理阶段动态优化交易策略,而无需依赖训练时的固定策略。FPILOT能够在不重新训练策略的情况下,结合价格预测模型生成多步价格轨迹,并据此优化每一步的资产配置,从而在多个风险调整指标上显著提升交易表现。

详情
英文摘要

Reinforcement learning agents for portfolio management are typically trained and deployed as static policies, with no mechanism for using price forecasts at inference time. We propose $\text{FPILOT}$ (**Fin**ancial **P**lugin **I**nference-time **L**earning for **O**ptimal **T**rading), a plugin inference-time optimization framework inspired by Model Predictive Control (MPC). Our key structural insight is that future prices mostly do not depend on one agent's portfolio allocation, so a suitable predictive model can produce a multi-step price trajectory without iterative action-conditioned rollouts as in typical reinforcement learning. At each decision step, we use the forecaster's predicted price trajectory to construct an allocation-based imagined return objective, and optimize the policy at inference-time before executing one step of the trade. Our framework is compatible with any pre-trained agent and adapts the policy to the forecaster's predictions without any retraining. Evaluated across five policy learning algorithms on the TradeMaster DJ30 benchmark, $\text{FPILOT}$ produces consistent improvements in total return and return-based risk-adjusted metrics (Sharpe, Sortino, Calmar), with stochastic policies benefiting more than deterministic ones. Further, using synthetic forecasts at calibrated quality levels, we show that gains consistently improve with forecaster quality, suggesting that our performance will improve based on advances in financial forecasting.

2605.12650 2026-05-14 cs.CV

CRAFT: Clinical Reward-Aligned Finetuning for Medical Image Synthesis

Yunsung Chung, Alex El Darzi, Carlo El Khoury, Han Feng, Nassir Marrouche, Jihun Hamm

AI总结 该研究针对医学图像合成中基础扩散模型适应性不足的问题,提出了一种基于临床对齐的微调方法CRAFT。通过引入临床对齐分数(CAS)作为新的评估指标,CRAFT从多模态大语言模型中迁移医学知识,结合条件提示增强、临床检查表和可微奖励优化,显著提升了生成图像的临床相关性。实验表明,CRAFT在多个医学影像模态上不仅提高了CAS评分,还有效减少了生成图像的不真实现象,优于现有主流方法。

详情
英文摘要

Foundation diffusion models can generate photorealistic natural images, but adapting them to medical imaging remains challenging. In medical adaptation, limited labeled data can exacerbate hallucination-like and clinically implausible synthesis, while existing metrics such as FID or Inception Score do not quantify per-image alignment with pathology-relevant criteria. We introduce the Clinical Alignment Score (CAS), a foundation-model-based proxy for clinical alignment that evaluates generated images along four complementary dimensions beyond visual fidelity. Building on CAS, we propose Clinical Reward-Aligned Finetuning (CRAFT), a reward-based adaptation framework that transfers medical knowledge from multimodal large language models and vision-language models through label-conditioned prompt enrichment, clinical checklists, and differentiable reward optimization. Across four diverse modalities, CRAFT improves CAS and downstream classification performance over strong adaptation baselines. Beyond average CAS gains, CRAFT reduces the empirical low-alignment tail below a real-image reference threshold by 5.5-34.7% points relative to the strongest baseline, corresponding to a 20.4% average relative reduction across datasets. These results indicate fewer hallucination-like generations under CAS, and are corroborated by out-of-family evaluator evaluation, structured checklist auditing, memorization analysis, and a blinded physician preference study on CheXpert.

2605.12648 2026-05-14 cs.LG stat.ML

Population Risk Bounds for Kolmogorov-Arnold Networks Trained by DP-SGD with Correlated Noise

Puyu Wang, Jan Schuchardt, Nikita Kalinin, Junyu Zhou, Sophie Fellenz, Christoph Lampert, Marius Kloft

AI总结 本文首次为使用带有梯度裁剪的随机梯度下降(SGD)训练的柯尔莫戈罗夫-阿诺尔德网络(KAN)建立了群体风险界,涵盖了非隐私保护的SGD以及使用高斯扰动的差分隐私SGD(DP-SGD),其中扰动噪声在独立与时间相关之间进行插值。研究采用更贴近实际训练的批量SGD方法,并引入时间相关噪声机制,以改善隐私与效用的平衡。通过引入辅助未投影动态、偏移迭代和高概率引导分析,解决了非凸优化中相关噪声DP训练的分析难题,最终得到了KAN的群体风险界,为非凸学习中的相关噪声机制提供了首个优化与泛化分析。

详情
英文摘要

We establish the first population risk bounds for Kolmogorov-Arnold Networks (KANs) trained by mini-batch SGD with gradient clipping, covering non-private SGD as well as differentially private SGD (DP-SGD) with Gaussian perturbations that interpolate between independent and temporally correlated noise. This setting is substantially closer to practice than prior KAN theory along two axes: training is by mini-batch SGD, the standard recipe for modern networks, rather than full-batch gradient descent (GD); and correlated-noise mechanisms have empirically shown a more favorable privacy-utility tradeoff than independent-noise mechanisms. Our results cover the corresponding full-batch GD and independent-noise DP-GD results for KANs by Wang et al. (2026), while yielding sharper fixed-second-layer specializations. The technical core is a new analysis route for correlated-noise DP training in the non-convex regime. Temporal dependence breaks the conditional-centering structure underlying standard one-step SGD arguments, and the projection step obstructs the exact cancellation structure of correlated perturbations. We address these difficulties through an auxiliary unprojected dynamics, a shifted iterate that absorbs the current noise perturbation, and a high-probability bootstrap certifying projection inactivity. Combining this optimization analysis with a stability-based generalization argument yields the stated population risk bounds. To the best of our knowledge, this is the first optimization and population risk analysis of a correlated-noise mechanism for DP training beyond convex learning, in particular for neural networks.

2605.12645 2026-05-14 cs.CL cs.AI

Training LLMs with Reinforcement Learning for Intent-Aware Personalized Question Answering

Maryam Amirizaniani, Benjamin Charles Germain Lee, Jevin West, Nicholas Weber

AI总结 该研究旨在提升语言模型在单轮对话中的个性化问答能力,通过理解用户隐含的意图来生成更符合其深层目标的回答。为此,作者提出了一种基于强化学习的框架IAP,能够在仅凭单轮问题的情况下直接推断用户意图,并通过标签化机制将其融入推理过程,从而生成更具针对性的回答。实验表明,IAP在多个模型上均显著优于现有方法,验证了在训练过程中建模隐含用户意图的有效性。

详情
英文摘要

Effective personalized question answering (PQA) in language models requires grounding responses in the user's underlying intent, where intent refers to the implicit ``why'' behind a query beyond its explicit wording. However, existing approaches to intent-aware personalization rely on multi-turn conversational context or rich user profiles, and do not explicitly model user intent during the reasoning process. This limits their effectiveness in single-turn settings, where the user's latent goal must be inferred from minimal input and integrated into the thinking and reasoning process. To bridge this gap, we propose IAP (Intent-Aware Personalization), a reinforcement learning framework that trains models to infer implicit user intent directly from a single-turn question and incorporate it into thinking steps through a tag-based schema for generating personalized, intent-grounded answers. By optimizing intent-aware answer trajectories under a personalized reward function, IAP reinforces generation paths that make implicit user intent explicit and produce responses that better align with the user's underlying goal. Through experiments on the LaMP-QA benchmark across six models, IAP consistently outperforms all baselines, achieving an average macro-score gain of around 7.5\% over the strongest competitor, demonstrating that modeling implicit user intent within the training objective is a promising direction for PQA.

2605.12639 2026-05-14 cs.LG

OceanCBM: A Concept Bottleneck Model for Mechanistic Interpretability in Ocean Forecasting

Sanah Suri, Kieran Ringel, Maike Sonnewald

AI总结 本文提出 OceanCBM,一种用于海洋预报的机制可解释概念瓶颈模型,旨在解决传统机器学习模型在预测极端海洋现象时缺乏物理可解释性的问题。该模型通过混合监督方式预测海洋热含量,结合来自流体力学的预设概念和自由概念层,既保证了模型的物理一致性,又保持了预测性能。实验表明,OceanCBM 能在不牺牲预测能力的前提下,提供明确的物理机制解释,揭示了可解释性与性能之间的权衡关系。

Comments 17 pages, 9 figures, 4 tables

详情
英文摘要

Extreme ocean phenomena are challenging not only to predict but to diagnose, as accurate forecasts alone do not reveal the underlying physical drivers. While recent machine learning approaches achieve strong predictive skill, they remain largely opaque and provide limited guarantees of fidelity to ground-truth physics. We introduce OceanCBM, the first concept bottleneck model (CBM) for spatiotemporal prediction and mechanistic interrogation of ocean dynamics. OceanCBM uses mixed supervision to predict mixed layer heat content, a key precursor of marine heatwaves, while routing information through an intermediate layer of prescribed concepts derived from geophysical fluid dynamics and a 'free' concept. This design imposes soft physical structure without over-constraining the model, and the free concept both regularizes concept predictions and captures residual physical processes. Across ensemble initializations, we show that mixed supervision yields consistent mechanistic representations, whereas prediction-only and prescription-only baselines learn highly variable latent structures despite similar predictive performance. OceanCBM achieves interpretable, physically grounded representations without sacrificing skill, explicitly characterizing the interpretability-performance trade-off.

2605.12628 2026-05-14 cs.RO

Multistep Belief Space Dynamics Learning For Risk-Aware Control

Jason Gibson, Bogdan Vlahov, Patrick Spieler, Evangelos A. Theodorou

AI总结 本文研究了如何在自动驾驶系统中实现风险感知的控制,针对动态不确定性随时间演变的问题,提出了一种用于模型预测控制(MPC)的分布动态学习框架。该方法通过学习环境动力学的分布特性,能够在保证安全性的前提下优化控制策略,避免过于保守。实验表明,该方法在真实复杂的越野环境中表现出良好的适应性和智能行为。

详情
英文摘要

As autonomous vehicles move from a simplified research setting to practical use, there exists a large gap between the dynamic behavior of a human driving and an autonomous system. Risk-aware behavior needs to naturally develop in order to scale to the demands of the real world. A major issue for risk-aware planning and control has been predicting how dynamical uncertainty evolves through time and optimizing plans that account for this without being overly conservative. Here, we present a learning framework to predict distributional dynamics that can be optimized in real time for Model Predictive Control (MPC). We explore the importance of structure when learning distributional dynamics for use in MPC. A rigorous ablation study is conducted on a large dataset of real world off-road driving that shows the impact of deviations from our proposed structure. Furthermore, we deploy our learned model and planning stack on a full sized vehicle in challenging off-road conditions. Our planning architecture is able to naturally regulate the speed of the vehicle based on the environment and consistently demonstrates intelligent behavior over miles of diverse terrain.

2605.12325 2026-05-14 cs.CV

VIP: Visual-guided Prompt Evolution for Efficient Dense Vision-Language Inference

Hao Zhu, Shuo Jin, Wenbin Liao, Jiayu Xiao, Yan Zhu, Siyue Yu, Feng Dai

AI总结 该研究旨在解决无训练开放词汇语义分割中因CLIP模型存在空间偏差而导致的效率与泛化性难题。为此,作者提出了一种基于空间感知框架dino$.$txt的视觉引导提示进化(VIP)方法,通过引入视觉引导的蒸馏机制和别名扩展,提升文本查询的语义表达能力,从而实现更高效、更精确的密集预测。实验表明,VIP在多个基准数据集上取得了优于现有方法的性能,并具有良好的跨领域泛化能力和较低的推理开销。

Comments Accepted by ICML2026. Code is available at https://github.com/MiSsU-HH/VIP

详情
英文摘要

Pursuing training-free open-vocabulary semantic segmentation in an efficient and generalizable manner remains challenging due to the deep-seated spatial bias in CLIP. To overcome the limitations of existing solutions, this work moves beyond the CLIP-based paradigm and harnesses the recent spatially-aware dino$.$txt framework to facilitate more efficient and high-quality dense prediction. While dino$.$txt exhibits robust spatial awareness, we find that the semantic ambiguity of text queries gives rise to severe mismatch within its dense cross-modal interactions. To address this, we introduce Visual-guided Prompt evolution (VIP) to rectify the semantic expressiveness of text queries in dino$.$txt, unleashing its potential for fine-grained object perception. Towards this end, VIP integrates alias expansion with a visual-guided distillation mechanism to mine valuable semantic cues, which are robustly aggregated in a saliency-aware manner to yield a high-fidelity prediction. Extensive evaluations demonstrate that VIP: 1. surpasses the top-leading methods by 1.4%-8.4% average mIoU, 2. generalizes well to diverse challenging domains, and 3. requires marginal inference time and memory overhead.

2605.12163 2026-05-14 cs.CV

Self-Consistent Latent Reasoning: Long Latent Sequence Reasoning for Vision-Language Model

Chenfeng Wang, Wei He, Xuhan Zhu, Chunpeng Zhou, Qizhen Li, Song Yan, Yufei Zheng, Chengjun Yu, Fan Lu, Wei Zhai, Yang Cao, Pengfei Yu, Zheng-Jun Zha

AI总结 本文研究了视觉-语言模型中长潜层序列推理的问题,发现现有方法在潜层序列变长时性能下降,原因在于信息增益崩溃和过度池化的图像嵌入缺乏有效信号。为此,作者提出了一种自洽潜层推理方法SCOLAR,通过引入轻量级解码器生成独立锚定于原始视觉空间的辅助视觉标记,并结合多阶段微调和强化学习,显著提升了潜层推理长度和模型性能,在多个真实场景基准上取得了最优结果。

Comments 17 pages, 6 figures

详情
英文摘要

In language reasoning, longer chains of thought consistently yield better performance, which naturally suggests that visual latent reasoning may likewise benefit from longer latent sequences. However, we discover a counterintuitive phenomenon: the performance of existing latent visual reasoning methods systematically degrades as the latent sequence grows longer. We reveal the root cause: Information Gain Collapse -- autoregressive generation makes each step highly dependent on prior outputs, so subsequent tokens can barely introduce new information. We further identify that heavily pooled ($\geq 128\times$) image embeddings used as supervision targets provide no more signal than meaningless placeholders. Motivated by these insights, we propose SCOLAR (Self-COnsistent LAtent Reasoning), which introduces a lightweight detransformer that leverages the LLM's full-sequence hidden states to generate auxiliary visual tokens in a single shot, with each token independently anchored to the original visual space. Combined with three-stage SFT and ALPO reinforcement learning, SCOLAR extends acceptable latent CoT length by over $30\times$, achieves state-of-the-art among open-source models on real-world reasoning benchmarks (+14.12% over backbone), and demonstrates strong out-of-distribution generalization.

2605.12145 2026-05-14 cs.CV

Cross-Modal-Domain Generalization Through Semantically Aligned Discrete Representations

Souptik Sen, Raneen Younis, Zahra Ahmadi

AI总结 该研究旨在解决多模态学习中跨模态泛化与模态特异性结构之间的平衡问题。提出了一种名为CoDAAR的新框架,通过语义对齐的离散表示,在统一的离散空间中同时保留各模态的独特结构并实现跨模态的泛化能力。该方法结合了离散时间对齐和级联语义对齐两种机制,通过自监督重建任务进行训练,在多个跨模态和跨领域基准测试中取得了最先进的性能。

Comments Added missing affiliation for co-author R. Younis and Z. Ahmadi

详情
英文摘要

Multimodal learning seeks to integrate information across diverse sensory sources, yet current approaches struggle to balance cross-modal generalizability with modality-specific structure. Continuous (implicit) methods preserve fine-grained priors but render generalization challenging, while discrete (explicit) approaches enforce shared prototypes at the expense of modality specificity. We introduce CoDAAR (Cross-modal Discrete Alignment And Reconstruction), a novel framework that resolves this long-standing trade-off by establishing semantic consensus across modality-specific codebooks through index-level alignment. This design uniquely allows CoDAAR to preserve modality-unique structures while achieving generalizable cross-modal representations within a unified discrete space. CoDAAR combines two complementary mechanisms: Discrete Temporal Alignment (DTA), which enables fine-grained temporal quantization, and Cascading Semantic Alignment (CSA), which promotes progressive cross-modal semantic agreement. Together, they establish a competition-free unified representation space. Trained with self-supervised reconstruction objectives on paired multimodal sequences, CoDAAR demonstrates robust cross-modal and cross-domain generalization. Across Cross-Modal Generalization benchmarks, including event classification, localization, video segmentation, and cross-dataset transfer, CoDAAR achieves state-of-the-art performance, establishing a new paradigm for discrete and generalizable multimodal representation learning.

2605.12119 2026-05-14 cs.CV cs.GR

MoCam: Unified Novel View Synthesis via Structured Denoising Dynamics

Haofeng Liu, Yang Zhou, Ziheng Wang, Zhengbo Xu, Zhan Peng, Jie Ma, Jun Liang, Shengfeng He, Jing Li

AI总结 本文提出了一种名为MoCam的统一新视角合成方法,旨在解决生成式新视角合成中几何先验与外观先验之间的矛盾。该方法通过结构化去噪动力学,在扩散过程中协调地从几何到外观逐步生成内容,先利用几何先验构建粗略结构,再借助外观先验修正几何误差并细化细节。实验表明,MoCam在点云存在严重缺失或扭曲的情况下表现尤为突出,实现了几何与外观的有效解耦与统一合成。

Comments Project page: https://orange-3dv-team.github.io/MoCam

详情
英文摘要

Generative novel view synthesis faces a fundamental dilemma: geometric priors provide spatial alignment but become sparse and inaccurate under view changes, while appearance priors offer visual fidelity but lack geometric correspondence. Existing methods either propagate geometric errors throughout generation or suffer from signal conflicts when fusing both statically. We introduce MoCam, which employs structured denoising dynamics to orchestrate a coordinated progression from geometry to appearance within the diffusion process. MoCam first leverages geometric priors in early stages to anchor coarse structures and tolerate their incompleteness, then switches to appearance priors in later stages to actively correct geometric errors and refine details. This design naturally unifies static and dynamic view synthesis by temporally decoupling geometric alignment and appearance refinement within the diffusion process. Experiments demonstrate that MoCam significantly outperforms prior methods, particularly when point clouds contain severe holes or distortions, achieving robust geometry-appearance disentanglement.

2605.11989 2026-05-14 cs.CV cs.AI

A Transfer Learning Evaluation of Deep Neural Networks for Image Classification

Nermeen Abou Baker, Nico Zengeler, Uwe Handmann

AI总结 本文研究了如何为图像分类任务选择最符合目标领域需求的预训练模型,探讨了迁移学习在深度神经网络中的应用效果。作者对十一类在ImageNet上预训练的模型进行了输出层和网络参数的调整,并将其应用于五个不同的目标数据集。通过评估准确率、准确密度、训练时间和模型大小等指标,比较了不同模型在单次和多次训练过程中的表现,为迁移学习中的模型选择提供了参考依据。

Comments Published by Machine Learning and Knowledge Extraction Journal

详情
Journal ref
Machine Learning and Knowledge Extraction 4, no. 1: 22-41 (2022)
英文摘要

Transfer learning is a machine learning technique that uses previously acquired knowledge from a source domain to enhance learning in a target domain by reusing learned weights. This technique is ubiquitous because of its great advantages in achieving high performance while saving training time, memory, and effort in network design. In this paper, we investigate how to select the best pre-trained model that meets the target domain requirements for image classification tasks. In our study, we refined the output layers and general network parameters to apply the knowledge of eleven image processing models, pre-trained on ImageNet, to five different target domain datasets. We measured the accuracy, accuracy density, training time, and model size to evaluate the pre-trained models both in training sessions in one episode and with ten episodes.

2605.11679 2026-05-14 cs.AI

Explaining and Breaking the Safety-Helpfulness Ceiling via Preference Dimensional Expansion

ShiYing Huang, Liang Lin, Yuer Li, Kaiwen Luo, Zhenhong Zhou, An Zhang, Junhao Dong, Kun Wang, Zhigang Zeng

AI总结 在多目标对齐的大型语言模型研究中,如何平衡不同的人类偏好常表现为零和冲突。本文提出一种新的视角,认为多目标之间的冲突源于提示本身对多维奖励的限制,并据此提出多目标奖励融合方法MORA,通过扩展奖励维度提升模型在有用性、安全性等多方面的表现。实验表明,MORA在顺序对齐和同时对齐任务中均取得了显著的性能提升。

详情
英文摘要

In the realm of multi-objective alignment for large language models, balancing disparate human preferences often manifests as a zero-sum conflict. Specifically, the intrinsic tension between competing goals dictates that aggressively optimizing for one metric (e.g., helpfulness) frequently incurs a substantial penalty on another (e.g., harmlessness). While prior work mainly focuses on data selection, parameter merging, or algorithmic balancing during training, these approaches merely force compromises between divergent preferences along a fixed Pareto frontier, failing to fundamentally resolve the inherent trade-off. In this work, we approach this problem from a novel perspective of multi-dimensional rewards. By scaling up the model's rollouts and analyzing the outputs across different reward dimensions, we arrive at a critical conclusion: the conflict among multiple objectives stems from the fact that the prompt itself inherently restricts the achievable multi-dimensional rewards. Based on this core observation, we propose MORA: Multi-Objective Reward Assimilation. Specifically, MORA isolates single-reward prompts through pre-sampling and expands their reward diversity by rewriting the original questions to incorporate multi-dimensional intents. Extensive experiments demonstrate that: (1) in sequential alignment, MORA achieves single-preference improvements ranging from 5% to 12.4%, with exceptional gains in harmlessness, after multiple-preference alignment across helpful, harmless, and truthful dimensions. (2) In simultaneous alignment, MORA achieves an average overall reward improvement of 4.6%. Our codes are available at https://github.com/Shiying-Huang/MORA-MPA.

2605.11572 2026-05-14 cs.CV

TB-AVA: Text as a Semantic Bridge for Audio-Visual Parameter Efficient Finetuning

Seongah Kim, Dinh Phu Tran, Hyeontaek Hwang, Saad Wazir, Duc Do Minh, Daeyoung Kim

AI总结 该研究提出了一种名为TB-AVA的参数高效微调框架,旨在解决音频-视觉对齐中的语义对应难题。通过引入文本作为语义桥梁,TB-AVA在冻结的音频和视觉编码器基础上,利用文本引导的语义调制模块实现跨模态特征的交互与对齐。实验表明,该方法在多个基准数据集上取得了最先进的性能,验证了文本作为语义锚点在音频-视觉学习中的有效性。

Comments 12 pages, 6 figures

详情
英文摘要

Audio-visual understanding requires effective alignment between heterogeneous modalities, yet cross-modal correspondence remains challenging when temporally aligned audio and visual signals lack clear semantic correspondence. We propose to use text as a semantic anchor for audio-visual representation learning. To this end, we introduce a parameter-efficient adaptation framework built on frozen audio and visual encoders, centered on Text-Bridged Audio-Visual Adapter (TB-AVA), which enables text-mediated interaction between audio and visual streams. At the core of TB-AVA, Gated Semantic Modulation (GSM) selectively modulates feature channels based on text-inferred semantic relevance. We evaluate the proposed approach on multiple benchmarks, including AVE, AVS, and AVVP, where the proposed framework achieves state-of-the-art performance, demonstrating text as an effective semantic anchor for parameter-efficient fine-tuning (PEFT) in audio-visual learning.

2605.11533 2026-05-14 cs.CL cs.CV

Checkup2Action: A Multimodal Clinical Check-up Report Dataset for Patient-Oriented Action Card Generation

Sike Xiang, Shuang Chen, Kevin Qinghong Lin, Jialin Yu, Yijia Sun, Philip Torr, Amir Atapour-Abarghouei

AI总结 该研究提出了一个名为 Checkup2Action 的多模态临床体检报告数据集,用于生成面向患者的行动建议卡片。该数据集包含2000份去标识化的实际体检报告,涵盖人口统计、体格检查、实验室检测、心血管评估和影像学证据等信息,每个行动卡片包含临床问题、优先级、推荐科室、随访时间、患者解释及问题等结构化内容。研究将体检报告到行动建议的生成任务定义为约束结构化生成问题,并引入了涵盖覆盖度、优先级一致性、部门与时间推荐准确性等多维度的评估协议,为评估模型在临床报告上的患者导向推理能力提供了新的基准。

详情
英文摘要

Clinical check-up reports are multimodal documents that combine page layouts, tables, numerical biomarkers, abnormality flags, imaging findings, and domain-specific terminology. Such heterogeneous evidence is difficult for laypersons to interpret and translate into concrete follow-up actions. Although large language models show promise in medical summarisation and triage support, their ability to generate safe, prioritised, and patient-oriented actions from multimodal check-up reports remains under-benchmarked. We present \textbf{Checkup2Action}, a multimodal clinical check-up report dataset and benchmark for structured \textit{Action Card} generation. Each card describes one clinically relevant issue and specifies its priority, recommended department, follow-up time window, patient-facing explanation, and questions for clinicians, while avoiding diagnostic or treatment-prescriptive claims. The dataset contains 2,000 de-identified real-world check-up reports covering demographic information, physical examinations, laboratory tests, cardiovascular assessments, and imaging-related evidence. We formulate checkup-to-action generation as a constrained structured generation task and introduce an evaluation protocol covering issue coverage and precision, priority consistency, department and time recommendation accuracy, action complexity, usefulness, readability, and safety compliance. Experiments with general-purpose and medical large language models reveal clear trade-offs between issue coverage, action correctness, conciseness, and safety alignment. Checkup2Action provides a new multimodal benchmark for evaluating patient-oriented reasoning over clinical check-up reports.

2605.11505 2026-05-14 cs.AI

Selective Off-Policy Reference Tuning with Plan Guidance

Duc Anh Le, Tien-Phat Nguyen, Thien Huu Nguyen, Linh Ngo Van, Trung Le

AI总结 本文研究了如何在强化学习中利用可验证奖励进行推理,并针对GRPO类方法在处理困难提示时效果不佳的问题,提出了一种名为SORT的新方法。该方法通过引入计划引导机制,在不改变策略生成过程的前提下,利用参考解生成计划,并据此调整策略更新的权重,从而提升模型对结构化信息的学习能力。实验表明,SORT在多个推理基准测试中优于现有方法,尤其在较弱模型上表现突出。

详情
英文摘要

Reinforcement learning with verifiable rewards helps reasoning, but GRPO-style methods stall on hard prompts where all sampled rollouts fail. SORT adds a repair update for those failures without changing rollout generation: it derives a plan from the reference solution, compares token probabilities with and without that plan, and gives higher weight to tokens that become more predictable under plan conditioning. This turns all-wrong prompts into selective, structure-aware learning signals instead of uniform imitation. Across three backbones and eight reasoning benchmarks, SORT improves over GRPO and guidance baselines, with largest gains on weaker models.