arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.04823 2026-06-04 cs.AI cs.CL cs.MA

R-APS: Compositional Reasoning and In-Context Meta-Learning for Constrained Design via Reflective Adversarial Pareto Search

R-APS:基于反思性对抗帕累托搜索的组合推理与上下文元学习用于约束设计

João Pedro Gandarela, Thiago Rios, Stefan Menzel, André Freitas

AI总结 提出R-APS方法,通过推理模式分解、分阶段组合推理、敏感性引导对抗测试和元归纳规则提取,联合解决LLM在代理设置中的错误传播、最坏情况扰动和知识失效问题,在平面机构合成任务上实现更紧的鲁棒性证书和更快的迭代速度。

详情
AI中文摘要

大型语言模型(LLM)在开放式任务上表现流畅,但在需要规划、使用工具和长时间行动的代理设置中,流畅性并不能保证可靠交付。我们将这一差距归因于三个耦合的结构性失败:错误传播而不定位、最坏情况扰动未评估、积累的知识从未失效。我们认为这些失败有一个共同根源:溯因、反事实、元归纳、纠正和归纳推理将共享上下文拉向不相容的方向。我们提出反思性对抗帕累托搜索(R-APS),据我们所知,这是第一种通过推理模式分解联合解决所有三个失败的方法,为每种推理模式分配其自己的上下文,并在三个时间尺度上协调交互:带有类型化验证批评者的分阶段组合推理(失败定位)、作为第一类帕累托目标的敏感性引导反事实压力测试(鲁棒性)、以及带有显式失效的元归纳规则提取(持久记忆)。R-APS无需微调,仅通过结构化协议设计在冻结的LLM上运行。我们在平面机构综合(机器人、假肢、机械设计)上评估,每个候选解由运动学求解器检查。在32个目标轨迹上,R-APS提供的鲁棒性证书比均匀扰动基线紧3.5倍,首次接纳迭代速度提高46%,Chamfer距离比Enum+GA减少2.1倍,同时联合控制杆数和最坏情况鲁棒性。小型4B推理专用模型在协议内与通用70B骨干模型竞争,表明结构化协议可以部分抵消模型规模。

英文摘要

Large language models (LLMs) are fluent on open-ended tasks, yet in agentic settings, where a system must plan, use tools, and act over extended horizons, fluency does not ensure reliable delivery. We trace this gap to three coupled structural failures: errors propagate without localization, worst-case perturbations go unevaluated, and accumulated knowledge is never invalidated. We argue these share a root cause: abductive, counterfactual, meta-inductive, corrective, and inductive reasoning pull a shared context in incompatible directions. We introduce Reflective Adversarial Pareto Search (R-APS), to our knowledge the first method addressing all three failures jointly via reasoning-mode decomposition, allocating each reasoning mode its own context and orchestrating interaction across three timescales: staged compositional reasoning with a typed validation critic (failure localization), sensitivity-guided counterfactual stress-testing as a first-class Pareto objective (robustness), and meta-inductive rule extraction with explicit invalidation (persistent memory). R-APS requires no fine-tuning and operates on a frozen LLM purely via structured protocol design. We evaluate on planar mechanism synthesis (robotics, prosthetics, mechanical design), with every candidate checked by a kinematic solver. On 32 target trajectories, R-APS delivers robustness certificates 3.5x tighter than uniform-perturbation baselines, 46% faster iterations-to-first-admission, and 2.1x Chamfer-distance reduction over Enum+GA while jointly controlling bar-count and worst-case robustness. Small 4B reasoning-specialized models prove competitive with general-purpose 70B backbones inside the protocol, suggesting structured protocols can partially offset model scale.

2606.04822 2026-06-04 cs.LG

Reconciling Causality and Non-Equilibrium Thermodynamics with Hamiltonian Causal Models

用哈密顿因果模型调和因果关系与非平衡热力学

Dario Rancati, Max Welling, Francesco Locatello

AI总结 提出哈密顿因果模型(HCMs),通过分离不可变运动方程与可干预机制,定义路径级因果效应,并与非平衡热力学自然接口,利用熵产生量化因果效应。

详情
AI中文摘要

物理时间现象的因果建模必须处理沿轨迹的干预、非平稳诱导律、路径依赖效应以及由动力学介导的反馈,这些在标准因果模型中都具有挑战性。我们引入了哈密顿因果模型(HCMs),这是一个轨迹级框架,其中观测变量与局部环境相互作用,干预作为哈密顿机制的控制。HCMs将不可变的运动方程与可干预机制分离,并将因果效应定义为干预路径律之间的差异。HCMs的一个关键动机是它们与非平衡热力学的自然接口。熵产生量化了过程的不可逆性,是一个核心因果可观测量:它可以从数据中估计,并见证系统演化过程中标准平均处理效应的端点和累积版本所不可见的因果效应。如同物理学中,原因和结果不是两个随机变量之间关系的原始概念,而是源于热力学箭头的不可逆性。因此,我们的论文调和了统计因果模型和非平稳热力学的语言,为描述广泛物理系统中的因果关系提供了新工具。

英文摘要

Causal modeling of physical temporal phenomena must handle interventions that act along trajectories, nonstationary induced laws, path-dependent effects, and feedback mediated by dynamics, all challenging in standard causal models. We introduce Hamiltonian Causal Models (HCMs), a trajectory-level framework in which observed variables interact with local environments and interventions act as controls of Hamiltonian mechanisms. HCMs separate immutable equations of motion from intervenable mechanisms and define causal effects as discrepancies between interventional path laws. A key motivation for HCMs is their natural interface with non-equilibrium thermodynamics. Entropy production quantifies the irreversibility of a process and is a central causal observable: it is estimable from data and witnesses causal effects along the system's evolution that are invisible to endpoint and cumulative versions of the standard average treatment effect. As in physics, cause and effect are not primitives of the relation between two random variables but arise from the non-invertibility of the thermodynamic arrow. With this, our paper reconciles the language of statistical causal models and non-stationary thermodynamics, offering new tools to describe causality in a wide range of physical systems.

2606.04820 2026-06-04 cs.CV cs.AI cs.LG

OA-CutMix: Correcting the Label Bias of CutMix

OA-CutMix:纠正CutMix的标签偏差

Tobias Christian Nauen, Stanislav Frolov, Federico Raue, Brian B. Moser, Andreas Dengel

AI总结 针对CutMix中标签分配基于区域面积导致语义偏差的问题,提出OA-CutMix,利用分割掩码根据可见目标面积分配标签,在不改变图像混合过程的情况下提升分类准确率。

详情
AI中文摘要

CutMix已成为事实上的标准混合增强方法,但其标签分配基于一个有缺陷的假设:粘贴补丁的面积忠实地反映了其对混合图像的语义贡献。然而,在实践中,补丁经常落在背景区域,将标签信用分配给其目标不可见的类别。CutMix标签与语义目标面积的平均差异为21.5%。在17%的样本中,一张图像贡献了零个可见目标像素,却获得了非零的标签权重。我们提出目标感知CutMix(OA-CutMix),通过用从预计算分割掩码中导出的权重替换基于面积的CutMix权重来纠正这种偏差,根据每个图像贡献给混合图像的可见目标面积比例分配标签。图像混合过程完全保持不变。我们在4种架构和6个数据集上评估了OA-CutMix与10多种静态和动态混合方法的性能。OA-CutMix在所有任务中始终达到最高准确率,甚至优于动态混合方法,但训练时间成本仅为其一小部分。对于小目标,改进最大,因为CutMix的标签偏差最大。因此,纠正标签足以匹配或超过修改图像混合算法的方法的性能。

英文摘要

CutMix has become the de facto standard mixing augmentation, yet its label assignment rests on a flawed assumption: The area of the pasted patch faithfully reflects its semantic contribution to the mixed image. In practice, however, patches frequently land on background regions, assigning label credit to classes whose objects are not visible. The mean discrepancy of the CutMix label and the semantic object area is $21.5\%$. In $17\%$ of samples an image contributes zero visible object pixels yet receives nonzero label weight. We propose Object-Aware CutMix (OA-CutMix), which corrects this bias by replacing the area-based CutMix weight with one derived from precomputed segmentation masks, assigning labels in proportion to the visible object area each image contributes to the mix. The image mixing procedure is left entirely unchanged. We evaluate OA-CutMix against 10+ static and dynamic mixing methods across 4 architectures and 6 datasets. OA-CutMix consistently achieves the highest accuracy over all tasks, outperforming even dynamic mixing methods, but at a fraction of the training-time cost. Improvements are largest for small objects, where the label bias from CutMix is greatest. Thus, correcting the label is sufficient to match or exceed the performance of methods modifying the image mixing algorithm.

2606.04818 2026-06-04 cs.RO

Real-World Deployment of a 5G-Connected Edge-Controlled Aerial Robot in Industrial Subterranean Mines

工业地下矿井中5G连接边缘控制空中机器人的实际部署

Achilleas Santi Seisa, Emanuele Pagliari, Gerasimos Damigos, Elias Small, George Nikolakopoulos

AI总结 本文首次在实际工业地下矿井中部署了由边缘卸载控制器控制的5G连接自主飞行空中机器人,采用模型预测控制器(MPC)生成平滑无碰撞路径,展示了边缘控制机器人系统在时间关键、安全高效未来部署中的潜力。

详情
Comments
6 pages, 8 figures, MED 2026
AI中文摘要

本文介绍了首次由边缘卸载控制器控制的5G连接空中机器人的实际自主飞行,旨在弥合受控设置与实际设置之间的差距。该机器人在一个活跃的工业地下矿井中运行,而高层控制器部署在附近的基于Kubernetes的边缘集群中。机器人与边缘之间的通信通过5G新无线电(NR)独立组网(SA)网络实现。所选的控制器是模型预测控制器(MPC),它生成控制动作,使机器人能够在采矿环境中无缝导航。人类操作员为空中机器人选择航点,MPC生成平滑、无碰撞的路径以自主执行。所提出的基于5G边缘的闭环系统在实际工业环境中进行了评估,展示了边缘控制机器人系统在时间关键、安全高效的未来部署中的潜力。

英文摘要

This article presents the first real-world autonomous flight of a 5G-connected aerial robot controlled by an edge-offloaded controller, and aims to bridge the gap between controlled and factual setups. The robot operates within an active industrial subterranean mine, while the high-level controller is deployed in a nearby Kubernetes-based edge cluster. Communication between the robot and the edge is enabled via a 5G New Radio (NR) Standalone (SA) network. The chosen controller is a Model Predictive Controller (MPC), which generates control actions to allow the robot to navigate seamlessly through the mining environment. A human operator selects waypoints for the aerial robot, and the MPC generates smooth, collision-free paths for autonomous executions. The proposed 5G edge-based closed-loop system is evaluated in a real industrial setting and demonstrates the potential of edge-controlled robotic systems toward time-critical, safe and efficient future deployments.

2606.04816 2026-06-04 cs.AI cs.LG

Beyond Objective Equivalence: Constraint Injection for LLM-Based Optimization Modeling on Vehicle Routing Problems

超越目标等价性:基于LLM的车辆路径问题优化建模的约束注入

Xizi Luo, Changhong He, Dongdong Geng, Chenggong Shi, Yu Mei

AI总结 针对LLM在约束密集的运筹问题中可能添加虚假约束或遗漏必要约束的问题,提出约束注入方法,结合差分测试形成双重验证器,并在车辆路径问题上验证其有效性。

详情
Comments
28 pages
AI中文摘要

大型语言模型(LLM)越来越多地将自然语言优化问题转化为可执行的求解器代码。然而,对于约束密集的运筹学(OR)问题,现有的数据过滤和训练流程主要依赖于目标等价性信号,如差分测试和答案一致性,这些信号允许程序在测试实例上添加虚假约束或静默省略必要约束,只要这些约束在测试实例上非绑定。我们提出约束注入,利用可行探针暴露虚假过度约束,利用单约束违反探针揭示静默约束遗漏。结合差分测试,它形成一个双重验证器。我们在车辆路径问题(VRPs)上实例化并评估该方法,VRPs是代表性的约束密集组合优化测试平台,具有耦合的操作约束。我们开发了VRPCoder,一个8B端到端模型,将自然语言VRP场景转化为Gurobi脚本,并附带一个专家验证的VRP基准套件,涵盖21种变体。该验证器在数据合成期间用作拒绝采样过滤器,在组相对策略优化(GRPO)中用作每次rollout的奖励。在四个VRP基准上,VRPCoder-GRPO达到93%的平均Pass@1,在三个基准上优于Gemini-3.1-Pro Preview,超过Claude-Sonnet-4.5平均28个百分点,并超过先前的OR-LLM平均78个百分点。

英文摘要

Large language models (LLMs) increasingly translate natural-language optimization problems into executable solver code. Yet for constraint-dense operations research (OR) problems, existing data-filtering and training pipelines largely rely on objective-equivalence signals such as differential testing and answer agreement, which a program can pass while adding spurious constraints or silently omitting required ones, whenever those constraints are non-binding on the tested instance. We propose constraint injection, which uses feasible probes to expose spurious over-constraint and one-constraint-violating probes to reveal silent constraint omission. Combined with differential testing, it forms a dual verifier. We instantiate and evaluate it on vehicle routing problems (VRPs), a representative constraint-dense combinatorial optimization testbed with coupled operational constraints. We develop VRPCoder, an 8B end-to-end model that translates natural-language VRP scenarios into Gurobi scripts, together with an expert-verified VRP benchmark suite covering 21 variants. The verifier is reused as a rejection-sampling filter during data synthesis and as a per-rollout reward in group relative policy optimization (GRPO). Across four VRP benchmarks, VRPCoder-GRPO reaches 93\% average Pass@1, outperforms Gemini-3.1-Pro Preview on three benchmarks, exceeds Claude-Sonnet-4.5 by 28 average points, and surpasses prior OR-LLMs by 78 average points.

2606.04815 2026-06-04 cs.LG cs.AI

Learning While Acting: A Skill-Enhanced Test-Time Co-Evolution Framework for Online Lifelong Learning Agents

边行动边学习:面向在线终身学习智能体的技能增强测试时协同进化框架

Bo Mao, Jie Zhou, Yutao Yang, Xin Li, Xian Wei, Qin Chen, Xingjiao Wu, Liang He

AI总结 提出LifeSkill框架,通过验证器引导的技能学习和在线技能内化,使LLM智能体在测试时持续内化反馈,提升终身学习性能。

详情
AI中文摘要

终身学习对于在动态、交互环境中运行的大型语言模型(LLM)智能体至关重要。然而,现有的用于长时任务的终身学习智能体通常依赖于离散技能或过去经验检索,并在推理期间使用静态参数,这阻止了它们像人类学习者一样持续内化测试时反馈。为弥补这一差距,我们提出了技能增强测试时协同进化(LifeSkill),一个用于在线终身学习智能体的两阶段强化学习框架。具体来说,我们设计了验证器引导的技能学习,通过根据多个技能条件策略滚动的平均验证器成功率奖励候选技能,解决了技能提取缺乏直接监督的问题,鼓励模型生成对解决任务有用的技能,而不仅仅是文本上合理的技能。此外,我们引入了在线技能内化,通过在测试时交互期间将技能条件轨迹转化为奖励信号,持续改进策略模型。这使得智能体能够将推理能力直接内化到其参数中,避免了经验检索的上下文膨胀。在LifelongAgentBench上的实验表明,与现有终身学习智能体基线相比,LifeSkill将平均性能提高了7个绝对百分点。

英文摘要

Lifelong learning is essential for Large Language Model (LLM) agents operating in dynamic, interactive environments. However, existing lifelong learning agents for long-horizon tasks typically depend on discrete skill or past experiences retrieval with static parameters during inference, which prevents them from continuously internalizing test-time feedback like human learners. To bridge this gap, we propose Skill-enhanced Test-Time Co-Evolution (\texttt{LifeSkill}), a two-stage reinforcement learning framework for Online Lifelong Learning Agents. Specifically, we design Verifier-Guided Skill Learning that addresses the lack of direct supervision for skill extraction by rewarding candidate skills according to the average verifier success of multiple skill-conditioned policy rollouts, encouraging the model to generate skills that are useful for solving tasks rather than merely plausible in text. Furthermore, we introduce Online Skill Internalization, which continuously improves the policy model during test-time interaction by transforming skill-conditioned trajectories into reward signals. This enables the agent to directly internalize reasoning capabilities into its parameters, avoiding the context bloat of experience retrieval. Experiments on LifelongAgentBench show that LifeSkill improves average performance by 7 absolute points by comparing with existing lifelong agent baselines.

2606.04807 2026-06-04 cs.AI cs.CL cs.CY cs.LG

BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization

BiasGRPO:通过组相对策略优化在高方差奖励景观中稳定偏差缓解

Saket Reddy, Ke Yang, ChengXiang Zhai

AI总结 提出BiasGRPO框架,利用组相对策略优化(GRPO)通过归一化组内奖励来稳定大语言模型的社会偏差缓解,优于DPO和PPO。

详情
Comments
Accepted to Findings of the ACL
AI中文摘要

缓解大语言模型(LLMs)中的社会偏差提出了一个独特的对齐挑战:与可验证任务不同,偏差缺乏单一的真实标准,从而产生高方差、主观的奖励景观。先前的基于偏好的微调方法存在主要权衡:直接偏好优化(DPO)受限于离线训练中缺乏探索,而近端策略优化(PPO)由于潜在不可靠的评论家估计可能导致训练不稳定。在本文中,我们提出了BiasGRPO,一个使用组相对策略优化(GRPO)的框架,通过对一组采样完成进行奖励归一化来稳定对齐。通过用组相对基线替代价值函数,我们的方法在保持在线训练探索优势的同时减少了不稳定性。我们发现BiasGRPO在多个基准测试中优于DPO和PPO,表明其有效性。为了适应GRPO,我们综合扩展了一个涵盖多个领域和上下文的数据集。我们还创建并发布了一个定制的偏差奖励模型,该模型在有效指导生成的同时高度计算高效且避免知识退化,提供了一个可无缝集成到多目标RLHF流程中的宝贵资源。

英文摘要

Mitigating social bias in Large Language Models (LLMs) presents a distinct alignment challenge: unlike verifiable tasks, bias lacks a single ground truth, creating a high-variance, subjective reward landscape. Previous preference-based fine-tuning methods have major trade-offs: Direct Preference Optimization (DPO) is limited by the lack of exploration inherent in offline training, while Proximal Policy Optimization (PPO) can lead to training instability due to potentially unreliable critic estimates. In this paper, we propose BiasGRPO, a framework using Group Relative Policy Optimization (GRPO) to stabilize alignment by normalizing rewards across a group of sampled completions. By substituting the value function with a group-relative baseline, our approach reduces instability while maintaining the exploration benefits of online training. We find that BiasGRPO outperforms DPO and PPO across multiple benchmarks, indicating its effectiveness. To adapt GRPO, we synthetically extend a dataset spanning multiple domains and contexts. We also create and release a custom bias reward model that effectively guides generation while being highly compute-efficient and avoiding knowledge degradation, providing a valuable resource that can be seamlessly integrated into multi-objective RLHF pipelines.

2606.04806 2026-06-04 cs.CV cs.AI

NoRA: Evaluating Grounded Reasonableness in Visual First-person Normative Action Reasoning

NoRA: 评估视觉第一人称规范性动作推理中的基于事实的合理性

Sichao Li, Sai Ma, Daniel Kilov, Secil Yanik Guyot, Zhuang Li, Seth Lazar

AI总结 提出NoRA基准,通过事实-理由-动作支持图评估多模态模型生成合理动作并基于可见事实进行推理的能力,发现当前VLM在构建完整动作空间和绑定正确支持方面存在不足。

详情
AI中文摘要

LLM和智能系统越来越多地部署在社交环境中,使得规范能力对安全和适当行为至关重要。然而,现有方法要么仅在文本中评估规范性判断,要么将其简化为从固定候选动作集中选择。我们认为两者都不够。在实践中,智能体永远不会获得一个选项菜单;它们必须从头识别一个合理的动作,基于可见事实并由可检查的理由支持。我们引入了NoRA,一个视觉第一人称视频基准,要求模型生成候选的下一个动作,并通过显式的事实-理由-动作支持图来证明每个动作。该基准包含1,420个带注释的视频片段,包括HumanGold-190和LLMSilver-1230分割。每个实例通过动作对齐、事实基础和支持绑定进行评估,汇总为单一的基于事实的合理性分数。我们在直接、深思熟虑和结构化提示模式下对12个多模态系统进行了基准测试,发现当前的VLM经常能恢复合理的动作和相关的场景事实,但始终难以构建完整的合理动作空间并将所选动作绑定到正确的局部支持上。NoRA使这一差距可测量,将评估问题从模型是否能选择一个动作转变为是否能基于正确的可见理由证明一个适当的动作。

英文摘要

LLMs and agentic systems are increasingly deployed in social environments, making normative competence critical for safe and appropriate behavior. However, existing approaches either assess normative judgment in text alone or reduce it to choosing among a fixed set of candidate actions. We argue both are insufficient. In practice, agents are never handed a menu of options; they must identify a reasonable action from scratch, grounded in visible facts and supported by inspectable reasons. We introduce NoRA, a visual first-person video benchmark that requires models to generate candidate next actions and justify each through an explicit fact-reason-action support graph. The benchmark comprises 1,420 annotated video clips, including HumanGold-190 and LLMSilver-1230 splits. Each instance is evaluated through action alignment, factual grounding, and support binding, aggregated into a single grounded reasonableness score. We benchmark 12 multimodal systems under direct, deliberate, and structured prompting regimes, finding that current VLMs frequently recover plausible actions and relevant scene facts, but consistently struggle to construct the full reasonable action space and bind selected actions to the correct local support. NoRA makes this gap measurable, shifting the evaluation question from whether a model can pick an action to whether it can justify an appropriate action for the right visible reasons.

2606.04801 2026-06-04 cs.CV

Fast Cubical Persistent Homology on 2D and 3D Images via Union-Find, Pruning, and Lookup Tables

基于并查集、剪枝和查找表的2D和3D图像快速立方体持久同调

Titouan Le Breton, Karol Szustakowski, Marie Piraud

AI总结 提出Flash Cubical方法,通过并查集、边剪枝和查找表技术,高效计算2D和3D图像在V-过滤下的立方体持久性,在时间和内存上达到最优。

详情
AI中文摘要

我们提出Flash Cubical,一种在$\mathbb{F}_2$上对2D和3D图像的V-过滤进行立方体持久性高效计算的方法。该实现基于三个核心思想。首先,立方体复形满足某些性质,允许通过并查集和对偶性计算最高维度的持久性。其次,对某些边进行剪枝可以实现快速高效的并查集。第三,使用查找表,利用立方体复形的规律性预计算局部信息,避免运行时计算局部信息。据我们所知,这是在V-过滤下最有效的立方体持久性实现,无论在时间还是内存成本上。尽管本文关注V-过滤立方体复形的持久性,但基本思想自然推广到立方体复形的T-过滤,并为其他复形提供了有希望的方向。

英文摘要

We present Flash Cubical, a highly efficient computation of cubical persistence on a V-filtration for 2D and 3D images over $\mathbb{F}_2$. The implementation is built around three core ideas. First, cubical complexes satisfy properties that allow for the computation of persistence of the highest dimension via union-find and duality. Second, pruning of certain edges allows for a fast and efficient implementation of union-find. Third, the use of a lookup table, which exploits the regularity of cubical complexes to pre-compute local information. This avoids the need to compute local information at run time. To the best of our knowledge, this is the most efficient implementation of cubical persistence with a V-filtration, both in terms of time and memory costs. Although the paper focuses on persistence for V-filtration cubical complexes, the underlying ideas generalise naturally to T-filtrations on cubical complexes and suggest promising directions for other complexes.

2606.04798 2026-06-04 cs.LG

Uncertainty-Aware (Un)Supervised Few-Shot User Adaptation for On-Device Personalized Human Activity Recognition

不确定性感知的(无)监督少样本用户自适应用于设备上个性化人类活动识别

Maximilian Burzer, Till Riedel, Michael Beigl, Tobias Röddiger

AI总结 提出一种无梯度框架,通过贝叶斯原型估计实现监督/无监督少样本用户自适应,仅需每活动3秒校准数据即可显著提升设备上HAR模型性能。

详情
Comments
6 pages, 4 figures, 2 tables, 2 algorithms
AI中文摘要

基于传感器的人类活动识别(HAR)模型通常因个体运动模式和传感器放置导致的域偏移而在未见用户上性能下降。因此,实用的可穿戴HAR系统需要轻量级的个性化方法,这些方法应适用于校准数据有标签、无标签或不可用的情况,并在有限校准下具有鲁棒性。我们提出一个无梯度框架,将预训练的HAR分类器重新用作原型网络,利用先验原型保持零样本性能并规范自适应。对于有标签校准,我们引入闭式贝叶斯原型估计,并将相同原理扩展到无标签校准。仅使用每活动3秒的校准数据(一次样本),监督自适应在四个数据集上将宏F1提高了+2.76至+33.44个百分点,而无监督自适应提高了+0.56至+32.13个百分点。由于自适应仅需要闭式原型更新,该框架能够实现现有HAR分类器的高效且鲁棒的设备上个性化。

英文摘要

Sensor-based Human Activity Recognition (HAR) models often degrade on unseen users due to domain shifts caused by individual movement patterns and sensor placement. Practical wearable HAR systems therefore require personalization methods that are lightweight, applicable whether calibration data is labeled, unlabeled, or unavailable, and robust under limited calibration. We present a gradient-free framework that repurposes pretrained HAR classifiers as Prototypical Networks using using prior prototypes, which preserve zero-shot performance and regularize adaptation. For labeled calibration, we introduce closed-form Bayesian prototype estimation and extend the same principle to unlabeled calibration. With only 3 seconds of calibration data per activity (one shot), supervised adaptation improves macro-F1 by +2.76 to +33.44 percentage points across four datasets, while unsupervised adaptation improves by +0.56 to +32.13 points. Since adaptation requires only closed-form prototype updates, the framework enables efficient and robust on-device personalization of preexisting HAR classifiers.

2606.04797 2026-06-04 cs.CV cs.LG

Crafting Your Evolving Dreams: Concept-Incremental Versatile Customization

打造你不断演变的梦想:概念增量式多功能定制

Jiahua Dong, Wenqi Liang, Hongliu Li, Yang Cong, Duzhen Zhang, Hanbin Zhao, Henghui Ding, Yulun Zhang, Salman Khan, Fahad Shahbaz Khan

AI总结 提出持续可定制扩散模型(CCDM),通过属性解耦LoRA模块和相关性引导聚合策略解决灾难性遗忘,并结合可控区域上下文合成策略处理概念忽视,实现概念增量式多功能定制。

详情
Comments
Accepted to Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
AI中文摘要

定制扩散模型(CDMs)因其生成个性化概念的卓越能力而引起了广泛关注。然而,大多数CDMs不切实际地假设用户的个性化概念集合是静态的,无法随时间增长。此外,在增量学习一系列新概念时,它们对先前学习的概念表现出显著的灾难性遗忘和概念忽视。为了解决上述挑战,我们开发了一种新颖的持续可定制扩散模型(CCDM),使用户能够进行概念增量式多功能定制。具体来说,我们设计了一个属性解耦LoRA(AD-LoRA)模块和一个相关性引导的AD-LoRA聚合策略,以缓解灾难性遗忘。它们可以保留每个任务的概念特定属性,并利用有益的任务间相关性来增强新定制任务的持续学习。此外,为了解决概念忽视的挑战,我们提出了一种可控区域上下文合成策略,该策略根据用户提供的条件进行多概念合成。该策略通过保证用户定义区域之间的语义独立性及其平滑边界过渡,增强了多概念合成的整体一致性。实验表明,我们的CCDM在基线方法上表现出显著改进。

英文摘要

Custom diffusion models (CDMs) have garnered significant interest owing to their remarkable capacity for generating personalized concepts. However, the majority of CDMs unrealistically presume that the user's collection of personalized concepts is static and incapable of incremental growth over time. Furthermore, they exhibit significant catastrophic forgetting and concept neglect of previously learned concepts when incrementally learning a sequence of new ones. To resolve the above challenges, we develop a novel Continually Customizable Diffusion Model (CCDM), enabling users to perform concept-incremental versatile customization. Specifically, we design an attribute-decoupled LoRA (AD-LoRA) module and a relevance-guided AD-LoRA aggregation strategy to mitigate catastrophic forgetting. They can preserve concept-specific attributes of each task and leverage beneficial inter-task correlations to enhance the continual learning of new customization tasks. Additionally, to address the challenge of concept neglect, we propose a controllable regional context synthesis strategy that performs multi-concept composition in alignment with user-provided conditions. This strategy enhances the overall consistency in multi-concept synthesis by guaranteeing semantic independence between user-defined regions and their smooth boundary transitions. Experiments show our CCDM exhibits significant improvements over baseline methods.

2606.04792 2026-06-04 cs.CV

A Pathology Foundation Model for Gastric Cancer with Real-World Validation

用于胃癌的病理基础模型及真实世界验证

Ling Liang, Jiabo Ma, Zhengyu Zhang, Fengtao Zhou, Yingxue Xu, Yihui Wang, Cheng Jin, Zhengrui Guo, On Ki Tang, Zhijian Cen, Zhen Wang, Qi Xie, Chengyu Lu, Chenglong Zhao, Feifei Wang, Yu Cai, Hongyi Wang, Jing Zhang, Yaping Ye, Shijun Sun, Shenglei Li, Yu Wang, Zhenhui Li, Ronald Cheong Kin Chan, Xiuming Zhang, Zhe Wang, Hao Chen, Li Liang

AI总结 提出胃癌专用基础模型GRACE,基于多中心HE染色全切片图像,在28项临床任务中优于通用PFM,并通过前瞻性验证和读者研究证实其辅助诊断效能。

详情
AI中文摘要

胃癌仍然是癌症死亡的主要原因,但其组织学和分子异质性使诊断和风险分层复杂化。通用病理基础模型(PFM)在胃癌诊疗的关键细粒度终点上往往表现停滞,且很少有模型经过严格的前瞻性验证或临床读者研究。我们提出了GRACE,一个用于真实世界评估和临床决策支持的胃癌专用基础模型。GRACE基于来自37,493名患者的多中心胃癌病理数据集(共48,364张HE染色全切片图像)开发。在28项临床相关任务评估中,GRACE持续优于代表性泛癌PFM,达到宏观AUC 0.9188,在癌前病变诊断(宏观AUC 0.9322)、肿瘤组织病理评估(宏观AUC 0.9119)、分子分型(宏观AUC 0.8682)和预后预测方面表现强劲。除基准测试外,GRACE的转化价值通过严格的证据链得到证实。在安全门控标准(排除要求100%阴性预测值,纳入要求100%阳性预测值)下,GRACE简化了高达69.6%的恶性诊断病例的审查,并分流了46.8%的MMR-IHC随访请求。这种转化可行性通过病理学家-AI协作的随机交叉读者研究得到进一步加强。在GRACE辅助下,诊断准确率从82.0%提高到89.9%,正确诊断的校正优势比提高近两倍(OR 1.987),同时敏感性和特异性也得到提升。AI辅助还使诊断时间减少14.9%,诊断信心提高9.0%,并显著改善评估者间一致性。当校准至不劣于高级病理医生时,AI辅助工作流可分流60.7%的萎缩病例和82.7%的肠化生病例。

英文摘要

Gastric cancer remains a major cause of cancer mortality, yet its histological and molecular heterogeneity complicates diagnosis and risk stratification. General-purpose pathology foundation models (PFMs) often plateau on fine-grained endpoints central to gastric cancer care, and few have undergone rigorous prospective validation or clinical reader studies. We present GRACE, a Gastric-specific foundation model for Real-world Assessment and Clinical dEcision support. GRACE was developed from multicenter gastric pathology datasets totaling 48,364 primarily HE-stained whole-slide images from 37,493 patients. When evaluated on 28 clinically relevant tasks, GRACE consistently outperformed representative pancancer PFMs, achieving a macro-AUC of 0.9188, with strong performance for precancerous lesion diagnosis (macro-AUC 0.9322), tumor histopathological assessment (macro-AUC 0.9119), molecular profiling (macro-AUC 0.8682), and prognostic prediction. Beyond benchmarking, GRACE's translational value was substantiated through a rigorous evidence chain. Under safety-gated criteria requiring 100% NPV for rule-out and 100% PPV for rule-in, GRACE streamlined review for up to 69.6% of malignancy-diagnosis cases and triaged 46.8% of MMR-IHC follow-up requests. This translational feasibility was further strengthened by a randomized crossover reader study of pathologist-AI collaboration. With GRACE assistance, diagnostic accuracy improved from 82.0% to 89.9%, yielding nearly twofold higher adjusted odds of a correct diagnosis (OR 1.987) alongside concurrent gains in sensitivity and specificity. AI assistance also reduced diagnostic time by 14.9%, elevated diagnostic confidence by 9.0%, and markedly improved inter-rater agreement. When calibrated to maintain non-inferior performance to senior pathologists, the AI-assisted workflow could triage 60.7% of atrophy and 82.7% of intestinal metaplasia cases.

2606.04788 2026-06-04 cs.CV cs.RO

Z-FLoc: Zero-Shot Floorplan Localization via Geometric Primitives

Z-FLoc: 基于几何基元的零样本楼层平面定位

Ayumi Umemura, Toshinori Kuwahara, Marc Pollefeys, Daniel Barath

AI总结 提出一种零样本楼层平面定位方法,通过从单目3D重建的鸟瞰图中提取直线和圆等几何基元,并与楼层平面进行鲁棒匹配,无需重新训练即可泛化到新环境。

详情
AI中文摘要

视觉定位——在预先存在的地图中估计相机姿态——是计算机视觉中的一个基本问题。楼层平面是一种有吸引力的地图表示:它们对于大多数建筑来说易于获取、紧凑,并且固有地不受视觉外观变化的影响。然而,弥合相机观测与楼层平面几何之间的严重领域差距仍然具有挑战性。现有方法通过数据驱动学习来解决这一差距,但它们需要大规模训练数据和特定环境的重新训练,限制了实际部署。我们提出了一种零样本楼层平面定位方法,无需任何重新训练即可泛化到新环境。我们的关键见解是,主导几何基元——直线和圆——在人造环境中无处不在,并提供外观不变的结构约束。我们从单目3D重建的鸟瞰图投影中提取这些基元,并通过鲁棒估计框架内的专用最小求解器将它们与楼层平面进行匹配。在模拟和真实数据集上的实验表明,我们的方法在未见过的环境上优于最先进的基于学习的方法,同时在所有实验中使用单一固定的超参数集。源代码将公开提供。

英文摘要

Visual localization -- estimating a camera pose within a pre-existing map -- is a fundamental problem in computer vision. Floorplans are an attractive map representation: they are readily available for most buildings, compact, and inherently invariant to visual appearance changes. However, bridging the severe domain gap between camera observations and floorplan geometry remains challenging. Existing methods address this gap through data-driven learning, yet they require large-scale training data and environment-specific retraining, limiting their practical deployment. We propose a zero-shot floorplan localization method that generalizes to novel environments without any retraining. Our key insight is that dominant geometric primitives -- lines and circles -- are ubiquitous in human-made environments and provide appearance-invariant structural constraints. We extract these primitives from a bird's-eye-view (BEV) projection of monocular 3D reconstructions and match them to the floorplan via dedicated minimal solvers within a robust estimation framework. Experiments on both simulated and real-world datasets show that our approach outperforms state-of-the-art learning-based methods on unseen environments, while using a single fixed set of hyperparameters across all experiments. The source code will be made publicly available.

2606.04781 2026-06-04 cs.AI cs.LG

AIP: A Graph Representation for Learning and Governing Agent Skills

AIP: 一种用于学习和治理智能体技能的图表示

Zachary Blumenfeld, Jim Webber

AI总结 提出Agent指令协议(AIP),将有向执行图作为技能表示,通过编译人类编写的技能提升任务表现,并支持技能的可诊断修复与治理。

详情
AI中文摘要

当前的智能体技能主要由自由形式的散文组成,要求智能体在每个会话中阅读、解释并重新推导如何行动。这带来了两个叠加的成本:在实现密集型任务上降低了可靠性,并且技能创建和改进困难,因为编辑散文是一个脆弱的过程,人类和智能体都难以处理,特别是对于模型训练中代表性不足的领域特定程序性知识。智能体指令协议(AIP)通过将技能建模为有向执行图来解决这两个问题:离散步骤作为节点,由确定性脚本或自然语言描述支持,通过显式类型的输入/输出边连接,并由模式验证的YAML规范管理。一个编译器元技能将现有的人类编写的技能转换为这种形式。好处是双重的。首先,将人类编写的技能编译为AIP后,Claude Sonnet在SkillsBench的27个真实智能体任务上的平均任务奖励从0.60提高到0.71,通过率从53%提高到67%——这是统计上显著的提升(Wilcoxon符号秩检验p=0.011),在12个任务中获胜,2个失败,13个平局——通常耗时更少。该图为智能体提供了经过验证、可运行的单元,而不是要求它从自然语言中重新推导代码、命令和工具调用。其次,在创建和改进方面,由于每个技能都经过模式验证、功能可测试且可逐节点寻址,因此可以精确诊断和修复故障。两个作者编写的技能故障被追溯到脚本级别。在调整AIP规范并重新编译后,两者均恢复且无回归(一个任务从0/5变为5/5),将技能改进转变为可测量的调优循环,而不是散文重写。相同的图结构支持语料库级别的治理和技能内省,并为基于技能的强化学习提供了自然的动作空间。

英文摘要

Agent Skills today consist largely of free-form prose requiring the agent to read, interpret, and re-derive how to act in every session. This imposes two compounding costs: reduced reliability on implementation-heavy tasks, and difficulty in skill creation and improvement, since editing prose is a fragile process that both humans and agents struggle with, particularly for domain-specific procedural knowledge underrepresented in model training. The Agent Instruction Protocol (AIP) addresses both by modeling a skill as a directed execution graph: discrete steps as nodes backed by deterministic scripts or natural-language descriptions, connected by explicit typed input/output edges, and governed by a schema-validated YAML specification. A compiler meta-skill translates existing human-written skills into this form. The benefits are twofold. First, compiling human-written skills to AIP raised Claude Sonnet's mean task reward from 0.60 to 0.71 and pass rate from 53% to 67% across 27 real agent tasks from SkillsBench - a statistically significant gain (Wilcoxon signed-rank p = 0.011), winning 12 tasks to 2 with 13 ties - often in less wall-clock time. The graph delivers vetted, runnable units to the agent rather than asking it to re-derive code, commands, and tool calls from natural language. Second, on creation and improvement, because each skill is schema-validated, functionally testable, and addressable node-by-node, failures can be diagnosed and repaired precisely. Two authored-skill failures were traced to the script level. After adjusting the AIP spec and recompiling, both recovered with zero regressions (one task going from 0/5 to 5/5), turning skill improvement into a measurable tuning loop rather than a prose rewrite. That same graph structure supports corpus-level governance and skill introspection, and provides a natural action space for reinforcement learning over skills.

2606.04780 2026-06-04 cs.CL

PersonaTree: Structured Lifecycle Memory for Person Understanding in LLM Agents

PersonaTree: 面向LLM智能体人物理解的结构化生命周期记忆

Yubo Hou, Jingwei Song, Hongbo Zhang, Zhisheng Chen, Bang Xiao, Tao Wan, Zengchang Qin

AI总结 提出PersonaTree,一种结构化生命周期记忆框架,通过三级人物树和显式支持路径,将交互证据抽象为人物理解,在多个基准上取得领先性能。

详情
AI中文摘要

持久化的LLM智能体需要记忆表示,使得在长期交互中人物理解的形成变得明确。现有的智能体记忆方法强调信息保留和检索,但对累积的交互证据如何被抽象为人物理解的解释有限。我们将这一过程视为图式形成,其中情境证据被抽象为可重用模式和稳定的人物层面断言。我们引入PersonaTree,一种结构化生命周期记忆框架,通过三级人物树实现这一观点,并具有从证据到断言的显式支持路径。PersonaTree通过保守写入、置信度引导的合并和查询条件路径检索来维护树,仅返回每个查询所需的证据深度。在六个涉及人物理解和持久记忆的基准测试中,使用三个回答骨干,PersonaTree在18个紧凑分数中排名第一,并在16个设置中进入前两名。消融实验表明,层次结构提高了KnowMe上的抽象人物理解,而在可比上下文预算下,支持路径检索提高了RealPref的对齐度。

英文摘要

Persistent LLM agents require memory representations that make the formation of person understanding explicit across long term interaction. Existing agent memory methods emphasize information retention and retrieval, yet give limited account of how accumulated interaction evidence is abstracted into person understanding. We view this process as schema formation, where situated evidence is abstracted into reusable patterns and stable person level claims. We introduce PersonaTree, a structured lifecycle memory framework that realizes this view as a three level persona tree with explicit support paths from evidence to claims. PersonaTree maintains the tree through conservative writing, confidence guided consolidation, and query conditioned path retrieval, returning only the evidence depth required by each query. Across six person understanding and persistent memory benchmarks with three answer backbones, PersonaTree ranks first in 12 of 18 compact scores and reaches the top two in 16 settings. Ablations show that hierarchy improves abstract person understanding on KnowMe, while support path retrieval improves RealPref alignment under a comparable context budget.

2606.04779 2026-06-04 cs.AI math.CO

Tree-Based Formalization of Multi-Agent Complementarity in Human-AI Interactions

基于树的人机交互中多智能体互补性形式化

Andrea Ferrario

AI总结 本文提出一种基于树的形式化框架,通过有序智能体角色配置和平面二叉树表示人机交互协议,证明互补性在回归中可实现,但在分类中受限于局部聚合和损失函数的自然条件。

详情
Comments
29 pages, 9 figures
AI中文摘要

互补性是指人机交互(HAI)的表现优于其成员中最佳预测基准的情况。尽管这一概念在HAI研究中至关重要,但关于互补性的形式化工作仍然有限。现有框架未能建模智能体的预测如何组合成对工作流敏感的多智能体协议。我们通过引入基于树的多智能体HAI互补性形式化来填补这一空白。一个HAI协议由一个有序的智能体角色配置以及一棵有根平面二叉树表示,树的叶子由预测向量装饰。沿树递归评估一个局部二元组合规则,产生相对于逐点最小预言基准的树相对互补性泛函。我们证明了四个结果。第一,基于选择器的HAI(包括自我或AI依赖)无法实现互补性,无论任务、损失或预测质量如何。第二,在平方损失下的回归中,互补性等价于与真实向量之间的欧几里得距离最小化;对于$N=2$,最优线性池化权重具有封闭形式并具有残差校正解释。第三,在线性局部组合下,每个协议树定义了叶子权重单纯形上的重心坐标图;协议树的Tamari覆盖重新参数化保持互补性,对于$N=4$,它们满足五边形恒等式。第四,在二元分类中,在端点单调损失(包括标准Bregman和许多有限伯努利$f$散度损失)下,没有内部局部组合能实现互补性;在交叉熵下的多类聚合中存在类似障碍。总之,我们的框架表明,互补性在多智能体回归中是可实现的,但在分类中,在局部聚合和损失函数的自然条件下受到阻碍。

英文摘要

Complementarity is the case in which a human--AI interaction (HAI) outperforms the best prediction benchmark available among its members. Although this idea is central in HAI research, formal work on complementarity remains limited. Existing frameworks do not model how agents' predictions compose into workflow-sensitive multi-agent protocols. We close this gap by introducing a tree-based formalization of complementarity in multi-agent HAI. An HAI protocol is represented by an ordered agent-role configuration together with a rooted planar binary tree whose leaves are decorated by prediction vectors. A local binary composition rule is evaluated recursively along the tree, yielding a tree-relative complementarity functional relative to a pointwise-min oracle benchmark. We prove four results. First, selector-based HAIs, including self- or AI-reliance, cannot achieve complementarity regardless of task, loss, or prediction quality. Second, in regression under squared loss, complementarity is equivalent to Euclidean distance minimization from the ground-truth vector; for $N=2$, the optimal linear-pooling weight has a closed form and a residual-correction interpretation. Third, under linear local composition, every protocol tree defines a barycentric coordinate chart on the simplex of leaf weights; Tamari-cover reparameterizations of protocol trees preserve complementarity, and for $N=4$, they satisfy the pentagon identity. Fourth, in binary classification, no internal local composition can achieve complementarity under endpoint-monotone losses, including standard Bregman and many finite Bernoulli $f$-divergence losses; an analogous obstruction holds for multiclass aggregation under cross-entropy. In summary, our framework shows that complementarity is attainable in multi-agent regression, but obstructed in classification under natural conditions on local aggregation and loss functions.

2606.04778 2026-06-04 cs.AI cs.CL cs.LG

Inference-Time Vulnerability Beyond Shallow Safety: Alignment Along Generation Trajectories

超越浅层安全的推理时脆弱性:沿生成轨迹的对齐

Kyungmin Park, Taesup Kim

AI总结 本文揭示安全对齐的大语言模型在推理时存在更广泛的脆弱性,即任意生成步骤的短标记注入都能显著改变后续安全行为,并提出通过直接在生成轨迹上对齐模型来提升鲁棒性。

详情
AI中文摘要

安全对齐的大语言模型(LLMs)在推理时仍然容易受到干预,这些干预会将生成导向有害输出。最近的研究将其归因于浅层安全,即对齐集中在最初的几个输出标记上。我们表明,浅层安全是更广泛的推理时脆弱性的一个特例,其中在任何生成步骤的短标记注入都能显著改变后续的安全行为。我们还发现,模型在其隐藏状态中与拒绝方向的对齐并不能预测其对这种注入的鲁棒性,这表明在扰动下,内部状态本身并不能决定生成行为。为了解决这个问题,我们通过模拟序列中段扰动构建的生成轨迹上直接对齐模型,并表明这提高了对中段注入的鲁棒性,并泛化到利用早期标记生成的攻击。我们的工作认为,鲁棒的安全对齐需要对生成过程本身进行训练,而不仅仅是其输出。

英文摘要

Safety-aligned Large Language Models (LLMs) remain vulnerable to interventions during inference that redirect generation toward harmful outputs. Recent work attributes this to shallow safety, where alignment concentrates in the first few output tokens. We show that shallow safety is a special case of a broader inference-time vulnerability, in which short token injections at any generation step can substantially alter subsequent safety behavior. We also find that a model's alignment with refusal directions in its hidden states does not predict its robustness to such injection, revealing that internal state alone does not determine generation behavior under perturbation. To address this, we align models directly on generation trajectories constructed by simulating mid-sequence perturbation, and show that this improves robustness to mid-sequence injection and generalizes to attacks that exploit early-token generation. Our work argues that robust safety alignment requires training on the generation process itself, not only its outputs.

2606.04776 2026-06-04 cs.RO

SoftPINCH: EMG-Driven Soft Exoskeleton Assistance for Finger Flexion and Grasping

SoftPINCH: 用于手指屈曲和抓取的EMG驱动软体外骨骼辅助

Nicklas Nikolaj Grønvall, Magnus Malthe Sigsgaard Nielsen, Xiaofeng Xiong, Saravana Prashanth Murali Babu

AI总结 提出一种结合肌腱驱动软体外骨骼、指尖磁接触传感和神经EMG解码的EMG驱动软体可穿戴外骨骼系统SoftPINCH,用于拇指-食指屈曲和捏取辅助,实验表明CNN+LSTM模型在解码中达到99.4%准确率,且主动辅助可显著降低肌肉用力。

详情
Comments
Submitted to 18th International Conference on the Simulation of Adaptive Behavior (SAB 2026)
AI中文摘要

表面肌电图(sEMG)提供了一种非侵入式接口,用于检测手部运动意图并控制可穿戴辅助设备。然而,由于EMG信号受噪声、运动伪影、电极放置、肌肉疲劳和受试者间差异的影响,可靠的EMG驱动手部辅助仍然具有挑战性。同时,许多手部外骨骼在机械上仍具有限制性或笨重,限制了舒适性和自然手部运动。本工作提出了SoftPINCH,一种用于拇指-食指屈曲和捏取辅助的EMG驱动软体可穿戴外骨骼。该系统结合了肌腱驱动的软体外骨骼、指尖磁接触传感和用于基于意图辅助的神经EMG解码。在食指和拇指运动期间记录前臂肌肉的表面EMG,并评估了三种独立于受试者的解码架构:LSTM、CNN+LSTM和带注意力的CNN+LSTM。CNN+LSTM和CNN+LSTM-attention模型均达到99.4%的LOSO测试准确率,优于达到97.8%的独立LSTM。然而,注意力机制相比CNN+LSTM并未提供显著改进,表明基于CNN的特征提取足以实现鲁棒的EMG表示。因此,由于高准确率和较低的架构复杂度,选择了CNN+LSTM模型进行实时部署。功能评估表明,主动外骨骼辅助在孤立手指屈曲和物体抓取期间减少了肌肉用力。在负重抓取期间,辅助在所有测试负载下均减少了肌肉用力,在最高负载下减少了92.6%。这些结果证明了SoftPINCH通过实时EMG驱动软体机器人控制实现直观、低用力捏取辅助的潜力。

英文摘要

Surface electromyography (sEMG) provides a non-invasive interface for detecting hand-movement intention and controlling wearable assistive devices. However, reliable EMG-driven hand assistance remains challenging because EMG signals are affected by noise, motion artifacts, electrode placement, muscle fatigue, and inter-subject variability. At the same time, many hand exoskeletons remain mechanically restrictive or bulky, limiting comfort and natural hand motion. This work presents SoftPINCH, an EMG-driven soft wearable exoskeleton for thumb-index finger flexion and pinch grasp assistance. The system combines a tendon-driven soft exoskeleton, fingertip magnetic contact sensing, and neural EMG decoding for intention-based assistance. Surface EMG was recorded from forearm muscles during index and thumb movements, and three subject-independent decoding architectures were evaluated: LSTM, CNN+LSTM, and CNN+LSTM with attention. The CNN+LSTM and CNN+LSTM-attention models both achieved 99.4% LOSO test accuracy, outperforming the standalone LSTM, which reached 97.8%. However, the attention mechanism did not provide a significant improvement over CNN+LSTM, indicating that CNN-based feature extraction was sufficient for robust EMG representation. The CNN+LSTM model was therefore selected for real-time deployment due to its high accuracy and lower architectural complexity. Functional evaluation showed that active exoskeleton assistance reduced muscular effort during isolated finger flexion and object grasping. During weighted grasping, assistance reduced muscular effort across all tested loads, with a 92.6% reduction at the highest load. These results demonstrate the potential of SoftPINCH for intuitive, low-effort pinch assistance using real-time EMG-driven soft robotic control.

2606.04775 2026-06-04 cs.LG cs.AI cs.CV cs.SY eess.SY math.OC

Activation Steering of Video Generation Models via Reduced-Order Linear Optimal Control

通过降阶线性最优控制引导视频生成模型的激活

Jihoon Hong, Alice Chan, Qiyue Dai, Julian Skifstad, Glen Chou

AI总结 提出LA-LQR框架,将文本到视频推理建模为动态系统,通过降阶最优控制实现最小干预的激活引导,减少不安全内容生成同时保持视觉质量。

详情
AI中文摘要

在大规模网络数据上训练的文本到视频(T2V)模型可能生成不良内容,这促使我们进行干预以减少有害输出而不牺牲视觉质量。激活引导提供了一种有吸引力的机制替代微调和提示过滤,但现有的T2V引导方法仍然有限,通常采用粗糙的、非预测性的干预,可能导致过度引导和内容退化。为了弥补这一差距,我们提出了潜在激活线性二次型调节器(LA-LQR),一种用于最小侵入性T2V引导的降阶最优控制框架。LA-LQR将T2V推理表述为一个动态系统,并计算闭环反馈干预,将激活引导向期望的特征设定点,同时惩罚不必要的扰动。为了使最优控制对高维视频激活可行,我们将激活投影到由对比提示对导出的低维、任务相关子空间,估计该潜在空间中的局部线性动力学,并求解潜在LQR问题以获得时间步和层特定的引导信号。我们提供了将潜在设定点跟踪与原始激活空间特征控制联系起来的理论界限,并实证验证了降阶潜在动力学的保真度。在概念引导和视频安全基准测试中,LA-LQR相对于基线减少了不安全生成,同时保持了提示保真度和视觉质量。

英文摘要

Text-to-video (T2V) models trained on large-scale web data can generate undesired content, motivating interventions that reduce harmful outputs without sacrificing visual quality. Activation steering offers an attractive mechanistic alternative to finetuning and prompt filtering, but existing T2V steering methods remain limited, typically applying coarse, non-anticipative interventions that can lead to oversteering and content degradation. To close this gap, we propose Latent Activation Linear-Quadratic Regulator (LA-LQR), a reduced-order optimal control framework for minimally invasive T2V steering. LA-LQR formulates T2V inference as a dynamical system and computes closed-loop feedback interventions that steer activations toward desired feature setpoints while penalizing unnecessary perturbations. To make optimal control feasible for high-dimensional video activations, we project activations onto a low-dimensional, task-relevant subspace derived from contrastive prompt pairs, estimate local linear dynamics in this latent space, and solve a latent LQR problem to obtain timestep- and layer-specific steering signals. We provide theoretical bounds relating latent setpoint tracking to raw activation-space feature control, and empirically validate the fidelity of the reduced latent dynamics. On concept steering and video safety benchmarks, LA-LQR reduces unsafe generations relative to baselines, while preserving prompt fidelity and visual quality.

2606.04773 2026-06-04 cs.CV cs.CL

NextMotionQA: Benchmarking and Judging Human Motion Understanding with Vision-Language Models

NextMotionQA: 使用视觉语言模型基准测试和评判人体运动理解

Yong Cao, Chuqiao Li, Xianghui Xie, Gerard Pons-Moll, Andreas Geiger

AI总结 提出NextMotionQA基准,通过三项互补任务和多粒度难度分层,系统评估视觉语言模型在人体运动理解中的能力,并揭示其在细粒度评判中的局限性。

详情
Comments
23 pages, 8 figures, 9 tables
AI中文摘要

人体运动理解的可靠评估对于推进具身人工智能、机器人和动画至关重要。然而,现有基准存在语义粒度粗糙、难度无区分、标注质量有限以及答案模糊等问题,无法诊断当前模型的失败之处。为弥补这一差距,我们引入NextMotionQA,这是一个全面的基准,利用视觉语言模型(VLM)进行半自动化、专家验证的数据集构建。NextMotionQA包含三项互补任务:多项选择题问答、视频字幕生成和细粒度错误纠正。每项任务沿三个核心语义轴系统组织,并分为三个任务复杂度级别。我们对十二个代表性VLM的广泛评估揭示了在传统单任务评估中不可见的关键能力差距和弱点。在互补方向上,近期工作开始使用VLM作为文本到运动评估的评判者;我们探究它们在更困难任务下是否表现出同样的退化。我们发现,VLM在粗粒度标准上与专家评分高度一致(Cohen's κ=0.70),但在细粒度、部件级评判上表现不佳(κ=0.10),验证了该范式在其强项领域的有效性,同时明确了其局限性。

英文摘要

Reliable evaluation of human motion understanding is fundamental to advancing embodied AI, robotics, and animation. However, existing benchmarks suffer from coarse semantic granularity, undifferentiated difficulty, limited annotation quality, and pervasive answer ambiguity, leaving them unable to diagnose where current models fail. To bridge this gap, we introduce NextMotionQA, a comprehensive benchmark that leverages vision-language models (VLMs) for semi-automated, expert-verified dataset. NextMotionQA features three complementary tasks: multiple-choice question answering, video captioning, and fine-grained error correction. Each task is systematically structured across three core semantic axes and stratified into three task complexity levels. Our extensive evaluation of twelve representative VLMs uncovers critical capability gaps and weakness that remain invisible under conventional, single-task evaluations. In a complementary direction, recent work has begun using VLMs as judges for text-to-motion evaluation; we ask whether they show the same degradation under harder tasks. We find that VLMs align strongly with expert ratings on coarse criteria (Cohen's κ=0.70) but break down on fine-grained, part-level judgment (κ=0.10), validating the paradigm in its strong regime while clarifying its limits.

2606.04772 2026-06-04 cs.CV cs.AI

Coarse-to-fine Hierarchical Architecture with Sequential Mamba for Brain Reconstruction

用于脑重建的基于顺序Mamba的粗到细层次架构

Hoang-Son Vo, Van-Hung Bui, Minh-Huy Mai-Duc, Tien-Dung Mai, Soo-Hyung Kim

AI总结 提出CHASMBrain,一种基于双流Mamba和粗到细策略的两阶段图像到fMRI编码框架,在NSD数据集上优于基线,并揭示了视觉皮层的因果组织特性。

详情
AI中文摘要

理解深度视觉表征与人类视觉系统之间的关系是计算神经科学中的一个基本挑战。尽管现代视觉模型在图像识别中取得了强劲性能,但它们与人类视觉皮层层次组织的对应关系仍是一个开放问题。在本研究中,我们提出了CHASMBrain,一种新颖的分层两阶段图像到fMRI编码框架。我们的架构利用双流Mamba设计,明确分离并处理全局语义标记和局部空间补丁,这一设计受视觉皮层功能组织的启发。采用粗到细策略:第一阶段预测去噪的ROI级激活,第二阶段使用Mamba-VAE将这些粗响应细化为全体素级预测。在自然场景数据集(NSD)上的实验表明,我们的方法达到了0.429的皮尔逊相关系数和0.261的均方误差,优于所有评估的基线,包括岭回归和DINOv2线性探针。除了预测性能,因果分支消融实验揭示了一种非对称特化:补丁流特定锁定于早期视觉皮层(视网膜拓扑区域),而CLS流为高阶区域提供更广泛的语义上下文——这种对应关系是因果性的,而不仅仅是相关性的。跨被试迁移实验进一步表明,学习到的骨干网络在个体间泛化良好,只需极少的个体适应,表明模型捕捉到了共享的、与主体无关的视觉表征。

英文摘要

Understanding the relationship between deep visual representations and the human visual system is a fundamental challenge in computational neuroscience. While modern vision models achieve strong performance in image recognition, their correspondence with the hierarchical organization of the human visual cortex remains an open question. In this study, we propose CHASMBrain, a novel hierarchical two-stage framework for image-to-fMRI encoding. Our architecture leverages a dual-stream Mamba design to explicitly separate and process global semantic tokens and local spatial patches, motivated by the functional organization of the visual cortex. A coarse-to-fine strategy is employed: Stage 1 predicts denoised ROI-level activations, while Stage 2 refines these coarse responses into full voxel-level predictions using a Mamba-VAE. Experiments on the Natural Scenes Dataset (NSD) demonstrate that our method achieves a Pearson correlation of 0.429 and an MSE of 0.261, outperforming all evaluated baselines including ridge regression and DINOv2 linear probes. Beyond predictive performance, causal branch-ablation experiments reveal an asymmetric specialization: the patch stream is specifically locked to early visual cortex (retinotopic regions), while the CLS stream contributes broader semantic context to higher-order areas -- a correspondence that holds causally, not merely correlationally. Cross-subject transfer experiments further show that the learned backbone generalizes across individuals with minimal per-subject adaptation, suggesting the model captures a shared, subject-agnostic visual representation.

2606.04769 2026-06-04 cs.CR cs.AI cs.SE

Description-Code Inconsistency in Real-world MCP Servers: Measurement, Detection, and Security Implications

现实世界 MCP 服务器中的描述-代码不一致性:测量、检测与安全影响

Yutao Shi, Xiaohan Zhang, Xiangjing Zhang, Xihua Shen, Hui Ouyang, Huming Qiu, Mi Zhang, Min Yang

AI总结 针对 MCP 服务器中工具描述与代码实现不一致的问题,提出结合结构感知静态分析与 Direct-Reverse-Arbitration 提示方法的自动检测框架 DCIChecker,并在大规模数据集上揭示 9.93% 的不一致率及其安全风险。

详情
Comments
Preprint
AI中文摘要

模型上下文协议 (MCP) 已成为赋能大型语言模型 (LLM) 使用外部工具的关键标准。在此生态系统中,LLM 依赖 MCP 服务器提供的自然语言描述来选择和执行函数。这种交互隐含假设工具描述忠实地反映了其底层实现,而该假设在实践中并未得到强制验证。因此,MCP 部署可能遭受名为描述-代码不一致性 (DCI) 的问题,即工具对其能力和安全边界的描述与代码实际行为不一致。 本文对现实世界 MCP 服务器中的 DCI 进行了全面研究。我们正式定义了该问题,并提出了一个涵盖功能不一致和未声明副作用的综合分类法。在此分类法指导下,我们开发了 DCIChecker,一个自动框架,结合结构感知静态分析与 Direct-Reverse-Arbitration 提示方法,交叉验证工具描述与实际代码实现。我们将该框架应用于一个大规模数据集,包含从 2,214 个现实世界 MCP 服务器中提取的 19,200 个描述-代码对。我们的测量揭示 DCI 普遍存在,其中 9.93% 的对存在不一致。我们进一步证明 DCI 造成了关键的防御盲点,助长了从操作故障到隐蔽恶意行为等多种风险。最后,我们提出了缓解策略以强制语义一致性并增强新兴智能体生态系统的可靠性。

英文摘要

The Model Context Protocol (MCP) has emerged as a critical standard empowering Large Language Models (LLMs) to utilize external tools. In this ecosystem, LLMs rely on natural language descriptions provided by MCP servers to select and execute functions. This interaction implicitly assumes that tool descriptions faithfully reflect their underlying implementations, while this assumption is not mandatorily verified in practice. As a result, MCP deployments may suffer from a problem named Description-Code Inconsistency (DCI), where a tool's description of its capabilities and security boundaries is not consistent with what the code actually does. In this paper, we present a comprehensive study of DCI in real-world MCP servers. We formally define the problem and propose a comprehensive taxonomy spanning functionality inconsistencies and undeclared side effects. Guided by this taxonomy, we develop DCIChecker, an automated framework that combines structure-aware static analysis with the Direct-Reverse-Arbitration prompting method to cross-validate tool descriptions against actual code implementations. We apply this framework to a large-scale dataset comprising 19,200 description-code pairs extracted from 2,214 real-world MCP servers. Our measurement reveals that DCI is widespread, with 9.93% of these pairs exhibiting inconsistencies. We further demonstrate that DCI creates a critical defense blind spot, facilitating varied risks from operational failures to stealthy malicious behaviors. Finally, we propose mitigation strategies to enforce semantic consistency and enhance the reliability of the emerging agentic ecosystem.

2606.04767 2026-06-04 cs.LG cs.CV

Measuring Model Robustness via Fisher Information: Spectral Bounds, Theoretical Guarantees, and Practical Algorithms

通过Fisher信息度量模型鲁棒性:谱界、理论保证与实用算法

Chong Zhang, Xiang Li, Jia Wang, Qiufeng Wang, Xiaobo Jin

AI总结 提出基于Fisher信息矩阵谱范数的攻击无关鲁棒性度量,理论推导常见架构的闭式谱界,并开发高效估计算法,实验验证其与对抗脆弱性的强相关性。

详情
Comments
35 pages, 1 figure
AI中文摘要

深度神经网络的鲁棒性对于安全关键部署至关重要,但现有评估方法通常依赖于攻击且缺乏可解释性。我们提出了一种基于Fisher信息矩阵(FIM)谱范数的原则性、攻击无关的鲁棒性度量,该度量量化了模型输出分布对输入扰动的worst-case敏感性。理论上,我们证明了FIM等于输入Jacobian的方差,并推导了常见架构(包括VGG、ResNet、DenseNet和Transformer)的闭式谱界,提供了首个理论鲁棒性排名。为了实现可扩展的评估,我们开发了高效算法,包括幂迭代和基于Hutchinson的估计,支持白盒和黑盒设置。在多个数据集(包括CIFAR、ImageNet和医学图像)和多种架构上的大量实验表明,我们的度量与对抗脆弱性之间存在强相关性。我们的框架作为一种可解释的诊断工具,补充了基于攻击的评估,提供了对架构敏感性的洞察,并指导更鲁棒模型的设计。代码可在https://github.com/franz-chang/SRP/获取。

英文摘要

The robustness of deep neural networks is crucial for safety-critical deployments, yet existing evaluation methods are often attack-dependent and lack interpretability. We propose a principled, attack-agnostic robustness metric based on the spectral norm of the Fisher Information Matrix (FIM), which quantifies the worst-case sensitivity of the model's output distribution to input perturbations. Theoretically, we establish that the FIM equals the variance of the input Jacobian and derive closed-form spectral bounds for common architectures, including VGG, ResNet, DenseNet, and Transformer, providing the first theoretical robustness ranking. To enable scalable evaluation, we develop efficient algorithms, including power iteration and Hutchinson-based estimation, that support both white-box and black-box settings. Extensive experiments across multiple datasets, including CIFAR, ImageNet, and medical images, and across multiple architectures show a strong correlation between our metric and adversarial vulnerability. Our framework serves as an interpretable diagnostic tool that complements attack-based evaluations, offering insights into architectural sensitivity and guiding the design of more robust models. Code is available at: https://github.com/franz-chang/SRP/.

2606.04764 2026-06-04 cs.CV

Do Foundation Models See Biology? Evaluating Attention Coherence with Spatial Transcriptomics in Glioblastoma

基础模型是否理解生物学?利用空间转录组学评估胶质母细胞瘤中的注意力一致性

Dilakshan Srikanthan, Amoon Jamzad, Paul Wilson, Nooshin Maghsoodi, Robert Policelli, Gabor Fichtinger, John F. Rudan, Parvin Mousavi

AI总结 提出基于空间转录组学的框架,客观评估病理基础模型注意力图与生物学的一致性,发现注意力捕捉多基因转录程序而非单个分子事件。

详情
AI中文摘要

病理基础模型的注意力图是否捕捉真实的生物学仍未知,但这一问题对临床信任和监管批准至关重要。我们提出一个基于空间转录组学的框架,用于无假设的注意力正交评估,并将其应用于五个病理基础模型(CONCH v1.5、UNI v2、Virchow2、GigaPath、H-Optimus-1)和一个ResNet50基线。使用基于注意力的多实例学习,我们训练单任务和多任务模型预测胶质母细胞瘤中的五种分子改变(CPTAC队列),在独立TCGA队列上验证,并使用来自18个样本的共配准Visium空间转录组数据评估注意力图与87个转录特征之间的生物学一致性。内部结果显示,没有单一编码器在所有任务中占优,外部验证则颠倒了内部性能排名。注意力图显示从通路(Cohen's d=0.329)到单个基因(d=0.055)的五倍富集梯度,表明注意力捕捉的是涌现的多基因转录程序而非单个分子事件。空间平滑的注意力图并不意味生物学一致性,不同编码器关注不同的生物学区室。我们的框架提供了对基础模型从组织病理学中学到内容的客观定量评估,推动该领域超越定性显著性图审查。

英文摘要

Whether attention maps from pathology foundation models capture genuine biology remains unknown, yet this question is critical for clinical trust and regulatory approval. We propose a spatial transcriptomics-based framework for orthogonal, hypothesis-free evaluation of attention and apply it to five pathology foundation models (CONCH v1.5, UNI v2, Virchow2, GigaPath, H-Optimus-1) and a ResNet50 baseline. Using attention-based multiple instance learning, we train single-task and multi-task models to predict five molecular alterations in glioblastoma on the CPTAC cohort, validate on an independent TCGA cohort, and evaluate biological coherence of attention maps against 87 transcriptional signatures using co-registered Visium spatial transcriptomics data from 18 samples. Internally, no single encoder dominates across all tasks, and external validation inverts internal performance rankings. Attention maps show a five-fold enrichment gradient from pathways (Cohen's d=0.329) to individual genes (d=0.055), indicating that attention captures emergent multi-gene transcriptional programs rather than individual molecular events. Spatially smooth attention maps do not imply biological coherence, and different encoders attend to distinct biological compartments. Our framework provides objective, quantitative assessment of what foundation models learn from histopathology, moving the field beyond qualitative saliency map review.

2606.04757 2026-06-04 math.OC cs.LG

Near-Optimal Decentralized Stochastic Convex Optimization over Networks

网络上的近最优去中心化随机凸优化

Nitai Kluger, Amit Attia, Tomer Koren

AI总结 针对去中心化随机光滑凸优化问题,提出一种加速去中心化方法,在总梯度样本预算N下,将可支持的工作节点数提升至M≲√ρ N^{3/4},并证明其最优性。

详情
Comments
12 papers
AI中文摘要

我们研究去中心化随机光滑凸优化,其中$M$个工作者使用局部随机梯度并通过固定八卦网络上的仅邻居通信来最小化平均目标。该设置中的一个核心问题是,在总梯度样本预算为$N$的情况下,确定可以使用的最大工作者数量,同时仍保持集中式$O(1/\sqrt N)$统计速率。我们引入了一种加速去中心化方法,该方法在最多$\smash{M\lesssim \sqrt\rho\,N^{3/4}}$个工作者时保持该速率,其中$\rho$是八卦网络的谱间隙,改进了先前最佳的最大缩放$\smash{M\lesssim \rho\sqrt N}$。该方法基于一步延迟随机加速方案,使工作者能够将小批量与加速八卦交错进行,同时控制残差分歧,其保证仅对数依赖于最优-局部异质性。我们还为线性跨度去中心化一阶方法建立了匹配的下界,表明该方法在对数因子内是最优的。

英文摘要

We study decentralized stochastic smooth convex optimization, where $M$ workers minimize an average objective using local stochastic gradients and neighbor-only communication over a fixed gossip network. A central question in this setting is to determine the largest number of workers that can be used under a total budget of $N$ gradient samples while still preserving the centralized $O(1/\sqrt N)$ statistical rate. We introduce an accelerated decentralized method that preserves this rate for up to $\smash{M\lesssim \sqrtρ\,N^{3/4}}$ workers, where $ρ$ is the spectral gap of the gossip network, improving the best prior maximal scaling of $\smash{M\lesssim ρ\sqrt N}$. The method is based on a one-step-delayed stochastic acceleration scheme that enables workers to interleave minibatching with accelerated gossip while controlling residual disagreement, and its guarantee depends only logarithmically on the optimum-local heterogeneity. We also establish a matching lower bound for linear-span decentralized first-order methods, showing that the method is optimal up to logarithmic factors.

2606.04755 2026-06-04 hep-ex cs.AI cs.IR

Archi: Agentic Operations at the CMS Experiment

Archi: CMS实验中的代理操作

Pietro Lugato, Luca Lavezzo, Jason Mohoney, Hasan Ozturk, Muhammad Hassan Ahmed, Juan Pablo Salas, Viphava Ohm, Krittin Phornsiricharoenphant, Gabriele Benelli, Mariarosaria D'Alfonso, Manasvita Joshi, Warren Nam, Aron Soha, Samantha Sunnarborg, Austin Swinney, Jack Tucker, Dmytro Kovalskyi, Tim Kraska, Christoph Paus

AI总结 提出Archi开源框架,整合异构数据源并部署可配置、私有的代理,用于CMS实验计算操作支持,在真实查询中表现有效。

详情
AI中文摘要

我们提出Archi,一个面向科学合作的开源端到端框架,它结合了异构数据源的系统化摄取和组织,以及可配置、私有且可扩展的代理的部署,这些代理能够检索和推理这些数据。自2026年2月起,Archi的一个实例已部署在CERN大型强子对撞机的CMS实验计算操作团队中,作为技术操作员的辅助代理,通过结合文档、历史数据和实时监控系统提供检索和分析能力。我们根据操作员反馈和从生产使用中收集的问题集对系统进行评估,这些问题由人工和自动化专家组评分。该系统在操作任务中证明有效,解决了CMS操作员提出的真实世界查询。我们还观察到,本地托管的开源权重模型表现具有竞争力,从而能够对敏感数据进行完全私有管理。

英文摘要

We present Archi, an open-source, end-to-end framework for scientific collaborations that combines the systematic ingestion and organization of heterogeneous data sources with the deployment of configurable, private, and extensible agents that retrieve and reason over them. An instance of Archi has been deployed for the Computing Operations team of the CMS experiment at CERN's LHC since February 2026 as a support agent for technical operators, offering retrieval and analysis capabilities by combining documentation, historical data, and live monitoring systems. We evaluate the system on operator feedback and a question set collected from production usage, graded by human and automated panels. The system proves effective at operational tasks, resolving real-world queries posed by CMS operators. We also observe that locally-hosted, open-weight models perform competitively, enabling fully private management of sensitive data.

2606.04754 2026-06-04 cs.LG

Beyond Structural Symmetries: Linear Mode Connectivity via Neuron Identifiability

超越结构对称性:通过神经元可辨识性实现线性模式连通性

Vincent Bürgin, Daniel Herbst, Ya-Wei Eileen Lin, Stefanie Jegelka

AI总结 本文通过提出有效函数类理论框架并形式化神经元可辨识性,揭示了神经网络中即使结构不对称也存在大量近似等价解,并展示了神经元可辨识性如何无需先验对齐即可实现表示合并及线性低损失路径。

详情
Comments
Accepted at ICML 2026
AI中文摘要

深度学习中的许多显著现象,如线性模式连通性和训练动力学的结构化行为,都与参数对称性密切相关:即保持实现函数不变的变换。尽管参数对称性日益受到关注,但参数、数据和表示之间的确切相互作用仍未得到充分探索。为了研究这一点,我们开发了一个有效函数类的理论框架,即神经元在其输入支持上可以实现的函数集以及实现它们的范数代价。然后,我们通过跨独立训练运行的神经元可辨识性来形式化有效对称性破缺。我们的分析表明,即使在结构不对称的模型中,神经网络也可以容纳大量近似等价的解族。我们进一步证明,神经元可辨识性使得无需先验对齐即可进行表示合并,并刻画了这种合并何时允许线性低损失路径。这些发现强调了有效函数类在影响损失景观中的作用。

英文摘要

Many striking phenomena in deep learning, such as linear mode connectivity and the structured behavior of training dynamics, are closely tied to parameter symmetries: transformations that leave the realized function unchanged. Despite growing attention to parameter symmetries, the exact interplay between parameters, data, and representations remains underexplored. To investigate this, we develop a theoretical framework of effective function classes, i.e., the set of functions a neuron can realize on its input support, and the norm cost of realizing them. We then formalize effective symmetry breaking via neuron identifiability across independent training runs. Our analysis shows that neural networks can admit large families of approximately equivalent solutions even in structurally asymmetric models. We further show that neuron identifiability enables representation merging without prior alignment, and characterize when such merging admits a linear low-loss path. These findings highlight the role of effective function classes in affecting the loss landscape.

2606.04751 2026-06-04 cs.AI

FALSIFYBENCH: Evaluating Inductive Reasoning in LLMs with Rule Discovery Games

FALSIFYBENCH: 通过规则发现游戏评估大语言模型中的归纳推理

Leonardo Bertolazzi, Katya Tentori, Raffaella Bernardi

AI总结 提出FALSIFYBENCH框架,基于Wason 2-4-6任务评估LLM在假设生成、证据收集和信念修正方面的归纳推理能力,发现推理模型优于指令微调模型,且主动寻求证伪的负测试策略是成功的关键。

详情
AI中文摘要

大型语言模型(LLM)越来越多地被部署为科学任务中的自主智能体。然而,这些系统能否有效参与与科学发现相关的归纳推理形式仍是一个开放问题。在这项工作中,我们引入了FALSIFYBENCH,一个受经典Wason 2-4-6任务启发的假设驱动推理评估框架,其中智能体必须通过迭代提出示例并接收反馈来发现隐藏的语义属性。该任务捕捉了科学推理的关键要素:假设生成、证据收集以及根据确认和证伪证据进行信念修正。我们对跨模型家族和规模的12个LLM的评估表明,推理模型通常比指令微调模型更强的科学推理者,尽管没有模型接近最优性能。成功的主要驱动因素是负测试的能力:主动寻求证伪其假设的模型始终优于主要寻求确认的模型。此外,先前工作中被忽略的细粒度回合级分析揭示,失败与模型在假设空间中导航的可识别模式相关。

英文摘要

Large language models (LLMs) are increasingly deployed as autonomous agents in scientific tasks. Yet whether these systems can effectively engage in forms of inductive reasoning relevant to scientific discovery remains an open question. In this work, we introduce FALSIFYBENCH, an evaluation framework for hypothesis-driven reasoning inspired by the classic Wason 2-4-6 task, in which agents must discover hidden semantic properties by iteratively proposing examples and receiving feedback. This task captures key elements of scientific reasoning: hypothesis generation, evidence gathering, and belief revision in response to both confirming and disconfirming evidence. Our evaluation of 12 LLMs across model families and scales shows that reasoning models are generally stronger scientific reasoners than instruction-tuned models, although no model comes close to optimal performance. The primary driver of success is the capacity for negative testing: models that actively seek to falsify their hypotheses consistently outperform those that primarily seek confirmation. Moreover, a fine-grained turn-level analysis, neglected in previous work, reveals that failure is tied to identifiable patterns in how models navigate the hypothesis space.

2606.04750 2026-06-04 cs.AI cs.CY cs.LG

Fog of Love: Engineering Virtuous Agent Behavior with Affinity-based Reinforcement Learning in a Game Environment

Fog of Love: 基于亲和力强化学习在游戏环境中塑造道德智能体行为

Ajay Vishwanath, Christian Omlin

AI总结 本文提出基于亲和力的强化学习方法,通过策略正则化在多智能体角色扮演游戏Fog of Love中同时实现竞争与合作目标,并提升智能体行为的可解释性。

详情
AI中文摘要

在人工智能中注入道德行为越来越受到关注。其中一种提出的技术是基于亲和力的强化学习,它通过对目标函数进行策略正则化来激励道德行为,而不完全依赖于奖励函数设计。迄今为止,该技术已在状态和动作空间最小的网格世界和玩具问题环境中证明有效。为了将这项研究扩展到更复杂的环境,我们引入了一个基于角色扮演棋盘游戏Fog of Love的双人多智能体环境。在该环境中,两个智能体竞争以实现各自的道德目标,同时合作以维持他们的关系。鉴于多智能体性质,这是一个复杂问题,其中多智能体深度确定性策略梯度智能体既不能成功竞争也不能成功合作。我们提供的证据表明,局部亲和力增强了智能体在实现竞争和合作目标方面的性能,从而在两个领域都获得了更高的总体得分。这不仅产生了道德选择,还阐明了智能体的目的论,并使其行为达到人类水平的可解释性。

英文摘要

Instilling virtuous behavior in artificial intelligence has seen increasing interest. One of the techniques proposed is known as affinity-based reinforcement learning, which uses policy regularization on the objective function to incentivize virtuous actions without being fully dependent on the reward function design. Thus far, this technique has been demonstrated to be effective in grid worlds and toy-problem environments with minimal state and action spaces. To expand this research to more sophisticated environments, we introduce a two-player multi-agent environment based on the role-playing board game known as Fog of Love. In this environment, two agents compete to fulfill their individual virtues, while also cooperating to satisfy their relationship. Given the multi-agent nature, this is a complex problem where multi-agent deep deterministic policy gradient agents neither compete nor cooperate successfully. We present evidence that localized affinities enhance agent performance in achieving both competitive and cooperative objectives, resulting from superior overall scores in both domains. This not only results in virtuous choices but also clarifies an agent's teleology and makes its behavior human-level interpretable.

2606.04749 2026-06-04 cs.RO cs.LG

COP-Q: Safety-First Reinforcement Learning for Robot Control via Cholesky-Ordered Projection

COP-Q:基于Cholesky有序投影的安全优先强化学习机器人控制

Guopeng Li, Moritz A. Zanger, Matthijs T. J. Spaan, Julian F. P. Kooij

AI总结 提出COP-Q方法,通过Cholesky分解编码目标优先级并利用联合Q值空间的广义置信界,在安全优先的离线策略强化学习中平衡安全与奖励目标,减少过度保守性,提升样本效率。

详情
Comments
7 pages, 6 figures, 2 tables
AI中文摘要

安全机器人控制需要在满足安全约束的同时最大化回报。在离线策略安全强化学习中,奖励和安全Q值通常由独立的评论家集成学习,每个目标的不确定性独立处理。这种按目标处理的方式忽略了目标间的相关性,可能导致过于保守的价值估计,从而降低样本效率。为解决此问题,我们提出Cholesky有序投影Q学习(COP-Q),一种安全优先的方法,将目标间协方差纳入向量值Q值估计中。COP-Q在联合Q值空间中构建广义置信界,并使用Cholesky分解以顺序形式编码目标优先级。这在对安全目标保持保守性的同时,自适应地减少对奖励目标的过度保守性。得到的估计同时用于时序差分目标计算和演员优化。COP-Q引入最小的计算开销,并且与大多数现有深度Q学习框架兼容。在Brax中的机器人运动和安全健身房中的安全导航实验(涵盖硬安全和软安全设置)表明,与代表性基线相比,COP-Q实现了强大的安全性能以及有竞争力或更高的样本效率。

英文摘要

Safe robot control requires maximizing return while satisfying safety constraints. In off-policy safe reinforcement learning, reward and safety Q-values are commonly learned by separate critic ensembles, with uncertainty handled independently for each objective. This objective-wise treatment neglects inter-objective correlation and can lead to overly conservative value estimates, thereby reducing sample efficiency. To address this issue, we propose Cholesky-Ordered Projection Q-learning (COP-Q), a safety-first method that incorporates inter-objective covariance into vector-valued Q-value estimation. COP-Q constructs a generalized confidence bound in the joint Q-value space and uses Cholesky factorization to encode objective priority in a sequential form. This preserves conservatism on safety while adaptively reducing excessive conservatism on the reward objective. The resulting estimate is used in both temporal-difference target computation and actor optimization. COP-Q incurs minimal computational overhead and is readily compatible with most existing deep Q-learning frameworks. Experiments on robot locomotion in Brax and safe navigation in Safety-Gymnasium, covering both hard- and soft-safety settings, demonstrate that COP-Q achieves strong safety performance together with competitive or improved sample efficiency relative to representative baselines.