arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.16565 2026-05-20 cs.AI cs.OS

Skim: Speculative Execution for Fast and Efficient Web Agents

Skim：用于快速和高效网络代理的推测执行

Mike Wong, Kevin Hsieh, Suman Nath, Ravi Netravali

发表机构 * Princeton University（普林斯顿大学）； Microsoft Research（微软研究院）

AI总结 Skim通过利用专门构建网站的可预测结构，提出了一种推测执行框架，以降低网络代理的任务成本和延迟，同时保持准确性。

Comments 14 pages, 21 figures

详情

AI中文摘要

Skim是一种用于网络代理的推测执行框架，利用专门构建网站的可预测结构。当今网络代理的开销并非任务本身固有，而是由代理的组合方式决定：前沿模型推断、浏览器渲染和ReAct风格的规划被应用于每个任务的每一步，无论复杂度如何。Skim的关键观察是，网站在相同类型的查询中强制执行稳定的URL模式、答案格式和任务到轨迹的映射，因此大多数查询可以完全绕过这些重型组件。离线分析器在每个网站上捕获这些模式一次。在运行时，Skim将每个查询匹配到模板，合成目标URL，并使用小型模型提取答案。一个轻量级验证器将每个快速路径输出与查询和模式进行比对；罕见的不准确会级联到完整的代理，但通过快速路径的最终URL进行预热，以保持上游轨迹进度。在标准网络代理基准测试中，结合三个主干代理（WebVoyager、AgentOccam、BrowserUse），Skim将任务的中位成本降低了1.9倍，延迟减少了33.4%，且没有精度损失。

英文摘要

Skim is a speculative execution framework for web agents that exploits the predictable structure of purpose-built websites. Today's web-agent expense is not intrinsic to the tasks but a property of how agents are composed: frontier-model inference, browser rendering, and ReAct-style planning are applied to every step of every task regardless of complexity. Skim's key observation is that websites enforce stable URL patterns, answer formats, and task-to-trajectory mappings across queries of the same type, so most queries can bypass these heavyweight components entirely. An offline profiler captures these patterns once per site. At runtime, Skim matches each query to a template, synthesizes the destination URL, and extracts the answer with a small model. A lightweight verifier gates each fast-path output against the query and schema; rare misspeculations cascade to the full agent, warm-started by the fast path's final URL to preserve upstream trajectory progress. Across standard web-agent benchmarks paired with three backboneagents (WebVoyager, AgentOccam, BrowserUse), Skim reduces median per-task cost by 1.9x and latency by 33.4% with no accuracy loss.

URL PDF HTML ☆

赞 0 踩 0

2605.16447 2026-05-20 cs.LG cs.AI

Nested Spatio-Temporal Time Series Forecasting

嵌套时空时间序列预测

Yinghao Ai, Yukai Zhou, Ruoxi Jiang, Junyi An, Chao Qu, Zhijian Zhou, Shiyu Wang, Fenglei Cao, Zenglin Xu, Furao Shen, Yuan Qi

发表机构 * Fudan University, Shanghai（复旦大学）； Department of Computer Science and Technology, Nanjing University（南京大学计算机科学与技术系）； ByteDance（字节跳动）

AI总结本文提出了一种嵌套预测框架，通过结合未来宏观区域趋势与微观历史观测，实现了精细化预测，并通过谱聚类方法构建语义连贯的区域，有效过滤系统性噪声并保留关键趋势，实验表明该方法在多个高维数据集上优于现有最先进基线。

Comments Accept by ICML 2026

详情

AI中文摘要

时空预测对于现实应用如交通管理至关重要，但在噪声和非平稳条件下捕捉可靠交互仍具挑战性。现有方法主要依赖历史空间先验，往往无法考虑演化的时空相关性并产生系统性误差。在本文中，我们提出了一种嵌套预测框架，将未来宏观区域趋势与微观历史观测相结合，使模型能够从抽象的未来表示中获得自上而下的指导以实现精细化预测。具体而言，我们采用基于谱聚类的方法构建语义连贯的区域，提供了理论和经验证据表明这种表示能有效过滤系统性噪声并保留关键趋势。在此基础上，我们开发了一种逐步由粗到细的预测器，将这些代表性特征整合到推理过程中。这使模型能够利用趋势预测来提前预测动态异常，如周期性偏移。此外，对多个高维数据集的广泛实验表明，我们的方法在多个高维数据集上始终优于现有最先进基线，验证了未来宏观指导的嵌套预测的有效性。

英文摘要

Spatiotemporal forecasting is critical for real-world applications like traffic management, yet capturing reliable interactions remains challenging under noisy and non-stationary conditions. Existing methods primarily rely on historical spatial priors, often failing to account for evolving temporal correlations and suffering from systematic errors. In this work, we propose a nested forecasting framework that couples future macro-level regional trends with micro-level historical observations, enabling top-down guidance from abstract future representations for fine-grained forecasting. Specifically, we employ a spectral clustering-based approach to construct semantically coherent regions, providing both theoretical and empirical evidence that this representation effectively filters systematic noise while preserving essential trends. Building on this, we develop a progressive coarse-to-fine predictor to integrate these representative features into the inference process. This enables the model to leverage trend predictions to anticipate dynamic anomalies, such as periodic offsets, in advance. Furthermore, extensive experiments on multiple high-dimensional datasets demonstrate that our method consistently outperforms state-of-the-art baselines, validating the effectiveness of future macro-guided nested forecasting.

URL PDF HTML ☆

赞 0 踩 0

2605.16170 2026-05-20 cs.LG

BAPR: Bayesian amnesic piecewise-robust reinforcement learning for non-stationary continuous control

BAPR: 基于贝叶斯遗忘的分段鲁棒强化学习用于非平稳连续控制

Yifan Zhang, Liang Zheng

发表机构 * Central South University（中南大学）

AI总结该研究提出BAPR方法，结合贝叶斯在线变化检测与鲁棒集合强化学习，解决非平稳连续控制中的鲁棒性与适应性问题，通过形式化验证确保算法稳定性与收敛性。

详情

AI中文摘要

现实中的控制系统经常在分段平稳条件下运行，其中动态在较长时期内保持稳定，随后经历 abrupt 的 regime 变化。标准鲁棒强化学习方法面临根本性困境：全局保守策略在稳定时期浪费性能，而局部适应策略在未检测到 regime 变化时风险崩溃。我们提出 BAPR（贝叶斯遗忘分段鲁棒 SAC），将贝叶斯在线变化检测（BOCD）与鲁棒集合强化学习统一。BAPR 操作符——一种加权由模式条件贝尔曼操作符和冻结信念分布构成的凸组合——是一个 γ-收缩。一个互补的反例，在 Lean~4 中机验证，建立了明确的边界：当信念依赖于 Q 函数时，收缩因子变为 γ + λΔ（其中 Δ 是模式奖励差），且收缩失败恰好当 γ + λΔ ≥ 1。我们推导了抽象操作符的组件式形式化误差预算——每个组件机验证，限制了切换后的恢复；预算适用于抽象模式混合操作符，并通过冻结参数设计直觉继承到实现的共享批评者算法。所有结果均通过形式化验证，无 sorry（1,145 行，3 个 Lean~4 文件，22 个机验证定理）。BOCD 驱动了适应性保守机制：在检测到变化点后，策略变得最保守，并随着信心增长而平滑放松，检测延迟为 O(log(1/δ))。一个通过 RMDM 损失训练的上下文条件模块，从模拟器提供的模式 ID 提取模式感知表示，在训练时和部署时均无需模式标签。

英文摘要

Real-world control systems frequently operate under \emph{piecewise stationary} conditions, where dynamics remain stable for extended periods before undergoing abrupt regime changes. Standard robust RL methods face a fundamental dilemma: a globally conservative policy wastes performance during stable periods, while a locally adaptive policy risks catastrophic failure when the regime changes undetected. We propose \textbf{BAPR} (Bayesian Amnesic Piecewise-Robust SAC), which unifies Bayesian Online Change Detection (BOCD) with robust ensemble RL. The BAPR operator -- a convex combination of mode-conditional Bellman operators weighted by a frozen belief distribution -- is a $γ$-contraction. A complementary counterexample, machine-verified in Lean~4, establishes a \emph{sharp boundary}: when beliefs depend on the Q-function, the contraction factor becomes $γ+ λΔ$ (where $Δ$ is the mode reward gap), and contraction fails exactly when $γ+ λΔ\geq 1$. We derive a \emph{component-wise} formal error budget for the abstract operator -- every component machine-verified -- bounding post-switch recovery; the budget applies to the abstract mode-mixture operator and inherits to the implemented shared-critic algorithm only through the frozen-parameter design intuition. All results are formally verified with no \texttt{sorry} (1,145 lines across 3 Lean~4 files, 22 machine-verified theorems). BOCD drives an adaptive conservatism mechanism: the policy becomes maximally conservative after detected change-points and smoothly relaxes as confidence grows, with detection delay $O(\log(1/δ))$. A context-conditioning module trained via RMDM loss provides mode-aware representations from simulator-provided mode IDs at training time and requires no mode labels at deployment.

URL PDF HTML ☆

赞 0 踩 0

2605.16137 2026-05-20 cs.CV cs.RO

STABLE: Simulation-Ready Tabletop Layout Generation via a Semantics-Physics Dual System

STABLE: 通过语义-物理双系统生成仿真准备的桌面布局

Zhen Luo, Yixuan Yang, Xudong Xu, Jinkun Hao, Zhaoyang Lyu, Feng Zheng, Jiangmiao Pang, Yanwei Fu

发表机构 * Shanghai AI Laboratory（上海人工智能实验室）

AI总结本文提出STABLE，一种通过语义-物理双系统生成仿真准备的桌面布局的方法，通过语义推理模块生成粗略布局，物理校正模块校正布局以确保物理合理性，从而提升场景的物理有效性。

Comments ICML 2026

详情

AI中文摘要

从任务指令生成仿真准备的桌面场景是嵌入式人工智能领域引人入胜且有前景的研究方向。然而，现有任务到场景生成方法仅依赖大型语言模型（LLMs）预测场景布局，不可避免地导致物体碰撞或漂浮，因为LLMs在三维空间推理方面存在固有局限性。在本文中，我们提出了STABLE，一种专为仿真准备的桌面场景生成设计的语义-物理双系统。STABLE由两个互补模块组成：（i）语义推理器，一个在结构化桌面场景数据集上微调的LLM，用于从输入任务指令生成粗略布局；（ii）物理校正器，一个具有物理意识的基于流的去噪模型，输出姿态更新以校正布局，从而确保场景的物理合理性，同时保持与任务指令的语义一致性。STABLE采用渐进生成范式：通过交替使用语义推理器和物理校正器，它逐步从任务关键对象扩展到背景对象。实验表明，STABLE成功生成严格符合任务指令的仿真准备的桌面场景，并显著提高了场景的物理有效性。

英文摘要

Generating simulation-ready tabletop scenes from task instructions is an intriguing and promising research direction in the field of Embodied AI. However, existing task-to-scene generation methods rely exclusively on large language models (LLMs) to predict scene layouts, inevitably yielding object collisions or floating due to LLMs' inherent limitations in 3D spatial reasoning. In this paper, we present STABLE, a semantics-physics dual-system tailored for simulation-ready tabletop scene generation. STABLE consists of two complementary modules: (i) a Semantic Reasoner, a fine-tuned LLM trained on a structured tabletop scene dataset to generate coarse layouts from input task instructions, and (ii) a Physics Corrector, a physics-aware flow-based denoising model that outputs pose updates to refine layouts, which ensures the physical plausibility of scenes while preserves semantic alignment with task instructions. STABLE adopts a progressive generation paradigm: by alternating between the Semantic Reasoner and Physics Corrector, it incrementally expands the scene from task-critical objects to background objects. Experiments demonstrate that STABLE successfully generates simulation-ready tabletop scenes that strictly conform to task instructions and significantly enhances the physical validity of scenes over prior art.

URL PDF HTML ☆

赞 0 踩 0

2605.15599 2026-05-20 cs.CV cs.AI

Pretraining Objective Matters in Extreme Low-Data FGVC: A Backbone-Controlled Study

预训练目标在极低数据细粒度视觉分类中的影响：一个骨干网络控制研究

Alexander Hackett, Srikanth Thudumu, Ginny Fisher, Jason Fisher

发表机构 * Santa Clara University（圣克拉拉大学）； IAAIR

AI总结本文研究了在极低数据细粒度视觉分类中预训练目标对下游表示质量的影响，通过比较四种冻结的ViT-B/16编码器，得出了在数据稀缺时优先选择边界增强预训练目标的结论。

Comments Presented at the 13th Workshop on Fine-Grained Visual Categorization (FGVC13) at CVPR 2026

Journal ref 13th Workshop on Fine-Grained Visual Categorization (FGVC13), CVPR 2026

详情

AI中文摘要

极端低数据细粒度分类在专家领域中普遍存在，其中标注成本高昂，但从业者仍需要有原则的指导来选择预训练编码器。我们使用一个定制的数据集，包含三个类别的标注图像，研究了在匹配的骨干容量下，预训练目标如何影响下游表示质量。我们比较了四种冻结的ViT-B/16编码器，分别通过监督分类、对比学习（SigLIP2）、掩码重建（MAE）和自蒸馏（DINOv3）进行训练，并使用留一验证法通过线性和非线性探测器评估。为了控制低N情况下的统计噪声，我们使用排列检验（N=1000）在宏级一对多AUC上进行测试。监督和对比学习编码器在线性可分性方面表现最强（逻辑AUC：0.768和0.735；SVM AUC：0.739和0.697），而MAE在非线性探测器下表现更优（XGBoost AUC：0.713）。我们发现DINOv3在该领域整体表现较差。这些结果支持在极低数据细粒度视觉分类中的一种实用建议：当数据稀缺限制探测到线性决策规则时，优先选择边界增强预训练目标；当非线性分类器可行时，考虑使用重建式编码器。

英文摘要

Extreme low-data fine-grained classification is common in expert domains where labeling is expensive, yet practitioners still need principled guidance for selecting pretrained encoders. We study emerald inclusion grading with a custom dataset of labeled images across three classes and ask: under matched backbone capacity, how does pretraining objective affect downstream representation quality? We compare four frozen ViT-B/16 encoders trained with supervised classification, contrastive learning (SigLIP2), masked reconstruction (MAE), and self-distillation (DINOv3), and evaluate them with leave-one-out cross-validation using linear and nonlinear probes. To control statistical noise in the low-N regime, we use permutation testing (N=1000) on macro one-vs-rest AUC. Supervised and contrastive encoders provide the strongest linear separability (logistic AUC: 0.768 and 0.735; SVM AUC: 0.739 and 0.697), while MAE improves under nonlinear probes (XGBoost AUC: 0.713). We find that DINOv3 underperforms across probe families in this domain. These results support a practical recommendation for extreme low-data FGVC: prioritize margin-enforcing pretraining objectives when data scarcity restricts probing to linear decision rules, and consider reconstruction-style encoders when nonlinear classifiers are feasible given dataset constraints.

URL PDF HTML ☆

赞 0 踩 0

2605.15532 2026-05-20 cs.LG cs.AI cs.CL

DeltaPrompts: Escaping the Zero-Delta Trap in Multimodal Distillation

DeltaPrompts: 逃离多模态蒸馏中的零delta陷阱

Jaehun Jung, Hyunwoo Kim, Brandon Cui, Ximing Lu, David Acuna, Prithviraj Ammanabrolu, Yejin Choi

发表机构 * NVIDIA Research（NVIDIA研究院）

AI总结本文提出DeltaPrompts，通过量化教师与学生之间的答案分歧（Δ）来生成高分歧的推理问题，从而解决传统蒸馏中因零delta提示导致的学习信号不足问题，实验表明DeltaPrompts在多个场景下显著提升了模型性能。

详情

AI中文摘要

蒸馏使紧凑的视觉-语言模型（VLMs）能够获得强大的推理能力，但驱动这一过程的提示通常通过简单的启发法或从现成数据集中聚合获得。我们揭示了这种方法中的关键低效性：标准图表/文档推理数据集中多达69%的提示实际上是零delta，意味着教师和学生已经诱导出完全相同的答案分布。在这些提示上训练提供极小的学习信号，导致学生性能在数据规模扩大时迅速饱和。为逃离零delta陷阱，我们回归基本原理：蒸馏本质上最小化了分布差异，因此只有暴露教师与学生之间功能性能力差距的提示才具有价值。我们通过答案分歧（Δ）量化这一差距，证明非零分歧对有效扩展至关重要。基于这一洞察，我们提出一个分阶段合成流程，利用现有数据集作为种子，主动针对学生失败模式生成更好的提示。结果是DeltaPrompts，一个包含20万 synthetic 高分歧推理问题的多样化数据集。我们评估DeltaPrompts在三个不同场景下的表现：在目标教师-学生对上的在线蒸馏、转移到新型模型家族而不重新生成数据、以及非推理模型的离线微调。在所有场景中，DeltaPrompts均带来显著收益，即使在高度优化的推理模型（如Qwen3-VL-8B-Thinking）上，也能在10个基准测试中平均获得高达15%的相对提升。

英文摘要

Distillation enables compact Vision-Language Models (VLMs) to obtain strong reasoning capabilities, yet the prompts driving this process are typically chosen via simple heuristics or aggregated from off-the-shelf datasets. We reveal a critical inefficiency in this approach: up to 69% of the prompts in standard chart / document reasoning datasets are effectively zero-delta, meaning the teacher and student already induce the exact same answer distribution. Training on these prompts provides minimal learning signal, causing student improvement to rapidly saturate regardless of data scale. To escape the zero-delta trap, we return to first principles: distillation fundamentally minimizes distributional divergence, and thus a prompt is valuable only if it exposes a functional capability gap between the teacher and student. We quantify this gap through answer divergence ($Δ$), demonstrating that non-zero divergence is critical for effective scaling. Building on this insight, we propose a staged synthesis pipeline that repurposes existing datasets as seeds, actively targeting student failure modes to produce better prompts. The result is DeltaPrompts, a diverse dataset of 200k synthetic, high-divergence reasoning problems. We evaluate DeltaPrompts across three distinct settings: on-policy distillation with the target teacher-student pair, transfer to a novel model family without regenerating the data, and off-policy fine-tuning of a non-reasoning model. Across all scenarios, DeltaPrompts drives substantial gains, yielding up to 15% relative improvement even on top of a highly-optimized reasoning model (e.g., Qwen3-VL-8B-Thinking) -- averaged over 10 benchmarks spanning chart, document and perception-centric reasoning.

URL PDF HTML ☆

赞 0 踩 0

2605.14588 2026-05-20 cs.LG

Silent Collapse in Recursive Learning Systems

递归学习系统中的沉默崩溃

Zhipeng Zhang

发表机构 * China Mobile Research Institute（中国移动研究院）； China Mobile GBA (Greater Bay Area) Innovation Institute（中国移动大湾区创新研究院）

AI总结本文研究了递归学习系统中模型内部分布逐渐退化的现象，提出MTR框架通过监测轨迹统计量和调整学习强度来提前预警并防止沉默崩溃。

详情

AI中文摘要

递归学习——即模型在由自身先前版本生成的数据上进行训练——在大型语言模型、自主代理和自监督系统中日益常见。然而，标准性能度量（损失、困惑度、准确率）往往无法在不可逆退化发生前检测到内部退化。本文识别出一种现象，我们称之为沉默崩溃：在广泛递归条件下，模型内部分布（预测熵、表征多样性、尾部覆盖）即使在传统度量看似稳定或改进时也会逐渐收缩。我们发现沉默崩溃并非 abrupt，其发生前总是可靠地由三个轨迹级前兆预示：（1）锚点熵的收缩，（2）表征漂移的冻结，（3）尾部覆盖的侵蚀。这些信号在任何传统验证度量退化之前多代出现，从而实现早期预警。基于这些前兆，我们提出了MTR（监控-信任-调节器）框架，一个轻量级的元认知循环，通过监测轨迹统计量、估计慢时间尺度的信任变量，并自适应调节有效学习强度。MTR在不需访问原始干净数据的情况下提供早期预警并主动防止沉默崩溃，这是当原始数据不可用、受污染或私有时的关键优势。

英文摘要

Recursive learning -- where models are trained on data generated by previous versions of themselves -- is increasingly common in large language models, autonomous agents, and self-supervised systems. However, standard performance metrics (loss, perplexity, accuracy) often fail to detect internal degradation before it becomes irreversible. Here we identify a phenomenon we call silent collapse: under broad recursive conditions, model internal distributions -- predictive entropy, representational diversity, and tail coverage -- progressively contract even as conventional metrics appear stable or improving. We discover that silent collapse is not abrupt. Its onset is reliably preceded by three trajectory-level precursors: (1) contraction of anchor entropy, (2) freezing of representation drift, and (3) erosion of tail coverage. These signals manifest multiple generations before any degradation in standard validation metrics, enabling early warning. Based on these precursors, we propose the MTR (Monitor--Trust--Regulator) framework, a lightweight metacognitive loop that monitors trajectory statistics, estimates a slow-timescale trust variable, and adaptively modulates the effective learning intensity. MTR provides early warning and actively prevents silent collapse without requiring access to pristine real data -- a critical advantage when original data is unavailable, contaminated, or private.

URL PDF HTML ☆

赞 0 踩 0

2605.14048 2026-05-20 cs.AI cs.LG

Network-Aware Bilinear Tokenization for Brain Functional Connectivity Representation Learning

面向网络的双线性分块化用于脑功能连接表示学习

Leo Milecki, Qingyu Hu, Bahram Jafrasteh, Mert R. Sabuncu, Qingyu Zhao

发表机构 * Department of Radiology, Weill Cornell Medicine, New York, NY, USA.（韦尔·科恩医学中心放射科, 纽约, NY, 美国）； School of Electrical and Computer Engineering, Cornell University and Cornell Tech, New York, NY, USA.（康奈尔大学电气与计算机工程学院及康奈尔科技, 纽约, NY, 美国）

AI总结本文提出了一种面向网络的双线性分块化方法，用于改进脑功能连接的表示学习，通过重新定义功能连接的分块方式，提升模型在跨群体评估中的稳定性和可迁移性。

Comments Author-submitted version, provisionally accepted at MICCAI 2026

详情

AI中文摘要

Masked autoencoders (MAEs) 近年来在静息状态脑功能连接（FC）的自监督表示学习中显示出潜力。然而，一个基本问题仍未解决：如何对FC矩阵进行分块以与大规模脑网络的内在模块化组织对齐？现有方法通常采用以区域为中心或图基的方案，将FC视为结构上均质的元素，并忽略了大规模脑网络的组织结构。我们引入NERVE（通过双线性分块化进行脑功能连接的网络感知表示学习），一种自监督学习框架，通过将FC矩阵划分为内网络和跨网络连接块来重新定义FC分块。与基于图像的MAE不同，由网络对定义的FC分块在大小上异质且对应不同的功能角色。为了解决这个问题，NERVE通过一种新的结构化双线性分解来嵌入FC分块。这种形式保留了网络身份，并将参数复杂度从网络数量的二次方减少到线性。我们评估了NERVE在三个大规模发展队列（ABCD、PNC和CCNP）中对行为和精神病理学的预测。与结构上不敏感的MAE变体和基于图的自监督基线相比，所提出的网络感知形式在跨队列评估中产生了更稳定和可迁移的表示。消融研究确认了所提出的双线性网络嵌入和解剖学基础的分区对于性能至关重要。这些发现突显了在功能连接组学中将领域特定的结构先验纳入自监督学习的重要性。代码可在：https://github.com/leomlck/NERVE。

英文摘要

Masked autoencoders (MAEs) have recently shown promise for self-supervised representation learning of resting-state brain functional connectivity (FC). However, a fundamental question remains unresolved: how should FC matrices be tokenized to align with the intrinsic modular organization of large-scale brain networks? Existing approaches typically adopt region-centric or graph-based schemes that treat FC as structurally homogeneous elements and overlook the large-scale network brain organization. We introduce NERVE (Network-Aware Representations of Brain Functional Connectivity via Bilinear Tokenization), a self-supervised learning framework that redefines FC tokenization by partitioning FC matrices into patches of intra- and inter-network connectivity blocks. Unlike image-based MAE, where fixed-size patches share a common tokenizer, FC patches defined by network pairs are heterogeneous in size and correspond to distinct functional roles. To resolve this problem, NERVE embeds FC patches through a novel structured bilinear factorization. This formulation preserves network identity and reduces parameter complexity from quadratic to linear scaling in the number of networks. We evaluate NERVE across three large-scale developmental cohorts (ABCD, PNC, and CCNP) for behavior and psychopathology prediction. Compared to structurally agnostic MAE variants and graph-based self-supervised baselines, the proposed network-aware formulation yields more stable and transferable representations, particularly in cross-cohort evaluation. Ablation studies confirm that the proposed bilinear network embedding and anatomically grounded parcellation are critical for performance. These findings highlight the importance of incorporating domain-specific structural priors into self-supervised learning for functional connectomics. Code is available at: https://github.com/leomlck/NERVE.

URL PDF HTML ☆

赞 0 踩 0

2605.14014 2026-05-20 cs.LG cs.AI

Dywave: Event-Aligned Dynamic Tokenization for Heterogeneous IoT Sensing Signals

Dywave: 为异构物联网传感信号设计的事件对齐动态分词方法

Tomoyoshi Kimura, Denizhan Kara, Jinyang Li, Hongjue Zhao, Yigong Hu, Yizhuo Chen, Xiaomin Ouyang, Shengzhong Liu, Tarek Abdelzaher

发表机构 * University of Illinois Urbana-Champaign（伊利诺伊大学厄巴纳-香槟分校）； Hong Kong University of Science（香港科学大学）； Shanghai Jiao Tong University（上海交通大学）

AI总结本文提出Dywave，一种用于异构物联网传感信号的动态分词框架，通过小波基层次分解构建紧凑的输入表示，以适应内在时间结构和底层物理事件，从而在活动识别、压力评估和附近物体检测等任务中提升准确率并提高计算效率。

详情

AI中文摘要

物联网系统持续收集来自无处不在传感器的异构传感信号，以支持智能应用，如人类活动分析、情绪监测和环境感知。这些信号本质上是非平稳和多尺度的，给标准分词技术带来了独特挑战。本文提出Dywave，一种为物联网传感信号设计的动态分词框架，该框架构建了与内在时间结构和底层物理事件对齐的紧凑输入表示。Dywave利用基于小波的层次分解，识别出对应底层语义事件的时间边界，并自适应地压缩冗余区间，同时保持时间一致性。在五个真实物联网传感数据集上进行的广泛评估表明，Dywave在活动识别、压力评估和附近物体检测等任务中，比最先进的方法在准确率上提高了高达12%，同时通过减少输入标记长度最多75%来提高计算效率。此外，Dywave在面对领域偏移和变化的序列长度时表现出更强的鲁棒性。

英文摘要

Internet of Things (IoT) systems continuously collect heterogeneous sensing signals from ubiquitous sensors to support intelligent applications such as human activity analysis, emotion monitoring, and environmental perception. These signals are inherently non-stationary and multi-scale, posing unique challenges for standard tokenization techniques. This paper proposes Dywave, a dynamic tokenization framework for IoT sensing signals that constructs compact input representations aligned with intrinsic temporal structures and underlying physical events. Dywave leverages wavelet-based hierarchical decomposition, identifies meaningful temporal boundaries corresponding to underlying semantic events, and adaptively compresses redundant intervals while preserving temporal coherence. Extensive evaluations on five real-world IoT sensing datasets across activity recognition, stress assessment, and nearby object detection demonstrate that Dywave outperforms state-of-the-art methods by up to 12% in accuracy, while improving computational efficiency by reducing input token lengths by up to 75% across mainstream sequence models. Moreover, Dywave exhibits improved robustness to domain shifts and varying sequence lengths.

URL PDF HTML ☆

赞 0 踩 0

2605.13793 2026-05-20 cs.CL

An LLM-Based System for Argument Mining

基于LLM的论证挖掘系统

Paulo Pirozelli, Victor Hugo Nascimento Rocha, Fabio G. Cozman, Douglas Aldred

发表机构 * Universidade de São Paulo Center for Artificial Intelligence (C4AI)（圣保罗大学人工智能中心（C4AI））； Instituto Mauá de Tecnologia Núcleo de Sistemas Eletrônicos Embarcados (NSEE)（马乌阿技术研究所电子系统嵌入核（NSEE））

AI总结本文提出一个基于大语言模型的端到端系统，用于从自然语言文本中提取论证并构建抽象论证图，通过多阶段流程识别论证组件、选择相关元素并揭示其逻辑关系，实验表明该系统能有效恢复论证结构并在不同标注方案下表现良好。

详情

AI中文摘要

论证是人类推理中的基本方面，其中主张被支持、挑战并相互比较。我们提出一个端到端的大语言模型（LLM）基于系统，用于将自然语言文本中的论证重建为抽象论证图。该系统遵循一个多阶段流程，逐步识别论证性组件、选择相关元素并揭示它们的逻辑关系。这些元素表示为由两种组件类型（前提和结论）和三种关系类型（支持、攻击和削弱）组成的有向无环图。我们进行了两项互补的实验来评估该系统。首先，我们在论证理论教科书中的论证上进行手动评估，以评估系统恢复论证结构的能力。其次，我们在基准数据集上进行定量评估，通过将我们的输出映射到已建立的标注方案来与先前工作进行比较。结果表明，该系统能够充分恢复论证结构，并且在适应不同标注方案时，在基准数据集上取得合理表现。这些发现突显了基于LLM的流程在可扩展论证挖掘中的潜力。

英文摘要

Arguments are a fundamental aspect of human reasoning, in which claims are supported, challenged, and weighed against one another. We present an end-to-end large language model (LLM)-based system for reconstructing arguments from natural language text into abstract argument graphs. The system follows a multi-stage pipeline that progressively identifies argumentative components, selects relevant elements, and uncovers their logical relations. These elements are represented as directed acyclic graphs consisting of two component types (premises and conclusions) and three relation types (support, attack, and undercut). We conduct two complementary experiments to evaluate the system. First, we perform a manual evaluation on arguments drawn from an argumentation theory textbook to assess the system's ability to recover argumentative structure. Second, we conduct a quantitative evaluation on benchmark datasets, allowing comparison with prior work by mapping our outputs to established annotation schemes. Results show that the system can adequately recover argumentative structures and, when adapted to different annotation schemes, achieve reasonable performance across benchmark datasets. These findings highlight the potential of LLM-based pipelines for scalable argument mining.

URL PDF HTML ☆

赞 0 踩 0

2605.13318 2026-05-20 cs.AI cs.ET

VERA-MH: Validation of Ethical and Responsible AI in Mental Health

VERA-MH：心理健康领域伦理和负责任AI的验证

Luca Belli, Kate H. Bentley, Josh Gieringer, Emily Van Ark, Nilu Zhao, Pradip Thachile, Matt Hawrilenko, Millard Brown, Adam M. Chekroud

发表机构 * Spring Health ； Yale University（耶鲁大学）

AI总结本研究提出VERA-MH，一种用于评估心理健康支持聊天机器人安全性的新型临床验证方法，重点评估聊天机器人在识别自杀倾向风险方面的表现。

详情

AI中文摘要

随着聊天机器人在更多领域被使用，包括原本未被设计用于的领域，如心理健康支持。为此，我们介绍了验证伦理和负责任AI在心理健康中的应用（VERA-MH），一种新的临床验证评估，用于评估聊天机器人在心理健康支持中的安全性。VERA-MH的第一版专注于自杀念头（SI）风险，通过评估聊天机器人如何回应可能处于危机中的用户。VERA-MH由三个步骤组成：对话模拟、对话评估和模型评分。首先，为评估的聊天机器人模拟对话，另一个聊天机器人将扮演用户角色，基于特定的人设进行模拟。这些用户人设是在临床指导下开发的，以确保代表多种风险因素、人口特征和披露因素。在评估步骤中，一个第二支持模型作为LLM-as-a-Judge，结合一个临床开发的评分表。评分表结构为流程，每次提出一个Yes/No问题，以提高答案的一致性并突出模型的失败模式。在最后阶段，每个对话的结果被汇总以呈现最终的聊天机器人评估。与框架一起，我们还展示了对四个领先LLM提供商的评估结果。

英文摘要

Chatbot usage has increased, including in fields for which they were never developed for--notably mental health support. To that end, we introduce Validations of Ethical and Responsible AI in Mental Health (VERA-MH), a novel clinically-validated evaluation for safety of chatbots in the context of mental health support. The first iteration of VERA-MH focuses on Suicidal Ideation (SI) risks, by assessing how well chatbots can responds to users that might be in crisis. VERA-MH is comprised of three steps: conversation simulation, conversation judging and model rating. First, to simulate conversations with the chatbot under evaluation, another chatbot is tasked with role-playing users based on specific personas. Such user personas have been developed under clinical guidance, to make sure that, among others, multiple risk factors, demographic characteristics and disclosure factors were represented. In the judging step, a second support model is used as an LLM-as-a-Judge, together with a clinically-developed rubric. The rubric is structured as a flow, with a single Yes/No question asked each time, to improve answers' consistency and highlight models' failure modes. In the last stage, results of each conversation are aggregated to present the final evaluation of the chatbot. Together with the framework, we present the result of the evaluations for four leading LLM providers.

URL PDF HTML ☆

赞 0 踩 0

2605.13193 2026-05-20 cs.CV

FIKA-Bench: From Fine-grained Recognition to Fine-Grained Knowledge Acquisition

FIKA-Bench: 从细粒度识别到细粒度知识获取

Geng Li, Yuxin Peng

发表机构 * Wangxuan Institute of Computer Technology, Peking University（北京大学计算机技术研究所）

AI总结本文提出FIKA-Bench，一个包含311个公开来源和现实实例的细粒度知识获取基准，通过过滤和审计确保实例质量，评估最新多模态模型和代理发现细粒度识别任务仍具挑战性，需改进代理设计以提升知识获取能力。

Comments Project page with code: https://ligeng0197.github.io/FIKA-Bench.github.io/

详情

AI中文摘要

日常生活中细粒度识别往往不是封闭书目分类问题：当遇到陌生物体时，人类会主动搜索、比较视觉细节并验证证据后再做决定。现有基准主要评估视觉识别能力，忽略了这种主动外部知识获取能力。我们研究细粒度知识获取，即系统必须寻求、验证并使用外部证据来回答开放式细粒度识别问题。我们引入FIKA-Bench，一个泄漏意识且证据支持的实例集合，包含311个公开来源和现实实例。为确保高质量，每个实例均经过前沿封闭书目模型过滤以去除记忆案例，并经过审核以消除图像-答案泄漏，仅保留由验证证据支持的样本。我们对最新多模态模型（LMMs）和代理的评估显示，该任务仍具挑战性：最佳系统仅达到25.1%的准确率，无模型超过30%。关键发现是，仅给模型配备工具不足以弥合这一差距；代理失败主要由错误实体检索和较差的视觉判断驱动。这些结果表明，可靠的知识获取需要更好的代理设计，以专注于细粒度识别。

英文摘要

Fine-grained recognition in everyday life is often not a closed-book classification problem: when encountering unfamiliar objects, humans actively search, compare visual details, and verify evidence before deciding. Existing benchmarks primarily evaluate visually recognition, leaving this active external knowledge acquisition ability underexplored. We study fine-grained knowledge acquisition, where a system must seek, verify, and use external evidence to answer open-ended fine-grained recognition questions. We introduce FIKA-Bench, a leakage-aware and evidence-grounded collection of 311 public-source and real-life instances. To ensure high quality, every example is filtered against frontier closed-book models to remove memorized cases and audited to eliminate image-answer leakage, retaining only samples supported by verified evidence. Our evaluation of latest Large Multimodal Models (LMMs) and agents reveals that the task remains a formidable challenge: the best system reaches only 25.1% accuracy, with no model exceeding 30%. Crucially, we find that merely equipping models with tools is insufficient to bridge this gap; agent failures are predominantly driven by wrong entity retrieval and poor visual judgement. These results show that reliable knowledge acquisition needs better agent designs that focus on fine-grained recognition.

URL PDF HTML ☆

赞 0 踩 0

2605.12640 2026-05-20 cs.CV

MambaPanoptic: A Vision Mamba-based Structured State Space Framework for Panoptic Segmentation

MambaPanoptic：基于视觉Mamba的结构状态空间框架用于全景分割

Qing Cheng, Damiano Bertolini, Wei Zhang, Dong Wang, Niclas Zeller, Daniel Cremers

发表机构 * Technical University of Munich（慕尼黑技术大学）； Munich Center for Machine Learning (MCML)（慕尼黑机器学习中心）； Polytechnic University of Milan（米兰理工大学）； University of Stuttgart（斯图加特大学）； Wuhan University（武汉大学）； Karlsruhe University of Applied Sciences（卡尔斯鲁厄应用科学大学）

AI总结本研究提出MambaPanoptic，一种基于视觉Mamba的结构状态空间框架，旨在解决全景分割中长程上下文建模、多尺度特征表示和高效密集预测的挑战，通过引入MambaFPN和改进的PanopticFCN风格核生成器实现统一的实例和物质预测。

Comments Accepted to ISPRS Congress 2026, camera-ready version

详情

AI中文摘要

全景分割要求同时识别可计数的实例和无形态的物质区域，对长程上下文建模、多尺度特征表示和高效密集预测提出了联合需求。现有的卷积和Transformer方法难以同时满足这三个要求：卷积架构在建模长程依赖方面能力有限，而基于Transformer的方法在高分辨率下会带来二次计算成本。在本文中，我们提出MambaPanoptic，一种完全基于Mamba的全景分割框架，通过两个主要贡献来解决这些限制。首先，我们引入MambaFPN，一种自上而下的特征金字塔，利用Mamba块生成具有线性计算复杂度的全局一致、多尺度特征表示。其次，我们采用PanopticFCN风格的核生成器，产生统一的实例和物质核用于无提案的全景预测，并通过在多个网络阶段应用QuadMamba基于的特征细化模块进行增强。在Cityscapes和COCO全景分割基准测试中，实验表明MambaPanoptic在同等模型大小下一致优于PanopticDeepLab和PanopticFCN，并在Cityscapes上以更少的参数匹配或超越Mask2Former在PQ和AP上的表现。

英文摘要

Panoptic segmentation requires the simultaneous recognition of countable thing instances and amorphous stuff regions, placing joint demands on long-range context modelling, multi-scale feature representation, and efficient dense prediction. Existing convolutional and transformer-based methods struggle to satisfy all three requirements concurrently: convolutional architectures are limited in their capacity to model long-range dependencies, while transformer-based methods incur quadratic computational cost that is prohibitive at high resolutions. In this paper, we propose MambaPanoptic, a fully Mamba-based panoptic segmentation framework that addresses these limitations through two principal contributions. First, we introduce MambaFPN, a top-down feature pyramid that leverages Mamba blocks to generate globally coherent, multi-scale feature representations with linear computational complexity. Second, we adopt a PanopticFCN-style kernel generator that produces unified thing and stuff kernels for proposal-free panoptic prediction, enhanced by a QuadMamba-based feature refinement module applied at multiple network stages. Experiments on the Cityscapes and COCO panoptic segmentation benchmarks demonstrate that MambaPanoptic consistently outperforms PanopticDeepLab and PanopticFCN under comparable model sizes, and matches or surpasses Mask2Former on Cityscapes in PQ and AP while requiring fewer parameters.

URL PDF HTML ☆

赞 0 踩 0

2605.12320 2026-05-20 cs.CV

Contrastive Learning under Noisy Temporal Self-Supervision for Colonoscopy Videos

在噪声时间自监督下利用对比学习进行结肠镜视频处理

Luca Parolari, Pietro Gori, Lamberto Ballan, Carlo Biffi, Loic Le Folgoc

发表机构 * Department of Mathematics, University of Padova, Padova, Italy（帕多瓦大学数学系）； LTCI, Telecom Paris, Institut Polytechnique de Paris, Palaiseau, France（巴黎电信学院）； Cosmo Intelligent Medical Devices, Dublin, Ireland（都柏林智能医疗设备公司）

AI总结本文提出一种在噪声时间自监督下利用对比学习进行结肠镜视频处理的方法，通过利用结肠镜检查的顺序流程来推导自监督关联，引入噪声感知的对比损失以处理噪声关联，从而在多项下游任务中取得了优于现有自监督和监督基线方法的性能。

Comments Accepted to MICCAI 2026

详情

AI中文摘要

学习鲁棒的息肉轨迹表示对于启用多项AI辅助结肠镜应用至关重要，从息肉特征化到自动化报告和检索。监督对比学习是学习此类表示的有效方法，但通常依赖于正确的正负定义。收集这些标签需要链接在整个视频中描绘相同基础息肉实体的轨迹，这成本高昂且需要专门的临床专业知识。在本工作中，我们利用结肠镜检查的顺序流程推导出自监督关联。由于时间推导的关联不保证正确，我们引入了噪声感知的对比损失以处理噪声关联。我们展示了所学表示在多项下游任务中的有效性，包括息肉检索和重识别、大小估计和组织学分类。我们的方法在多项任务中优于先前的自监督和监督基线方法，并且在所有任务中与最近的基座模型相匹配或超过，使用了一个仅在27个视频上训练的轻量级编码器。代码可在https://github.com/lparolari/ntssl上获得。

英文摘要

Learning robust representations of polyp tracklets is key to enabling multiple AI-assisted colonoscopy applications, from polyp characterization to automated reporting and retrieval. Supervised contrastive learning is an effective approach for learning such representations, but it typically relies on correct positive and negative definitions. Collecting these labels requires linking tracklets that depict the same underlying polyp entity throughout the video, which is costly and demands specialized clinical expertise. In this work, we leverage the sequential workflow of colonoscopy procedures to derive self-supervised associations from temporal structure. Since temporally derived associations are not guaranteed to be correct, we introduce a noise-aware contrastive loss to account for noisy associations. We demonstrate the effectiveness of the learned representations across multiple downstream tasks, including polyp retrieval and re-identification, size estimation, and histology classification. Our method outperforms prior self-supervised and supervised baselines, and matches or exceeds recent foundation models across all tasks, using a lightweight encoder trained on only 27 videos. Code is available at https://github.com/lparolari/ntssl.

URL PDF HTML ☆

赞 0 踩 0

2605.09329 2026-05-20 cs.CL cs.LG

Test-Time Speculation

测试时推测

Avinash Kumar, Sujay Sanghavi, Poulami Das

发表机构 * The University of Texas at Austin（德克萨斯大学奥斯汀分校）

AI总结本文研究了测试时推测方法，通过在线蒸馏技术提升长响应任务中推测器的接受长度，从而提高LLM推理效率。

详情

AI中文摘要

推测解码通过使用快速草稿模型生成token并用更准确的目标模型验证，从而加速LLM推理。其性能取决于接受长度，即目标模型接受的草稿token数量。我们的研究表明，即使是最先进的推测器，如DFlash、EAGLE-3和PARD，其接受长度也会随着生成长度的增加而下降，在仅几千个输出token后接近1（即无加速），这使推测器在长响应任务中变得无效。接受长度下降是因为大多数推测器在离线训练时仅在短序列上训练，但在推理时被迫匹配远长于训练分布的输出。为了解决这个问题，我们提出了测试时推测（TTS），一种在线蒸馏方法，可以在测试时连续调整推测器。TTS利用关键见解，即token验证步骤已经为每个草稿token调用了目标模型，从而提供所需的训练信号，以无额外成本地调整草稿。将草稿视为学生，目标模型视为教师，TTS在多个推测轮次中调整草稿，每次更新都提高草稿的准确性。我们的结果表明，在Qwen-3、Qwen-3.5和Llama3.1家族的多个模型上，TTS在最先进的推测器上将接受长度提高高达72%和41%，且随着生成长度的增加，收益呈比例增长。

英文摘要

Speculative decoding accelerates LLM inference by using a fast draft model to generate tokens and a more accurate target model to verify them. Its performance depends on the $\textit{acceptance length}$, or number of draft tokens accepted by the target. Our studies show that the acceptance length of even state-of-the-art speculators, like DFlash, EAGLE-3 and PARD degrade with generation length, reaching values close to 1 (i.e. no speedup) within just a few thousand output tokens, making speculators ineffective for long-response tasks. Acceptance lengths decline because most speculators are trained offline on short sequences, but are forced to match the target model on much longer outputs at inference, well beyond their training distribution. To address this issue, we propose $\textit{Test-Time Speculation (TTS)}$, an online distillation approach that continuously adapts the speculator at test-time. TTS leverages the key insight that the token verification step already invokes the target model for each draft token, providing the training signal needed to adapt the draft at no additional cost. Treating the draft as the student and the target as a teacher, TTS adjusts the draft over several speculation rounds, with each update improving the draft's accuracy as generation proceeds. Our results across multiple models from the Qwen-3, Qwen-3.5, and Llama3.1 families show that TTS improves acceptance lengths over state-of-the-art speculators by up to $72\%$ and $41\%$ on average, with the benefits scaling with increased generation lengths.

URL PDF HTML ☆

赞 0 踩 0

2605.04970 2026-05-20 cs.LG cs.AI

Skill Neologisms: Towards Skill-based Continual Learning

技能新词：迈向基于技能的持续学习

Antonin Berthon, Nicolas Astorga, Mihaela van der Schaar

发表机构 * University of Cambridge（剑桥大学）

AI总结本文提出了一种基于技能的新词（skill neologisms）方法，通过在模型词汇中集成软token，以提高模型在特定技能上的能力，同时支持零样本组合其他技能，从而实现可扩展的基于技能的持续学习。

详情

AI中文摘要

现代大语言模型（LLMs）在不断扩大的技能范围内表现出色，并能灵活组合这些技能。然而，以可扩展的方式将模型能力扩展到新技能仍然是一个开放性问题：微调和参数高效变体有灾难性遗忘的风险，而基于上下文的方法表达能力有限且受模型有效上下文的限制。我们探索了技能新词——整合在模型词汇中的软token，并优化以提高特定技能的能力——作为一种方法，以在不更新权重的情况下选择性地获取新技能。我们首先观察到预训练LLMs已经表现出与程序知识相关的token。然后在受控的合成任务上展示，技能新词可以学习以提高模型在特定技能上的能力，同时能够与分布外技能组合，且独立训练的技能新词可以零样本组合。最后，我们验证了在更现实的自然语言设置中，即Skill-Mix基准测试中，独立学习的技能新词的零样本组合。这些结果表明，技能新词可能为基于技能的持续学习提供可扩展的路径。

英文摘要

Modern LLMs show mastery over an ever-growing range of skills, as well as the ability to compose them flexibly. However, extending model capabilities to new skills in a scalable manner is an open problem: fine-tuning and parameter-efficient variants risk catastrophic forgetting, while context-based approaches have limited expressiveness and are constrained by the model's effective context. We explore skill neologisms--soft tokens integrated in the model's vocabulary and optimized to improve capabilities over a specific skill--as a way to selectively acquire new skills without weight updates. We first observe that pre-trained LLMs already exhibit tokens associated with procedural knowledge. We then show on a controlled synthetic task that skill neologisms can be learned to improve model capabilities on specific skills while being composable with out-of-distribution skills, and that independently trained skill neologisms can be composed zero-shot. Finally, we validate zero-shot composition of independently learned skill neologisms on the more realistic natural language setting of the Skill-Mix benchmark. These results suggest that skill neologisms may provide a scalable path towards skill-based continual learning.

URL PDF HTML ☆

赞 0 踩 0

2605.01361 2026-05-20 cs.LG

Decision-Focused Learning via Tangent-Space Projection of Prediction Error

通过预测误差的切线空间投影进行决策聚焦学习

Junhyeong Lee, Sangjin Jin, Yongjae Lee

发表机构 * Department of Industrial Engineering, Ulsan National Institute of Science and Technology（乌山国立科学与技术研究院工业工程系）

AI总结本文提出了一种基于预测误差切线空间投影的决策聚焦学习方法，通过几何特征简化了后悔梯度的计算，提升了下游决策质量并提高了计算效率。

Comments 21 pages, 4 figures, 11 tables

详情

AI中文摘要

决策聚焦学习（DFL）训练预测器以提高下游决策质量，但计算后悔梯度通常需要对求解器进行微分或依赖于替代损失函数，这可能计算成本高或偏离真实目标。我们证明，在标准正则性条件下，本地稳定的活动约束下，后悔梯度具有闭式几何特征，等价于预测误差投影到活动约束的切线空间，乘以局部曲率。这表明，可以通过过滤决策无关成分来获得后悔梯度，提供了一种更简单直接的替代方法。基于此，我们提出PEAR（投影误差作为后悔梯度），通过在活动约束上减少的线性系统计算后悔梯度，避免对求解器迭代或额外优化求解进行微分。在LP基准和一个现实QP任务上的实验表明，PEAR在所有基线中实现了最佳的决策质量，同时是最具计算效率的，其优势在约束变化下依然保持。

英文摘要

Decision-Focused Learning (DFL) trains predictors to improve downstream decision quality, but computing regret gradients typically requires differentiating through solvers or relying on surrogate losses, which can be computationally expensive or deviate from the true objective. We show that, under standard regularity with locally stable active constraints, the regret gradient admits a closed-form geometric characterization, equivalent to the prediction error projected onto the tangent space of active constraints, scaled by local curvature. This reveals that regret gradients can be obtained by filtering decision-irrelevant components from the MSE gradient, providing a simpler and more direct alternative to existing approaches. Based on this, we propose PEAR (Projected Error As Regret-gradient), which computes regret gradients via a reduced linear system over active constraints, avoiding differentiation through solver iterations or additional optimization solves. Experiments on LP benchmarks and a real-world QP task show that PEAR achieves the best decision quality among all baselines while being the most computationally efficient, with gains that persist under constraint shifts.

URL PDF HTML ☆

赞 0 踩 0

2604.15166 2026-05-20 cs.CV cs.AI cs.LG

Class Unlearning via Depth-Aware Removal of Forget-Specific Directions

通过深度感知移除遗忘特定方向实现类别反学习

Arman Hatami, Romina Aalishah, Ilya E. Monosov

发表机构 * Johns Hopkins University（约翰霍普金斯大学）

AI总结本文提出DAMP方法，通过深度感知移除遗忘特定方向，改进类别反学习的选性遗忘，同时更好地保留保留类性能并减少深层残留遗忘结构。

Comments Accepted for oral presentation at the CVPR 2026 Workshop on Machine Unlearning for Vision (MUV). Code: https://github.com/armanhtm/DAMP

详情

AI中文摘要

机器反学习旨在在不重新训练模型的情况下移除目标知识。然而，在类别反学习中，降低遗忘类的准确性并不一定意味着真正的遗忘：遗忘的信息可能仍编码在内部表示中，而显着的遗忘可能源于分类器头部抑制而非表示移除。我们显示现有类别反学习方法往往表现出弱或负的选择性，保留遗忘类结构在深度表示中，或严重依赖最终层偏移。我们随后引入DAMP（通过投影的深度感知调节），一种单次、闭合形式的权重手术方法，可以在不使用梯度优化的情况下从预训练网络中移除遗忘特定方向。在每个阶段，DAMP在下一个可学习操作的输入空间中计算类别原型，提取遗忘方向作为相对于保留类原型的残差，并应用基于投影的更新以减少下游对这些方向的敏感性。为了保持实用性，DAMP使用从探测分离性导出的参数无关深度感知缩放规则，应用较小的编辑在早期层和较大的编辑在深层。该方法自然扩展到多类遗忘通过低秩子空间移除。在MNIST、CIFAR-10、CIFAR-100和Tiny ImageNet以及卷积和变换器架构上，DAMP比一些先前方法更接近再训练的黄金标准，改进了选择性遗忘的同时更好地保留保留类性能并减少深层残留遗忘结构。

英文摘要

Machine unlearning aims to remove targeted knowledge from a trained model without the cost of retraining from scratch. In class unlearning, however, reducing accuracy on forget classes does not necessarily imply true forgetting: forgotten information can remain encoded in internal representations, and apparent forgetting may arise from classifier-head suppression rather than representational removal. We show that existing class-unlearning methods often exhibit weak or negative selectivity, preserve forget-class structure in deep representations, or rely heavily on final-layer bias shifts. We then introduce DAMP (Depth-Aware Modulation by Projection), a one-shot, closed-form weight-surgery method that removes forget-specific directions from a pretrained network without gradient-based optimization. At each stage, DAMP computes class prototypes in the input space of the next learnable operator, extracts forget directions as residuals relative to retain-class prototypes, and applies a projection-based update to reduce downstream sensitivity to those directions. To preserve utility, DAMP uses a parameter-free depth-aware scaling rule derived from probe separability, applying smaller edits in early layers and larger edits in deeper layers. The method naturally extends to multi-class forgetting through low-rank subspace removal. Across MNIST, CIFAR-10, CIFAR-100, and Tiny ImageNet, and across convolutional and transformer architectures, DAMP more closely resembles the retraining gold standard than some of the prior methods, improving selective forgetting while better preserving retain-class performance and reducing residual forget-class structure in deep layers.

URL PDF HTML ☆

赞 0 踩 0

2604.07303 2026-05-20 cs.RO

Robots that learn to evaluate models of collective behavior

能够评估集体行为模型的机器人

Mathis Hocke, Andreas Gerken, David Bierbach, Jens Krause, Tim Landgraf

发表机构 * Department of computer science, Freie Universität Berlin（自由大学柏林计算机科学系）； SCIoI Excellence Cluster, Technische Universität Berlin（柏林技术大学SCIoI卓越中心）； Faculty of Life Sciences, Humboldt-Universität zu Berlin（柏林洪堡大学生命科学学院）； Department of Fish Biology, Fisheries, and Aquaculture, Leibniz Institute of Freshwater Ecology and Inland Fisheries（莱比锡淡水生态与内陆渔业研究所鱼类生物学、渔业与水产养殖系）

AI总结本文提出了一种基于强化学习的框架，利用仿生机器人鱼评估活鱼行为的计算模型，通过闭环交互量化真实鱼与模拟鱼行为的差异，展示了学习驱动的机器人实验如何发现行为模型的不足。

详情

AI中文摘要

理解并建模动物行为对于研究集体运动、决策和生物启发机器人至关重要。然而，评估行为模型的准确性仍然常常依赖于离线比较静态轨迹统计。在这里，我们介绍了一种基于强化学习的框架，利用仿生机器人鱼（RoboFish）通过闭环交互评估计算模型中的活鱼行为。我们使用四个不同的鱼模型（一个简单的恒定跟随基准、两个基于规则的模型和一个生物基础的卷积神经网络模型）在仿真中训练策略，并将这些策略转移到真实的RoboFish系统中，与活鱼互动。策略被训练引导模拟鱼前往目标位置，使我们能够量化真实鱼对目标位置的响应与模拟鱼响应的差异。通过量化模拟到现实的差距（定义为模拟和现实行为指标分布的Wasserstein距离，如目标到达性能、个体间距离、墙互动和对齐），我们评估鱼模型。基于神经网络的鱼模型在目标到达性能和其他大多数指标上表现出最小的差距，表明其在该基准下的行为保真度高于传统基于规则的模型。更重要的是，这种分离表明，所提出的评估方法能够在匹配的闭环条件下定量区分候选模型。我们的工作展示了学习驱动的机器人实验如何揭示行为模型的不足，并提供了一种通过具身交互评估动物行为模型的一般框架。

英文摘要

Understanding and modeling animal behavior is essential for studying collective motion, decision-making, and bio-inspired robotics. Yet, evaluating the accuracy of behavioral models still often relies on offline comparisons to static trajectory statistics. Here we introduce a reinforcement-learning-based framework that uses a biomimetic robotic fish (RoboFish) to evaluate computational models of live fish behavior through closed-loop interaction. We trained policies in simulation using four distinct fish models-a simple constant-follow baseline, two rule-based models, and a biologically grounded convolutional neural network model-and transferred these policies to the real RoboFish setup, where they interacted with live fish. Policies were trained to guide a simulated fish to goal locations, enabling us to quantify how the response of real fish differs from the simulated fish's response. We evaluate the fish models by quantifying the sim-to-real gaps, defined as the Wasserstein distance between simulated and real distributions of behavioral metrics such as goal-reaching performance, inter-individual distances, wall interactions, and alignment. The neural network-based fish model exhibited the smallest gap across goal-reaching performance and most other metrics, indicating higher behavioral fidelity than conventional rule-based models under this benchmark. More importantly, this separation shows that the proposed evaluation can quantitatively distinguish candidate models under matched closed-loop conditions. Our work demonstrates how learning-based robotic experiments can uncover deficiencies in behavioral models and provides a general framework for evaluating animal behavior models through embodied interaction.

URL PDF HTML ☆

赞 0 踩 0

2604.05002 2026-05-20 cs.LG cs.AI

Learning Stable Predictors from Weak Supervision under Distribution Shift

在分布偏移下从弱监督中学习稳定的预测器

Mehrdad Shoeibi, Elias Hossain, Ivan Garibay, Niloofar Yousefi

发表机构 * University of Central Florida（中央佛罗里达大学）

AI总结本文研究了在分布偏移下从弱监督中学习稳定预测器的问题，通过CRISPR-Cas13d转录组扰动实验，探讨了监督漂移现象，并展示了弱监督在域内学习和部分跨细胞系迁移中的有效性，同时揭示了时间迁移中的失败源于监督漂移而非模型容量或简单协变量偏移。

详情

AI中文摘要

在真实标签不可用时，从弱、代理或相对监督中学习是常见的，但分布偏移下的鲁棒性仍缺乏理解，因为监督机制本身可能在不同环境中变化。我们正式将这种现象定义为监督漂移，即$P(y \mid x, c)$在不同上下文中变化，并在CRISPR-Cas13d转录组扰动实验中研究了它，其中指导效果是通过RNA-seq响应间接推断的。使用涵盖两种人类细胞系和多个诱导后时间点的公开数据，我们构建了一个受控的非独立同分布基准，具有明确的领域（细胞系）和时间偏移，同时在所有上下文中重用固定的弱标签构造以避免改变目标。在线性和树基模型中，弱监督支持域内有意义的学习（岭$R^2 = 0.356$，斯皮尔曼$ρ= 0.442$）和部分跨细胞系迁移（$ρ\approx 0.40$）。相比之下，时间迁移在所有考虑的模型类别中崩溃，产生负$R^2$和弱或接近零的$ρ$（岭$R^2 = -0.145$，$ρ= 0.008$；XGBoost $R^2 = -0.155$，$ρ= 0.056$；随机森林 $R^2 = -0.322$，$ρ= 0.139$）。使用外部重新计算的弱标签、偏移分数量化和简单的缓解基线进行额外的鲁棒性分析，保持了相同定性的模式。特征-标签关联和特征重要性分析在不同细胞系中相对稳定，但在时间上变化剧烈，表明失败源于监督漂移而非模型容量或简单协变量偏移。这些结果表明，在弱监督下强域内性能可能是误导性的，并促使将特征稳定性作为轻量级诊断，用于部署前检测非可迁移性。

英文摘要

Learning from weak, proxy, or relative supervision is common when ground-truth labels are unavailable, but robustness under distribution shift remains poorly understood because the supervision mechanism itself may change across environments. We formalize this phenomenon as supervision drift, defined as changes in $P(y \mid x, c)$ across contexts, and study it in CRISPR-Cas13d transcriptomic perturbation experiments where guide efficacy is inferred indirectly from RNA-seq responses. Using publicly available data spanning two human cell lines and multiple post-induction timepoints, we construct a controlled non-IID benchmark with explicit domain (cell line) and temporal shifts, while reusing a fixed weak-label construction across all contexts to avoid changing targets. Across linear and tree-based models, weak supervision supports meaningful learning in-domain (ridge $R^2 = 0.356$, Spearman $ρ= 0.442$) and partial cross-cell-line transfer ($ρ\approx 0.40$). In contrast, temporal transfer collapses across all model classes considered, yielding negative $R^2$ and weak or near-zero $ρ$ (ridge $R^2 = -0.145$, $ρ= 0.008$; XGBoost $R^2 = -0.155$, $ρ= 0.056$; random forest $R^2 = -0.322$, $ρ= 0.139$). Additional robustness analyses using externally recomputed weak labels, shift-score quantification, and simple mitigation baselines preserve the same qualitative pattern. Feature-label association and feature-importance analyses remain relatively stable across cell lines but change sharply over time, indicating that failures arise from supervision drift rather than model capacity or simple covariate shift. These results show that strong in-domain performance under weak supervision can be misleading and motivate feature stability as a lightweight diagnostic for non-transferability before deployment.

URL PDF HTML ☆

赞 0 踩 0

2603.25722 2026-05-20 cs.CV cs.LG

No Hard Negatives Required: Concept Centric Learning Leads to Compositionality without Degrading Zero-shot Capabilities of Contrastive Models

无需硬负样本：基于概念的学习在不降低对比模型零样本能力的情况下实现组合性

Hai X. Pham, David T. Hoffmann, Ricardo Guerrero, Brais Martinez

发表机构 * Samsung AI Center（三星人工智能中心）

AI总结本文提出了一种基于概念的学习方法，无需使用硬负样本即可在不损害对比模型零样本和检索能力的情况下实现组合性，通过简单的方法改进了文本和图像编码器的全局池化问题。

Comments Accepted at CVPR 2026. 2nd rev: update github repo URL

详情

AI中文摘要

对比视觉-语言（V&L）模型仍然是各种应用中的流行选择。然而，出现了几个限制，尤其是V&L模型学习组合性表示的能力有限。先前的方法通常通过生成定制训练数据来获得硬负样本。硬负样本已被证明可以提高组合性任务的性能，但通常只适用于单一基准，无法推广，并且可能导致基本V&L能力如零样本或检索性能的显著下降，使其不切实际。在本工作中，我们采取了不同的方法。我们识别出两个限制V&L组合性性能的根本原因：1）长训练标题不需要组合性表示；2）文本和图像编码器中的最终全局池化导致完全失去学习绑定所需的必要信息。为了解决这一问题，我们提出了两种简单的解决方案：1）使用标准NLP软件获得短的概念导向标题部分，并将其对齐到图像；2）引入无参数的跨模态注意力池化，从图像编码器中获得概念导向的视觉嵌入。通过这些更改和简单的辅助对比损失，我们获得了标准组合性基准的SOTA性能，同时保持或提高了强大的零样本和检索能力。这在不增加推理成本的情况下实现。我们在此工作的代码已发布在https://github.com/saic-fi/concept_centric_clip。

英文摘要

Contrastive vision-language (V&L) models remain a popular choice for various applications. However, several limitations have emerged, most notably the limited ability of V&L models to learn compositional representations. Prior methods often addressed this limitation by generating custom training data to obtain hard negative samples. Hard negatives have been shown to improve performance on compositionality tasks, but are often specific to a single benchmark, do not generalize, and can cause substantial degradation of basic V&L capabilities such as zero-shot or retrieval performance, rendering them impractical. In this work we follow a different approach. We identify two root causes that limit compositionality performance of V&Ls: 1) Long training captions do not require a compositional representation; and 2) The final global pooling in the text and image encoders lead to a complete loss of the necessary information to learn binding in the first place. As a remedy, we propose two simple solutions: 1) We obtain short concept centric caption parts using standard NLP software and align those with the image; and 2) We introduce a parameter-free cross-modal attention-pooling to obtain concept centric visual embeddings from the image encoder. With these two changes and simple auxiliary contrastive losses, we obtain SOTA performance on standard compositionality benchmarks, while maintaining or improving strong zero-shot and retrieval capabilities. This is achieved without increasing inference cost. We release the code for this work at https://github.com/saic-fi/concept_centric_clip.

URL PDF HTML ☆

赞 0 踩 0

2603.25476 2026-05-20 cs.LG

How Class Ontology and Data Scale Affect Audio Transfer Learning

音频迁移学习中类本体和数据规模的影响

Manuel Milling, Andreas Triantafyllopoulos, Alexander Gebhard, Simon Rampp, Björn W. Schuller

发表机构 * CHI – Chair of Health Informatics（健康信息学系）； Technical University of Munich（慕尼黑技术大学）； MCML – Munich Center for Machine Learning（慕尼黑机器学习中心）； Munich Center for Machine Learning（慕尼黑机器学习中心）； Munich Data Science Institute（慕尼黑数据科学研究所）； Group on Language, Audio, & Music（语言、音频与音乐小组）； Imperial College（帝国学院）

AI总结本文研究了在音频到音频迁移学习中，类本体和数据规模如何影响迁移学习的效果，发现增加样本和类别的数量对迁移学习有积极影响，但相似性在下游任务中起主导作用。

详情

AI中文摘要

迁移学习是深度学习中的关键概念，允许人工神经网络在数据有限的任务中受益于大量预训练数据的基础。尽管其广泛应用和明显优势，但关于迁移学习内部机制以及何时和如何有效工作的理解仍然存在许多开放问题。为此，我们进行了严格的研究，专注于音频到音频的迁移学习，在此过程中，我们在AudioSet的（基于本体的）子集上预训练各种模型状态，并在三个计算机听觉任务上进行微调：声学场景识别、鸟类活动识别和语音命令识别。我们报告说，增加预训练数据中的样本和类别的数量对迁移学习都有积极影响。然而，这通常被预训练与下游任务之间的相似性所超越，这种相似性可以导致模型学习到相似的特征。

英文摘要

Transfer learning is a crucial concept within deep learning that allows artificial neural networks to benefit from a large pre-training data basis when confronted with a task of limited data. Despite its ubiquitous use and clear benefits, there are still many open questions regarding the inner workings of transfer learning and, in particular, regarding the understanding of when and how well it works. To that extent, we perform a rigorous study focusing on audio-to-audio transfer learning, in which we pre-train various model states on (ontology-based) subsets of AudioSet and fine-tune them on three computer audition tasks, namely acoustic scene recognition, bird activity recognition, and speech command recognition. We report that increasing the number of samples and classes in the pre-training data both have a positive impact on transfer learning. This is, however, generally surpassed by similarity between pre-training and the downstream task, which can lead the model to learn comparable features.

URL PDF HTML ☆

赞 0 踩 0

2603.22161 2026-05-20 cs.LG

Causal Evidence that Language Models use Confidence to Drive Behavior

语言模型使用置信度驱动行为的因果证据

Dharshan Kumaran, Nathaniel Daw, Simon Osindero, Petar Veličković, Viorica Patraucean

发表机构 * Google DeepMind（谷歌深Mind）； Princeton University（普林斯顿大学）

AI总结研究探讨了语言模型是否利用置信度信号来控制行为，如决定回答或 abstain，通过四个阶段实验发现模型使用多维内部置信表示和阈值策略来实现 abstention，揭示了结构化的元认知控制机制。

详情

AI中文摘要

元认知——评估自身认知表现的质量——指导跨物种的适应性行为。大量研究表明可以从语言模型输出中提取置信度信号，但一个根本问题仍然存在：模型是否真的利用这些信号来控制行为，例如决定是否回答或 abstain？为调查这一问题，我们开发了一个四阶段范式。第一阶段获取了无 abstention 选项的基线置信度估计。第二阶段揭示了 LLMs 在决定 abstain 时应用隐含阈值，置信度效应大小大约比其他机制大一个数量级。第三阶段通过激活引导提供了直接的因果证据：提升或抑制置信度信号会相应地降低或增加 abstention 率。第四阶段通过系统地变化指示阈值，证明 LLMs 主动部署置信度信号以实施 abstention 策略。关键的是，除了基于输出分布的校准对数概率置信度外，口头置信度在所有模型中独立预测 abstention，尽管其客观上对答案正确性的区分能力较弱。最后预答标记的激活解码进一步显示，这两种可观察的指标都是更丰富的内部表示的损失性读取。总体而言，这些结果表明，abstention 不仅仅是输出分布中证据强度的简单体现，而是更好地由多维内部置信表示和基于阈值的策略的联合操作所解释——与 LLMs 中的结构化元认知控制机制一致，这一能力在模型向自主代理过渡时变得越来越重要，因为这些代理必须识别自身的不确定性。

英文摘要

Metacognition -- assessing the quality of one's own cognitive performance -- guides adaptive behavior across species. Substantial research demonstrates that confidence signals can be extracted from language model outputs, yet a fundamental question remains: do models actually use these signals to control behavior, such as deciding whether to answer or abstain? To investigate, we developed a four-phase paradigm. Phase~1 elicited baseline confidence estimates without an abstention option. Phase~2 revealed that LLMs apply an implicit threshold to internal confidence when deciding to abstain, with confidence effect sizes approximately an order of magnitude larger than alternative mechanisms. Phase~3 provided direct causal evidence through activation steering: boosting or suppressing confidence signals correspondingly decreased or increased abstention rates. Phase~4 extended this by systematically varying instructed thresholds, demonstrating that LLMs actively deploy confidence signals to implement abstention policies. Critically, beyond calibrated log-probability based confidence derived from the output distribution, verbal confidence independently predicted abstention across all models, despite being objectively less discriminatory of answer correctness. Activation decoding at the last pre-answer token further showed that both observable measures are lossy readouts of a richer internal representation. Together, these results suggest that abstention is not fully captured by the strength of evidence in the output distribution alone, but is better explained by the joint operation of a multidimensional internal confidence representation and threshold-based policies -- consistent with structured metacognitive control in LLMs, a capacity of growing importance as models transition to autonomous agents that must recognize their own uncertainty.

URL PDF HTML ☆

赞 0 踩 0

2603.18396 2026-05-20 cs.LG cs.RO

RE-SAC: Disentangling aleatoric and epistemic risks in bus fleet control: A stable and robust ensemble DRL approach

RE-SAC：在公交车队控制中解耦偶然风险和本质风险：一种稳定且稳健的集成深度强化学习方法

Yifan Zhang, Liang Zheng

发表机构 * Central South University（中南大学）

AI总结该研究提出RE-SAC方法，通过解耦偶然风险和本质风险来提升公交车队控制的稳定性与鲁棒性，采用积分概率度量（IPM）基于的权重正则化和多样化Q-集成来应对不同类型的不确定性。

详情

AI中文摘要

公交保持控制因随机交通和乘客需求而具有挑战性。尽管深度强化学习（DRL）展现出潜力，但标准的actor-critic算法在波动环境中面临Q值不稳定的问题。这种不稳定性的一个关键来源是将两种不同的不确定性混淆：偶然不确定性（不可减少的噪声）和本质不确定性（数据不足）。将它们视为单一风险会导致在嘈杂状态下的价值低估，从而导致灾难性策略崩溃。我们提出了一种稳健的集成软actor-critic（RE-SAC）框架，以明确解耦这些不确定性。RE-SAC将积分概率度量（IPM）基于的权重正则化应用于批评者网络，以对抗偶然风险，为鲁棒Bellman算子提供平滑的分析下界，而无需昂贵的内循环扰动。为了应对本质风险，一个多样化Q-集成对稀疏覆盖区域中的过度自信价值估计进行惩罚。这种双重机制防止了集成方差将噪声误认为数据缺口，这种失败模式在我们的消融研究中被识别。在现实的双向公交走廊模拟实验中，RE-SAC在累计奖励（约-0.4e6）方面优于标准SAC（-0.55e6）。Mahalanobis稀有性分析证实，RE-SAC在罕见的分布外状态中将Oracle Q值估计误差减少了高达62%（MAE为1647 vs. 4343），展示了在高交通变异性下的优越鲁棒性。

英文摘要

Bus holding control is challenging due to stochastic traffic and passenger demand. While deep reinforcement learning (DRL) shows promise, standard actor-critic algorithms suffer from Q-value instability in volatile environments. A key source of this instability is the conflation of two distinct uncertainties: aleatoric uncertainty (irreducible noise) and epistemic uncertainty (data insufficiency). Treating these as a single risk leads to value underestimation in noisy states, causing catastrophic policy collapse. We propose a robust ensemble soft actor-critic (RE-SAC) framework to explicitly disentangle these uncertainties. RE-SAC applies Integral Probability Metric (IPM)-based weight regularization to the critic network to hedge against aleatoric risk, providing a smooth analytical lower bound for the robust Bellman operator without expensive inner-loop perturbations. To address epistemic risk, a diversified Q-ensemble penalizes overconfident value estimates in sparsely covered regions. This dual mechanism prevents the ensemble variance from misidentifying noise as a data gap, a failure mode identified in our ablation study. Experiments in a realistic bidirectional bus corridor simulation demonstrate that RE-SAC achieves the highest cumulative reward (approx. -0.4e6) compared to vanilla SAC (-0.55e6). Mahalanobis rareness analysis confirms that RE-SAC reduces Oracle Q-value estimation error by up to 62% in rare out-of-distribution states (MAE of 1647 vs. 4343), demonstrating superior robustness under high traffic variability.

URL PDF HTML ☆

赞 0 踩 0

2603.17305 2026-05-20 cs.AI cs.CL cs.LG

Contrastive Reasoning Alignment: Reinforcement Learning from Hidden Representations

对比推理对齐：从隐藏表示中学习强化学习

Haozheng Luo, Yimin Wang, Jiahao Yu, Binghui Wang, Yan Chen

发表机构 * Northwestern University（西北大学）； University of Michigan（密歇根大学）； Illinois Institute of Technology（伊利诺伊理工学院）

AI总结本文提出了一种基于对比学习和强化学习的框架CRAFT，通过优化隐藏状态空间中的目标来提升对抗攻击的鲁棒性，核心贡献是通过隐藏空间的几何结构实现推理层面的安全对齐。

Comments International Conference on Machine Learning (ICML) 2026

详情

AI中文摘要

我们提出CRAFT，一种红队对齐框架，利用模型推理能力和隐藏表示来提高对jailbreak攻击的鲁棒性。与以往主要在输出层面操作的防御方法不同，CRAFT将大型推理模型对齐以生成安全意识的推理轨迹，通过显式优化定义在隐藏状态空间上的目标。方法上，CRAFT将对比表示学习与强化学习相结合，分离安全和不安全的推理轨迹，得到支持鲁棒、推理层面安全对齐的潜在空间几何。理论上，我们证明将潜在文本一致性纳入GRPO可以消除表面上对齐的策略，将其排除在局部最优之外。实验上，我们在多个安全基准上评估CRAFT，使用两个强大的推理模型Qwen3-4B-Thinking和R1-Distill-Llama-8B，其中它在多个安全基准上均优于IPO和SafeKey等最先进的防御方法。值得注意的是，CRAFT在基础模型上实现了平均79.0%的推理安全性和87.7%的最终响应安全性提升，证明了隐藏空间推理对齐的有效性。

英文摘要

We propose CRAFT, a red-teaming alignment framework that leverages model reasoning capabilities and hidden representations to improve robustness against jailbreak attacks. Unlike prior defenses that operate primarily at the output level, CRAFT aligns large reasoning models to generate safety-aware reasoning traces by explicitly optimizing objectives defined over the hidden state space. Methodologically, CRAFT integrates contrastive representation learning with reinforcement learning to separate safe and unsafe reasoning trajectories, yielding a latent-space geometry that supports robust, reasoning-level safety alignment. Theoretically, we show that incorporating latent-textual consistency into GRPO eliminates superficially aligned policies by ruling them out as local optima. Empirically, we evaluate CRAFT on multiple safety benchmarks using two strong reasoning models, Qwen3-4B-Thinking and R1-Distill-Llama-8B, where it consistently outperforms state-of-the-art defenses such as IPO and SafeKey. Notably, CRAFT delivers an average 79.0% improvement in reasoning safety and 87.7% improvement in final-response safety over the base models, demonstrating the effectiveness of hidden-space reasoning alignment.

URL PDF HTML ☆

赞 0 踩 0

2603.11768 2026-05-20 cs.AI

Governing Evolving Memory in LLM Agents: Risks, Mechanisms, and the Stability and Safety Governed Memory (SSGM) Framework

在LLM代理中治理演化的记忆：风险、机制以及稳定性与安全性的治理记忆（SSGM）框架

Chingkwun Lam, Jiaxin Li, Lingfei Zhang, Kuo Zhao

发表机构 * College of Intelligent Science and Engineering（智能科学与工程学院）

AI总结本文研究了LLM代理中记忆治理的问题，提出了一种新的SSGM框架，通过一致性验证、时间衰减建模和动态访问控制来缓解记忆腐蚀风险，提高记忆系统的稳定性与安全性。

详情

AI中文摘要

长期记忆已成为自主大型语言模型（LLM）代理的基础组件，使连续适应、终身多模态学习和复杂推理成为可能。然而，随着记忆系统从静态检索数据库转变为动态、代理机制，关于记忆治理、语义漂移和隐私漏洞的关键问题变得突出。尽管最近的调查主要集中在记忆检索效率上，但它们大多忽略了在高度动态环境中记忆腐蚀的新兴风险。为了解决这些新兴挑战，我们提出了稳定性与安全性治理记忆（SSGM）框架，一种概念治理架构。SSGM通过在任何记忆巩固之前执行一致性验证、时间衰减建模和动态访问控制，将记忆演进与执行分离。通过形式分析和架构分解，我们展示了SSGM如何缓解拓扑诱导的知识泄漏，其中敏感上下文被固化到长期存储中，并帮助防止语义漂移，其中知识通过迭代总结退化。最终，这项工作提供了一个全面的记忆腐蚀风险分类法，并建立了部署安全、持久和可靠的代理记忆系统稳健治理范式。

英文摘要

Long-term memory has emerged as a foundational component of autonomous Large Language Model (LLM) agents, enabling continuous adaptation, lifelong multimodal learning, and sophisticated reasoning. However, as memory systems transition from static retrieval databases to dynamic, agentic mechanisms, critical concerns regarding memory governance, semantic drift, and privacy vulnerabilities have surfaced. While recent surveys have focused extensively on memory retrieval efficiency, they largely overlook the emergent risks of memory corruption in highly dynamic environments. To address these emerging challenges, we propose the Stability and Safety-Governed Memory (SSGM) framework, a conceptual governance architecture. SSGM decouples memory evolution from execution by enforcing consistency verification, temporal decay modeling, and dynamic access control prior to any memory consolidation. Through formal analysis and architectural decomposition, we show how SSGM can mitigate topology-induced knowledge leakage where sensitive contexts are solidified into long-term storage, and help prevent semantic drift where knowledge degrades through iterative summarization. Ultimately, this work provides a comprehensive taxonomy of memory corruption risks and establishes a robust governance paradigm for deploying safe, persistent, and reliable agentic memory systems.

URL PDF HTML ☆

赞 0 踩 0

2603.05933 2026-05-20 cs.CL cs.LG

Structured Style-Rewrite with Chain-of-Thought Planning for Low-Resource Character Dialogue

结构化风格重写与思维链规划用于低资源字符对话

Chanhui Zhu

发表机构 * Guangdong University of Finance（广东金融学院）

AI总结本文提出了一种结构化风格重写框架，结合思维链规划，以解决低资源条件下中文字符驱动生成中的风格分离问题，通过分解角色风格为可解释的格式签名、语法和语用维度，并利用思维链监督进行显式风格规划，实验表明该方法在保持语义忠实性的同时提升了风格质量。

Comments 30 pages, 5 figures. Preprint

详情

AI中文摘要

将小型语言模型（SLMs）应用于中文字符驱动生成仍然具有挑战性，因为数据稀缺和分离角色风格困难。标准监督微调（SFT）通常捕捉到表层语义，但会产生频繁的越界输出（OOC）。我们将此问题框架为受控的句子级风格重写任务，该任务将风格质量与对话情境管理分离。我们提出了一种结构化风格重写框架，将角色风格分解为可解释的格式签名、语法和语用维度，并结合思维链（CoT）监督进行显式风格规划。一个CoT共享直接偏好优化（DPO）阶段进一步通过确保偏好学习目标输出层面的风格执行而非推理轨迹差异来对齐风格规划与表层实现。在八个角色四个不同源领域的实验中，我们的方法使Qwen3-1.7B模型在有效风格得分上达到0.632，同时保持强语义忠实性（0.878），在评估系统中处于帕累托前沿，并在消费级硬件上显著优于更大的基线（如GLM-4.7）

英文摘要

Applying Small Language Models (SLMs) to Chinese character-driven generation remains challenging due to data scarcity and the difficulty of disentangling character style. Standard Supervised Fine-Tuning (SFT) often captures surface-level semantics but produces frequent Out-Of-Character (OOC) outputs. We frame this as a controlled sentence-level style rewriting task, which isolates stylistic quality from dialogue context management. We propose a Structured Style-Rewrite Framework that decomposes character style into interpretable format signature, syntactic, and pragmatic dimensions, combined with Chain-of-Thought (CoT) supervision for explicit style planning. A CoT-Shared Direct Preference Optimization (DPO) stage further aligns style planning with surface realization by ensuring preference learning targets output-level style execution rather than reasoning trace differences. Experiments across eight characters from four diverse source domains demonstrate that our method enables a Qwen3-1.7B model to achieve a Valid Style Score of $0.632$ while maintaining strong semantic fidelity (0.878), placing on the Pareto frontier among the evaluated systems and outperforming significantly larger baselines (e.g., GLM-4.7) on consumer hardware.

URL PDF HTML ☆

赞 0 踩 0

2603.05910 2026-05-20 cs.AI

The World Won't Stay Still: Programmable Evolution for Agent Benchmarks

世界不会静止：为智能体基准测试的可编程进化

Guangrui Li, Yaochen Xie, Yi Liu, Ziwei Dong, Xingyuan Pan, Tianqi Zheng, Jason Choi, Michael J. Morais, Binit Jha, Shaunak Mishra, Bingrou Zhou, Chen Luo, Monica Xiao Cheng, Dawn Song

发表机构 * Amazon（亚马逊公司）； UC Berkeley（伯克利大学）

AI总结本文研究了结构化环境进化作为智能体基准测试构建问题，提出了一种基于图的框架ProEvolve，使环境进化可编程，并在电商和航班预订领域验证了其在质量、实现有效性及失败模式方面的表现。

详情

AI中文摘要

LLM驱动的工具调用智能体通过与环境交互、查询数据和调用工具进行多轮过程来满足用户请求。然而，大多数现有基准测试在静态环境接口下评估这些系统，具有固定架构和工具集，难以评估智能体在环境演变时的行为——当能力被添加、重新组织或废弃时。在本文中，我们研究了结构化环境演变作为工具调用智能体的基准构建问题。我们提出了ProEvolve，一种基于图的框架，使环境演变可编程。其核心是一个类型关系图，为环境提供统一的显式表示——数据、工具和架构。在此形式化下，添加、删除或修改能力被表达为图变换，这些变换能一致地在工具、架构和数据访问之间传播更新。基于此，ProEvolve支持（1）通过显式图变换自动生成演变的可执行环境，以及（2）通过子图采样和实例化进行图引导的任务沙盒构建。我们通过两个工具调用领域——电商和航班预订——在质量、实现有效性和失败模式方面验证了ProEvolve。最后，我们使用生成的基准作为下游诊断，研究智能体在结构化环境演变下的代表性行为。

英文摘要

LLM-powered tool-calling agents fulfill user requests by interacting with environments, querying data, and invoking tools in a multi-turn process. Yet, most existing benchmarks evaluate these systems under static environment interfaces, with fixed schemas and toolsets, making it difficult to assess how agents behave as environments evolves -- when capabilities are added, reorganized, or deprecated across successive environment versions. In this paper, we study structured environment evolution as a benchmark-construction problem for tool-calling agents. We propose ProEvolve, a graph-based framework that makes environment evolution programmable. At its core, a typed relational graph provides a unified, explicit representation of the environment - data, tools, and schema. Under this formalism, adding, removing, or modifying capabilities are expressed as graph transformations that coherently propagate updates across tools, schemas, and data access. Building on this, ProEvolve supports (1) automatic generation of evolved executable environments through explicit graph transformations, and (2) graph-grounded construction of task sandboxes via subgraph sampling and instantiation. We validate ProEvolve in two tool-calling domains, e-commerce and airline booking, in terms of quality, implementation validity, and failure modes. Finally, we use the generated benchmark as a downstream diagnostic to study how representative agents behave under structured environment evolution.

URL PDF HTML ☆

赞 0 踩 0

2603.05066 2026-05-20 cs.LG

Reward-Conditioned Reinforcement Learning

基于奖励的强化学习

Michal Nauman, Marek Cygan, Pieter Abbeel

发表机构 * University of Warsaw（华沙大学）； Nomagic ； UC Berkeley, Amazon FAR（伯克利大学，亚马逊FAR）

AI总结本文提出基于奖励的强化学习（RCRL），通过在收集经验时使用单一名义目标，使智能体在不额外交互的情况下暴露于多种奖励目标，从而提高样本效率并支持零样本行为调整。

Comments preprint

详情

AI中文摘要

单任务强化学习代理通常在固定奖励函数下训练，这限制了它们对奖励误指定的鲁棒性和适应变化偏好的能力。我们引入基于奖励的强化学习（RCRL），一种离策略方法，该方法在收集经验时将智能体条件于奖励参数化，同时通过重新计算共享回放数据中的反事实奖励，使智能体暴露于多种奖励目标，而无需额外的环境交互。在单任务、多任务和基于视觉的基准测试中，RCRL在名义奖励参数化下提高了样本效率，能够高效适应新参数化，并在部署时支持零样本行为调整。我们的结果表明，RCRL提供了一种可扩展的机制，用于学习鲁棒且可操控的策略，而无需牺牲单任务训练的简洁性。

英文摘要

Single-task RL agents are typically trained under a fixed reward function, which limits their robustness to reward misspecification and their ability to adapt to changing preferences. We introduce Reward-Conditioned Reinforcement Learning (RCRL), an off-policy method that conditions agents on reward parameterizations while collecting experience under a single nominal objective. By recomputing counterfactual rewards from shared replay data, RCRL exposes the agent to multiple reward objectives without additional environment interaction, connecting single-task RL with ideas from multi-objective and multi-task learning. Across single-task, multi-task, and vision-based benchmarks, RCRL improves sample efficiency under the nominal reward parameterization, enables efficient adaptation to new parameterizations, and supports zero-shot behavioral adjustment at deployment. Our results show that RCRL provides a scalable mechanism for learning robust, steerable policies without sacrificing the simplicity of single-task training.

URL PDF HTML ☆

赞 0 踩 0

2602.20700 2026-05-20 cs.CV

NGL: Natural Garment Language for Training-Free Sewing Pattern Estimation

NGL：自然服装语言用于无训练缝纫图案估计

Anna Badalyan, Pratheba Selvaraju, Giorgio Becherini, Omid Taheri, Victoria Fernandez Abrevaya, Michael Black

发表机构 * Max Planck Institute for Intelligent Systems（马克斯·普朗克智能系统研究所）

AI总结本研究提出NGL自然服装语言，通过利用视觉语言模型的自然描述能力，实现无训练的缝纫图案估计，解决了传统方法在泛化能力、真实世界关联性和多层服装处理上的不足。

Comments 12 pages, 7 figures

详情

AI中文摘要

从图像估计缝纫图案是创建高质量3D服装的实用方法，但受限于真实世界图像和缝纫图案配对数据的稀缺性而具有挑战性。现有方法通过训练视觉语言模型（VLMs）从参数化服装模型采样的合成服装中学习低级缝纫图案表示来解决这一限制。然而，这些方法往往难以泛化到野外图像，无法捕捉真实世界服装部件之间的关联，并且局限于单层服装。相比之下，我们发现VLMs在描述服装时表现良好，但将这些描述映射到有效的缝纫图案仍然困难。为此，我们提出了NGL（自然服装语言），一种针对VLMs的领域特定语言，能够以与VLMs的自然描述能力对齐的方式表示服装。利用NGL，我们引入了一条完全无训练的流程，通过查询大型VLMs提取结构化的服装规格，并确定性地将其转换为有效的缝纫图案。我们在Dress4D、CloSe以及一个包含253张野外时尚图像的新数据集上评估了我们的方法。我们的方法在标准几何度量上实现了最先进的性能，并在人类和基于GPT的感知评估中优于现有基线。此外，NGL能够恢复多层服装，而竞争方法主要集中在单层服装上，突显了其在处理有遮挡部分的真实世界图像时的强大泛化能力。这些结果表明，高效的服装表示对于使用VLMs进行缝纫图案估计至关重要。我们的代码和数据将供研究使用。

英文摘要

Estimating sewing patterns from images is a practical approach for creating high-quality 3D garments, but it remains challenging due to the scarcity of paired real-world image and sewing-pattern data. Existing methods address this limitation by training vision-language models (VLMs) to learn low-level sewing-pattern representations from synthetic garments sampled from parametric garment models. However, they often struggle to generalize to in-the-wild images, fail to capture real-world correlations between garment parts, and are restricted to single-layer outfits. In contrast, we observe that VLMs are effective at describing garments in natural language, but mapping these descriptions into valid sewing patterns remains difficult. To bridge this gap, we propose NGL (Natural Garment Language), a novel domain-specific language that represents garments in terms aligned with VLMs' natural descriptive abilities. Leveraging NGL, we introduce a fully training-free pipeline that queries large VLMs to extract structured garment specifications and deterministically converts them into valid sewing patterns. We evaluate our method on the Dress4D, CloSe and a newly collected dataset of 253 in-the-wild fashion images. Our approach achieves state-of-the-art performance on standard geometry metrics and is preferred in both human and GPT-based perceptual evaluations compared to existing baselines. Furthermore, NGL recovers multi-layer outfits whereas competing methods focus mostly on single-layer garments, highlighting its strong generalization to real-world images even with occluded parts. These results demonstrate that an efficient garment representation is critical for sewing pattern estimation with VLMs. Our code and data will be released for research use.

URL PDF HTML ☆

赞 0 踩 0