arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.27997 2026-05-28 cs.CL cs.AI cs.LG

几何校正扩散后验采样：基于去噪器回拉曲率引导与流形对齐阻尼

Seunghyeok Shin, Minwoo Kim, Dabin Kim, Hongki Lim

发表机构 * Department of Electrical and Computer Engineering, Inha University, Incheon, 22212, South Korea（电气与计算机工程系，Inha大学，Incheon，22212，韩国）

AI总结提出一种基于去噪器回拉曲率引导和流形对齐阻尼的几何校正扩散后验采样方法，通过每噪声水平的阻尼高斯-牛顿校正替代标量引导，实现稳定高效的后验采样。

Comments Code: https://github.com/Seunghyeok0715/CLAMP

Journal ref International Conference on Machine Learning 2026

详情

AI中文摘要

扩散后验采样将扩散先验条件于测量值，但数据一致性更新通常由手动调整的引导权重缩放，并且在刚性、算子依赖的曲率下可能破坏采样稳定性。我们使用在扩散状态坐标中计算的每噪声水平阻尼高斯-牛顿校正替代标量引导。该校正通过去噪器回拉似然梯度，使用避免前向去噪器雅可比矩阵的单侧曲率模型，并应用与去噪器残差对齐的扩散校准秩一阻尼。每个校正通过自动微分的无矩阵GMRES求解，采样通过具有闭式漂移/噪声分离的方差保持朗之万转移进行。在FFHQ和ImageNet上的逆问题中，该方法在PSNR/SSIM/LPIPS上达到竞争性能，同时运行速度显著快于大多数对比基线；在加速MRI重建中，它在对比基线中取得了最佳的PSNR/SSIM。

英文摘要

Diffusion posterior sampling conditions diffusion priors on measurements, but data-consistency updates are typically scaled by hand-tuned guidance weights and can destabilize sampling under stiff, operator-dependent curvature. We replace scalar guidance with a per-noise-level damped Gauss--Newton correction computed in diffusion-state coordinates. The correction pulls likelihood gradients back through the denoiser, uses a one-sided curvature model that avoids forward denoiser Jacobians, and applies diffusion-calibrated rank-one damping aligned with the denoiser residual. Each correction is solved with matrix-free GMRES using automatic differentiation, and sampling proceeds with a variance-preserving Langevin transition with a closed-form drift/noise split. On FFHQ and ImageNet across inverse problems, it achieves competitive PSNR/SSIM/LPIPS while running markedly faster than most of the compared baselines; on accelerated MRI reconstruction, it achieves the best PSNR/SSIM among the compared baselines.

URL PDF HTML ☆

赞 0 踩 0

2605.27989 2026-05-28 cs.LG

Law of Neural Interaction: Depth-Width Shape, Interaction Efficiency, and Generalization

神经交互定律：深度-宽度形状、交互效率与泛化

Wenjie Sun, Jinning Yang, Shuai Zhang, Mengnan Du

发表机构 * The Chinese University of Hong Kong, Shenzhen（香港中文大学（深圳））； Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences（深圳先进技术研究院）； New Jersey Institute of Technology（新泽西理工学院）

AI总结通过将叠加原理从参数空间扩展到梯度空间定义为神经交互，发现固定预算下良好泛化伴随高效神经交互，且可通过调整深度-宽度比（R_D/W）使模型处于高效交互区间，该区间随预算扩展保持稳定，为模型形状初始化和泛化机制提供见解。

Comments 30 pages, 4 figures

详情

AI中文摘要

缩放定律的指导增加了现代大型语言模型（LLMs）的资源需求，但在固定预算下这些模型是否有效利用资源仍存疑问。先前研究证明叠加是损失的关键贡献者。通过利用神经特征假设，我们将叠加从参数空间扩展到梯度空间，并将其定义为神经交互。我们发现，在固定预算下，良好的泛化通常伴随着高效的神经交互，并且可以通过调整模型的深度-宽度比（$R_{D/W}$）将模型置于高效交互区间。此外，随着预算扩大，模型的高效交互区间保持相对稳定。通过比较现有小规模密集LLMs，我们观察到接近该区间的模型在MMLU-Pro基准上表现更好。我们的发现揭示了$R_{D/W}$影响资源利用效率，进而影响泛化，为模型形状初始化和理解模型泛化机制提供了见解。神经交互定律的代码可在：https://anonymous.4open.science/r/Neural_Interaction_Law-D788 获取。

英文摘要

The guidance of scaling laws has increased the resource demands of modern large language models (LLMs), yet it remains questionable whether these models utilize resources effectively under a fixed budget. Previous research has proved superposition as a key contributor to loss. By leveraging the Neural Feature Ansatz, we extend superposition from parameter space to gradient space and define it as neural interaction. We find that under a fixed budget, good generalization is usually accompanied by efficient neural interactions, and the model can be placed in an efficient interaction interval by adjusting its depth-width ratio ($R_{D/W}$). In addition, as the budget scales up, the efficient interaction interval of the model remains relatively stable. By comparing existing small scale dense LLMs, we observe that models operating near this interval tend to perform better on the MMLU-Pro benchmark. Our findings reveal that the $R_{D/W}$ influences resource utilization efficiency and thereby affects generalization, providing insights into model shape initialization and the understanding of model generalization mechanisms. Code for Neural Interaction Law is available at: https://anonymous.4open.science/r/Neural_Interaction_Law-D788

URL PDF HTML ☆

赞 0 踩 0

2605.27986 2026-05-28 cs.CL q-bio.QM

语义流正则化：教会LLMs生成多样且连贯的回复

Kerui Peng, Feifei Li, Xingyu Fan, Wenhui Que

发表机构 * Tencent Inc.（腾讯公司）； Beijing, China（中国北京）

AI总结针对大语言模型微调时输出多样性严重受限的跨风格坍缩问题，提出语义流正则化（SFR），通过条件流匹配监督骨干网络使用连续句子嵌入，在零部署成本下提升多样性和风格保真度。

详情

AI中文摘要

当大语言模型被微调以生成个性或语气条件化的回复时，其输出多样性受到严重限制——我们将这种失败称为跨风格坍缩。我们将这种坍缩追溯到交叉熵目标，该目标在共享表示下倾向于抑制多样化的延续。我们提出语义流正则化（SFR），一种轻量级的辅助目标，通过条件流匹配使用未来片段的连续句子编码器嵌入来监督骨干网络。随机流源通过构造保持多模态；流匹配头在推理时被丢弃，增加零部署成本。在一个大规模工业对话数据集（Qwen3-32B，9种个性）上，SFR在输出多样性、风格保真度和回复质量上优于SFT。我们进一步在公共LiveCodeBench-v5（Qwen2.5-Coder-7B-Instruct）上验证，其中SFR持续改进pass@k，证实了其超越风格化对话的通用性。在MBPP上的受控比较显示，多令牌预测是SFR的一个退化特例。

英文摘要

When large language models are fine-tuned to generate persona- or tone-conditioned responses, their output diversity is severely limited--a failure we term Cross-Style Collapse. We trace this collapse to the cross-entropy objective, which under shared representations tends to suppress diverse continuations. We propose Semantic Flow Regularization (SFR), a lightweight auxiliary objective that supervises the backbone with continuous sentence-encoder embeddings of future segments via conditional flow matching. The stochastic flow source preserves multi-modality by construction; the flow-matching head is discarded at inference, adding zero deployment cost. On a large-scale industrial dialogue dataset (Qwen3-32B, 9 personas), SFR improves output diversity, style fidelity, and response quality over SFT. We further validate on the public LiveCodeBench-v5 (Qwen2.5-Coder-7B-Instruct), where SFR consistently improves pass@k, confirming generality beyond stylized dialogue. A controlled comparison on MBPP reveals Multi-Token Prediction to be a degenerate special case of SFR.

URL PDF HTML ☆

赞 0 踩 0

2605.27970 2026-05-28 cs.AI

Geometry of Human Perceptual Domains Emerges Transiently in LLM Representations

人类感知域的几何结构在LLM表征中短暂出现

Simardeep Singh, Paras Chopra

发表机构 * Indian Institute of Technology Roorkee（印度理工学院罗尔基分校）

AI总结研究大型语言模型内部表征中是否出现与人类感知组织相似的几何结构，发现多个感知域的几何结构在中间层短暂涌现，且与人类基准对齐。

Comments 19 Pages, 28 Figures

详情

AI中文摘要

虽然大型语言模型（LLM）仅基于文本数据进行训练，但先前的工作表明，它们的内部表征在嵌入空间中可能展现出丰富的几何结构。基于这一研究方向，我们调查了这种结构是否与不同领域（例如颜色、音高、情感和味觉）的人类感知组织相似。具体来说，我们研究了多个开源Transformer架构的残差流中，与感知模态对应的内在几何结构逐层涌现的情况。我们的结果揭示了三个关键发现。首先，我们观察到多个感知域的逐层几何结构涌现，尽管训练过程中没有任何直接的感知监督。其次，这些感知域表现出不同的涌现轮廓，几何结构及其与人类基准的一致性在深度上遵循领域和模型特定的轨迹。第三，这种涌现遵循一致的表征轨迹：几何结构在早期层较弱或分散，在中间层逐渐组织化，在后期层减弱，表明感知几何结构作为模型内部转换管道的一部分短暂出现。这为理解类人感知几何结构在LLM中如何以及何处出现提供了新见解，为内部表征的机制分析提供了原则性途径。

英文摘要

While large language models (LLMs) are trained purely on textual data, prior work has shown that their internal representations can exhibit rich geometric structure in embedding space. Building on this line of work, we investigate whether such structure is similar to human perceptual organisation across different domains (e.g., color, pitch, emotion, and taste). Specifically, we study the layer-wise emergence of intrinsic geometrical structure corresponding to perceptual modalities within the residual streams of multiple open-weight transformer architectures. Our results reveal three key findings. First, we observe the emergence of layer-wise geometric structure across multiple perceptual domains, despite the absence of any direct perceptual supervision during training. Second, these perceptual domains exhibit distinct emergence profiles, with both geometric structure and its alignment with human baselines following domain- and model-specific trajectories across depth. Third, this emergence follows a consistent representational trajectory: geometry is weak or diffuse in early layers, becomes progressively organised in intermediate layers, and is attenuated in later layers, suggesting that perceptual geometry arises transiently as part of the model's internal transformation pipeline. This provides new insight into how and where human-like perceptual geometry arises in LLMs, offering a principled pathway for mechanistic analysis of internal representations.

URL PDF HTML ☆

赞 0 踩 0

2605.27965 2026-05-28 cs.AI

The Shape of Overthinking: Backtracking Bursts in Long Reasoning Traces

过度思考的形状：长推理轨迹中的回溯爆发

Navid Rezazadeh, Arash Gholami Davoodi

发表机构 * University of California, Irvine（加州大学尔湾分校）； Carnegie Mellon University（卡内基梅隆大学）

AI总结通过分析长推理轨迹中的回溯动态，发现早期孤立修复通常与正确推理兼容，而错误轨迹更常出现持续且聚集的晚期中度至重度回溯，并基于此提出爆发感知过滤策略以区分可恢复修复与潜在不稳定。

详情

AI中文摘要

推理模型通常生成长轨迹，其中有用的自我纠正和无效的修改难以区分。我们通过回溯动态研究这种区别：长形式推理轨迹中的局部重新考虑、撤回或重新推导。在6,000条Qwen3-8B AIME轨迹上，我们标注了片段级别的回溯严重性，并分析了事件时序、归一化深度和局部爆发结构。我们发现早期孤立修复通常与正确推理兼容，而错误轨迹更常显示中度至重度回溯，这些回溯持续存在并聚集在后期。跨语料库检查显示，在额外的模型/领域对中存在相同的定性不对称性。过滤分析将信号实例化为前缀因果选择性早期退出策略：在浅层和中间深度，爆发感知过滤优于固定长度过滤，同时仅使用前缀可用特征。中等长度截断仍然是强大的完整轨迹基线，但爆发感知控制提供了一种可部署的机制，用于区分可恢复修复与潜在不稳定。

英文摘要

Reasoning models often generate long traces in which useful self-correction and unproductive revision are hard to distinguish. We study this distinction through backtracking dynamics: local reconsideration, retraction, or re-derivation inside long-form reasoning traces. On 6{,}000 Qwen3-8B AIME traces, we annotate segment-level backtrack severity and analyze event timing, normalized depth, and local burst structure. We find that early isolated repair is often compatible with correct reasoning, whereas incorrect traces more often show moderate-to-severe backtracks that persist and cluster late. Cross-corpus checks show the same qualitative asymmetry across additional model/domain pairs. Filtering analyses instantiate the signal as a prefix-causal selective early-exit policy: at shallow and intermediate depths, burst-aware filtering outperforms fixed length-based filtering while using only prefix-available features. Moderate length cutoffs remain strong completed-trace baselines, but burst-aware control provides a deployable mechanism for separating recoverable repair from likely instability.

URL PDF HTML ☆

赞 0 踩 0

2605.27962 2026-05-28 cs.CV

Bridging the Generalization Gap in Adverse Weather Segmentation: A Training Recipe Perspective

缩小恶劣天气分割中的泛化差距：训练方案视角

Cong Xu, Pu Luo, Yumei Li, Boyou Xue

发表机构 * Xidian University（西安电子科技大学）

AI总结本文从训练方案角度出发，通过域自适应微调、多源数据混合、场景平衡采样和合成退化增强等方法，显著缩小了恶劣天气语义分割中的验证-测试泛化差距。

详情

AI中文摘要

本文描述了我们在第8届UG2+研讨会（CVPR 2026）Track 2中的方法，该赛道针对五种天气条件（模糊、黑暗、雪、雾和眩光）退化的户外场景进行语义分割。我们观察到一个核心挑战是严重的泛化差距——在验证集上表现良好的模型在测试集上往往崩溃。例如，SegFormer-B5从验证到测试下降了16.1 mIoU点，表明仅靠模型容量不足以实现鲁棒性。我们研究精心设计的训练方案（而非架构复杂性）是否可以解决这一差距。从预训练的SegMAN-S骨干开始，我们系统地研究了域自适应微调、多源数据混合、场景平衡采样和合成退化增强的效果。我们的最终系统在官方测试集上达到了59.9%的mIoU，同时验证-测试差距仅为6.5个点——不到更大模型的一半。我们分析了架构修改、损失函数变体和模型缩放的负面结果，为有限数据下天气鲁棒分割提供实用见解。

英文摘要

This paper describes our approach for the 8th UG2+ Workshop (CVPR 2026) Track~2, which targets semantic segmentation of outdoor scenes degraded by five weather conditions: blur, darkness, snow, haze, and glare. A central challenge we observe is a severe generalization gap -- models that perform well on the validation set often collapse on the test set. For instance, SegFormer-B5 drops 16.1 mIoU points from validation to test, suggesting that model capacity alone is insufficient for robustness. We investigate whether a carefully designed training recipe, rather than architectural complexity, can address this gap. Starting from a pre-trained SegMAN-S backbone, we systematically study the effects of domain-adaptive fine-tuning, multi-source data mixing, scene-balanced sampling, and synthetic degradation augmentation. Our final system achieves 59.9\% mIoU on the official test set while maintaining a validation-test gap of only 6.5 points -- less than half that of larger models. We analyze negative results from architectural modifications, loss function variants, and model scaling to provide practical insights for weather-robust segmentation under limited data.

URL PDF HTML ☆

赞 0 踩 0

2605.27960 2026-05-28 cs.CV

Mags-RL: Wearing Multimodal LLMs a Magnifying Glass via Agentic Reinforcement Learning For Complex Scene Reasoning

Mags-RL: 通过智能体强化学习为多模态大语言模型戴上放大镜以进行复杂场景推理

Xuanzhao Dong, Wenhui Zhu, Peijie Qiu, Xiwen Chen, Xiaobing Yu, Xin Li, Zhipeng Wang, Shao Tang, Gen Li, Yujian Xiong, Hao Wang, Yanxi Chen, Prayag Tiwari, Yalin Wang

发表机构 * Arizona State University（亚利桑那州立大学）； Clemson University（克莱姆森大学）； Washington University in St. Louis（圣路易斯华盛顿大学）； Halmstad University（哈姆斯塔德大学）； Florida State University（佛罗里达州立大学）； Rice University（里士满大学）

AI总结提出Mags-RL框架，通过智能体强化学习让多模态大语言模型调用超分辨率代理进行高分辨率细粒度检查，实现两轮推理以提升复杂场景下的视觉推理能力。

详情

AI中文摘要

尽管多模态大语言模型（MLLMs）广受欢迎且成功，但它们在准确解释图像方面常常遇到困难，这限制了它们在复杂场景（如高物体密度和复杂背景杂乱）中的推理能力。先前的工作主要通过引入额外的显式视觉线索（如需要额外标注的边界框）来解决这一限制。此外，由此产生的低分辨率裁剪往往丢失了MLLMs进行准确推理所需的细粒度细节。因此，我们提出了Mags-RL，一个智能体强化学习（RL）框架，它为MLLMs配备了一个外部超分辨率“放大镜”代理，用于高分辨率细粒度检查。具体来说，该模型执行两轮推理：第一轮，它生成初始推理并自主识别感兴趣区域，无需依赖额外标注；第二轮，它调用超分辨率代理裁剪并放大这些区域，然后重新审视并验证其先前的推理以产生最终答案。我们还引入了一种新颖的课程学习策略，实现了数据高效的RL训练，仅需少至40个训练样本即可达到合理的性能。在VSR、TallyQA和GQA子集上的实验表明，与近期强竞争方法相比，它表现出优越的性能，展示了具有精确视觉基础的高质量推理。代码和权重将很快发布。

英文摘要

Despite their popularity and success, Multimodal Large Language Models (MLLMs) often struggle to interpret images accurately, which limits their reasoning capability in complex scenarios (e.g., high object density and complex background clutter). Prior work mainly addresses this limitation by incorporating explicit visual cues like bounding boxes that require extra annotations. In addition, the resulting low-resolution crops often miss fine-grained details that MLLMs require for accurate reasoning. Therefore, we propose Mags-RL, an Agentic Reinforcement Learning (RL) framework that equips MLLMs with an external super-resolution "magnifying glass" agent for high-resolution fine-grained inspection. Specifically, the model performs two-round reasoning: in the first round, it generates an initial rationale and autonomously identifies regions of interest without relying on additional annotations; in the second round, it invokes a super-resolution agent to crop and upscale those regions, then revisits and verifies its earlier reasoning to produce the final answer. We also introduce a novel curriculum learning strategy that enables data-efficient RL training, needing as few as only 40 training samples to achieve reasonable performance. Experiments on VSR, TallyQA, and GQA subsets show its superior performance against recent strong competing methods, demonstrating high-quality reasoning with precise visual grounding. Code and weights will be released soon.

URL PDF HTML ☆

赞 0 踩 0

2605.27958 2026-05-28 cs.CL cs.AI cs.LG

Pressure-Testing Deception Probes in LLMs: Scaling, Robustness, and the Geometry of Deceptive Representations

压力测试LLM中的欺骗探针：扩展性、鲁棒性与欺骗表示的几何结构

Sachin Kumar

发表机构 * LexisNexis（LexisNexis公司）

AI总结本文通过系统压力测试，诊断线性探针在分布偏移下失效的原因，发现风格增强可恢复近完美检测，并证明欺骗编码非单一线性方向或熵代理，而是分布式亚阈值特征。

Comments Accepted at the GEM Workshop @ ACL 2026

详情

AI中文摘要

基于LLM激活训练的线性探针越来越多地被提议作为欺骗检测指标，但在干净基准上报告AUROC超过0.96，而在分布偏移下崩溃。本文系统地对Gemma 3模型家族（1B-27B参数）的探针指标进行压力测试，诊断其失败原因而不仅仅是记录失败。我们测试了关于欺骗编码的四个假设：（1）单一线性方向，（2）多维子空间，（3）凸锥包，（4）熵代理。我们的设计包括跨域转移矩阵、基于排列零基线的多维探针分析、熵残差化测试以及8种风格偏移下的干扰评估。我们发现：（a）探针在干净数据上达到近乎完美的AUROC（>=0.998），但在风格偏移下崩溃；风格增强的探针在未见风格上恢复近乎完美的检测（平均AUROC 0.979-0.983）；（b）单一方向假设被拒绝（k=1仅捕获0.61-0.80 AUROC），跨域转移失败被确认为几何原因而非层不匹配驱动；（c）熵代理假设被拒绝（最大|rho|=0.454，残差化后最大Delta-AUROC=0.004）；（d）欺骗并未形成显著的线性子空间（每域k*=0），但多维探针（k>=5）通过分布式亚阈值特征恢复信号。探针脆弱性反映了分布狭窄性而非架构限制：风格增强的探针在4B和27B均恢复近乎完美的检测，表明逆缩放模式是训练分布伪影而非真正的规模依赖现象。

英文摘要

Linear probes trained on LLM activations are increasingly proposed as deception-detection metrics, yet report AUROC exceeding 0.96 on clean benchmarks while collapsing under distributional shift. This paper systematically pressure-tests probe-based metrics across the Gemma 3 model family (1B-27B parameters), diagnosing why they fail rather than merely documenting that they fail. We test four hypotheses about deception encoding: (1) single linear direction, (2) multi-dimensional subspace, (3) convex conic hull, (4) entropy proxy. Our design includes cross-domain transfer matrices, multi-dimensional probe analysis with permutation null baselines, entropy-residualization tests, and distractor evaluations across 8 stylistic shifts. We find that: (a) probes achieve near-perfect AUROC (>=0.998) on clean data but collapse under stylistic shifts; style-augmented probes recover near-perfect detection (mean AUROC 0.979-0.983) on unseen styles; (b) the single-direction hypothesis is rejected (k=1 captures only 0.61-0.80 AUROC), with cross-domain transfer failure confirmed as geometric rather than layer-mismatch-driven; (c) the entropy-proxy hypothesis is rejected (max |rho|=0.454, max Delta-AUROC after residualization=0.004); and (d) deception does not form a significant linear subspace (per-domain k*=0), yet multi-dimensional probes (k>=5) recover the signal through distributed sub-threshold features. Probe fragility reflects distributional narrowness rather than an architectural limitation: style-augmented probes recover near-perfect detection at both 4B and 27B, establishing that the inverse scaling pattern is a training-distribution artifact rather than a genuine scale-dependent phenomenon.

URL PDF HTML ☆

赞 0 踩 0

2605.27957 2026-05-28 cs.CL

DisasterBench: Benchmarking LLM Planning under Typed Tool Interface Constraints

DisasterBench: 在类型化工具接口约束下基准测试LLM规划

Zhitong Chen, Kai Yin, Weifeng Zhang, Zhiyuan Wang, Xiangjue Dong, Chengkai Liu, Zhewei Liu, Yiming Xiao, Ali Mostafavi, James Caverlee

发表机构 * Texas A&M University（德克萨斯A&M大学）； University of Toronto（多伦多大学）

AI总结提出DisasterBench基准，通过类型化工具接口评估LLM在灾害响应中的结构化多智能体规划能力，并引入首次故障点（FPoF）方法进行步骤级故障归因，揭示语义推理与执行约束之间的差距。

详情

AI中文摘要

灾害造成严重的社会影响，需要快速协调异构AI工具（从卫星分析到洪水预测和损害评估）形成连贯的多步骤工作流。随着LLM越来越多地充当此类管道的编排者，有效的协调需要的不仅仅是选择语义上合理的工具：LLM必须生成具有正确参数绑定和依赖传播的可执行工作流。我们引入了DisasterBench，这是一个基准，用于评估在语义相似但操作上不同的灾害响应工具上的结构化多智能体规划。为了实现步骤级故障归因，我们进一步提出了首次故障点（FPoF），它定位预测工作流中最早的根因，将主要错误与下游级联效应分开。我们的评估揭示了三个发现：规划方法的有效性强烈依赖于模型容量；工具不匹配和参数绑定错误主导了首次故障，揭示了语义基础和执行一致性是不同瓶颈；冗长的中间推理可能与结构化输出要求产生指令冲突，破坏计划生成。总之，这些发现凸显了语义推理与执行基础协调之间的根本差距，强调了需要联合建模语义意图、执行约束和工作流一致性的规划框架。代码、数据和评估资源可在 https://github.com/TamuChen18/DisasterBench_Open 获取。

英文摘要

Disasters cause severe societal impacts, demanding rapid coordination of heterogeneous AI tools, from satellite analysis to flood prediction and damage assessment, into coherent multi-step workflows. As LLMs increasingly serve as orchestrators of such pipelines, effective coordination requires more than selecting semantically plausible tools: LLMs must generate executable workflows with correct parameter binding and dependency propagation. We introduce DisasterBench, a benchmark for evaluating structured multi-agent planning over semantically similar but operationally distinct disaster-response tools. To enable step-level failure attribution, we further propose First-Point-of-Failure (FPoF), which localizes the earliest root cause in a predicted workflow, separating primary errors from downstream cascading effects. Our evaluation reveals three findings: planning method effectiveness depends strongly on model capacity; tool mismatch and parameter-binding errors dominate first failures, revealing semantic grounding and execution consistency as distinct bottlenecks; and verbose intermediate reasoning can create instruction clash with structured output requirements, disrupting plan generation. Together, these findings highlight a fundamental gap between semantic reasoning and execution-grounded coordination, underscoring the need for planning frameworks that jointly model semantic intent, execution constraints, and workflow consistency. Code, data, and evaluation resources are available at: https://github.com/TamuChen18/DisasterBench_Open

URL PDF HTML ☆

赞 0 踩 0

2605.27954 2026-05-28 cs.LG

Cyclical Entropy Eruption: Entropy Dynamics in Agent Reinforcement Learning

循环熵爆发：智能体强化学习中的熵动力学

Wendi Li, Shawn Im, Sharon Li

发表机构 * Department of Computer Sciences, University of Wisconsin–Madison（威斯康星大学麦迪逊分校计算机科学系）

AI总结本文发现智能体强化学习训练中存在循环熵爆发现象，并提出了SEAL辅助损失函数来稳定训练、提升性能。

详情

AI中文摘要

智能体大型语言模型通过推理目标、调用工具和与外部环境交互，越来越多地被用于解决现实世界任务。强化学习为改进这些行为提供了自然框架，最近的智能体RL方法在多个领域取得了强劲成果。然而，智能体RL的训练动力学仍然知之甚少，限制了诊断不稳定性和设计更有效训练算法的能力。在这项工作中，我们识别了智能体RL中一个先前未被充分探索的现象，我们称之为循环熵爆发。与单轮推理RL（其中熵通常崩溃并保持低位）不同，智能体RL训练表现出独特的重复循环：熵急剧爆发然后逐渐消退。我们将这种动态分解为三个阶段，并对每个阶段进行理论和实证分析，解释其循环振荡的机制。我们进一步表明，一旦在爆发期间获得，诸如句子重复和幻觉等退化模式可以在循环中持续并累积。受这些发现的启发，我们提出了SEAL（分离增强型智能体学习），一种轻量级辅助损失，它在表示空间中分离正确和错误轨迹，直接针对熵爆发的根本原因。跨多个基准、模型和RL算法的实验表明，SEAL稳定了训练并产生了更强的下游智能体性能。

英文摘要

Agentic large language models are increasingly used to solve real-world tasks by reasoning over goals, invoking tools, and interacting with external environments. Reinforcement learning provides a natural framework for improving these behaviors, and recent agent RL methods have achieved strong results across domains. However, the training dynamics of agent RL remain poorly understood, limiting our ability to diagnose instabilities and design more effective training algorithms. In this work, we identify a previously underexplored phenomenon in agent RL, which we term cyclical entropy eruption. Unlike single-turn reasoning RL, where entropy typically collapses and stays low, agent RL training exhibits unique recurring cycles of sharp entropy eruption and gradual subsidence. We decompose this dynamic into three phases and provide theoretical and empirical analyses of each, explaining the mechanisms underlying its cyclical oscillation. We further show that degenerate patterns such as sentence duplication and hallucination, once acquired during eruption, can persist and accumulate across cycles. Motivated by these findings, we propose SEAL (Separation-Enhanced Agent Learning), a lightweight auxiliary loss that separates correct and incorrect trajectories in representation space, directly targeting the root cause of entropy eruption. Experiments across multiple benchmarks, models, and RL algorithms demonstrate that SEAL stabilizes training and yields stronger downstream agent performance.

URL PDF HTML ☆

赞 0 踩 0

2605.27952 2026-05-28 cs.CV cs.RO

Con-DSO: Learning Short-Horizon Consistency Priors for RGB-D Direct Sparse Odometry

Con-DSO：学习RGB-D直接稀疏里程计的短时一致性先验

Haolan Zhang, Thanh Nguyen Canh, Chenghao Li, Ziyan Gao, Xiongwen Jiang, Nak Young Chong

发表机构 * School of Information Science, Japan Advanced Institute of Science and Technology（信息科学学系，日本科学技术先进研究院）； College of Information Engineering, Shenyang University of Chemical Technology（信息工程学院，沈阳化学工业大学）

AI总结提出Con-DSO框架，通过预测光度与深度几何一致性不确定性，实现质量感知的像素选择和加权，提升RGB-D直接稀疏里程计在动态、遮挡等挑战环境下的鲁棒性。

Comments Submitted

详情

AI中文摘要

视觉里程计（VO）是机器人和增强现实中的基础组件。RGB-D直接VO受益于度量深度测量，但在动态物体、遮挡、光照变化和不可靠深度违反直接对齐所使用的短时光度和深度几何一致性假设的挑战环境中，性能会下降。现有方法通过语义过滤、显式遮挡推理、光照适应或手工几何准则来缓解这些问题，但通常依赖外部模块或针对个别故障模式的固定假设，限制了其灵活性和以统一方式处理多样挑战的能力。本文提出Con-DSO，一种一致性感知的RGB-D直接稀疏里程计框架，从时间相邻的RGB-D帧对预测密集的光度和深度几何一致性不确定性。一致性网络通过流引导的光度误差和投影深度一致性误差进行训练，使得一致性违规可表示为像素级不确定性。这些成对不确定性预测被转换为关键帧跟踪的主机侧质量先验。该先验随后通过质量感知的支持像素选择和位姿估计中的解耦光度-几何加权应用于VO，使得不可靠观测持续衰减，而非硬拒绝或基于阈值的门控。在五个公开RGB-D基准上的实验表明，与直接RGB-D VO基线相比，在ICL-NUIM上绝对轨迹误差降低超过20%，在RGB-D Scenes V2、TUM/Bonn Dynamic和OpenLORIS序列上降低50%-80%。

英文摘要

Visual odometry (VO) is a fundamental component in robotics and augmented reality. RGB-D direct VO benefits from metric depth measurements, but it can degrade in challenging environments, where dynamic objects, occlusions, illumination changes, and unreliable depth violate the short-horizon photometric and depth-geometric consistency assumptions used by direct alignment. Existing approaches mitigate these issues through semantic filtering, explicit occlusion reasoning, illumination adaptation, or hand-crafted geometric criteria, but often rely on external modules or fixed assumptions tailored to individual failure modes, limiting their flexibility and ability to handle diverse challenges in a unified manner. In this work, we propose Con-DSO, a consistency-aware RGB-D direct sparse odometry framework that predicts dense photometric and depth-geometric consistency uncertainty from temporally adjacent RGB-D frame pairs. The consistency network is trained using flow-guided photometric errors and projective depth-consistency errors, allowing consistency violations to be represented as pixel-level uncertainty. These pairwise uncertainty predictions are converted into a host-side quality prior for keyframe-based tracking. The prior is then applied to VO through quality-aware support-pixel selection and decoupled photometric-geometric weighting during pose estimation, enabling continuous attenuation of unreliable observations rather than hard rejection or threshold-based gating. Experiments on five public RGB-D benchmarks show substantial gains over direct RGB-D VO baselines, with over 20\% absolute trajectory error reduction on ICL-NUIM and 50\%--80\% reductions on RGB-D Scenes V2, TUM/Bonn Dynamic, and OpenLORIS sequences.

URL PDF HTML ☆

赞 0 踩 0

2605.27950 2026-05-28 cs.CV

SEMAGIC: 从野外图像中学习语义一致的可变形3D表示

Sky Cen, Wufei Ma, Guofeng Zhang, Alan Yuille, Adam Kortylewski

发表机构 * Johns Hopkins University（约翰霍普金斯大学）； CISPA Helmholtz Center for Information Security（信息安全霍普金斯中心）

AI总结针对现有可变形3D重建方法语义对应不稳定的问题，提出SEMAGIC框架，通过特征级一致性损失和顶点索引条件变形，在重建过程中强制语义一致性，从而提升类别级语义对应性能。

详情

AI中文摘要

从单视图野外图像中学习可变形3D物体模型已实现了无需监督的令人印象深刻的3D形状重建。然而，这些模型是否捕捉到下游任务所需的语义结构仍不清楚。我们发现，现有的可变形重建方法尽管生成了视觉上合理的几何形状，但在实例间产生了不稳定的对应关系，并在语义对应基准上表现不佳。我们引入了SEMAGIC，一个从单视图野外图像中学习语义一致的可变形3D表示的框架。SEMAGIC不将重建视为最终目标，而是将可变形建模作为发现类别级对应关系的机制。每个类别由一个规范模板网格和一个学习到的变形场表示，其功能类似于一个从图像特征重建实例几何的自编码器，使得顶点能够在实例间保持一致的语义含义。训练过程中通过(i)对齐规范网格和变形网格之间语义特征的特征级一致性损失，以及(ii)保持实例间语义对应的顶点索引条件变形，来强制语义一致性。通过将几何变形与语义对齐显式耦合，SEMAGIC生成了在类别内变化中保持稳定部件对应的表示。实验表明，SEMAGIC在SPair-71k上将可变形模型的语义对应提高了+14.7 PCK@0.1，确立了可变形模型作为有效语义3D表示的地位。

英文摘要

Learning deformable 3D object models from single-view in-the-wild images has enabled impressive 3D shape reconstruction without supervision. However, it remains unclear whether these models capture the semantic structure required for downstream tasks. We find that existing deformable reconstruction approaches, despite producing visually plausible geometry, yield unstable correspondences across instances and perform poorly on semantic correspondence benchmarks. We introduce SEMAGIC, a framework for learning semantically consistent deformable 3D representations from single-view in-the-wild images. Rather than treating reconstruction as the end goal, SEMAGIC uses deformable modeling as a mechanism to discover category-level correspondences. Each category is represented by a canonical template mesh and a learned deformation field, functioning similarly to an autoencoder that reconstructs instance geometry from image features, enabling vertices to maintain consistent semantic meaning across instances. Semantic consistency is enforced during training through (i) a feature-level consistency loss aligning semantic features between canonical and deformed meshes, and (ii) vertex-index-conditioned deformation that preserves semantic correspondence across instances. By explicitly coupling geometric deformation with semantic alignment, SEMAGIC produces representations that maintain stable part correspondences across intra-category variation. Experiments demonstrate that SEMAGIC improves semantic correspondence of deformable models by +14.7 PCK@0.1 on SPair-71k, establishing deformable models as effective semantic 3D representations.

URL PDF HTML ☆

赞 0 踩 0

2605.27934 2026-05-28 cs.CL

结构引导的视觉扰动中和用于大型视觉语言模型

Yuanhe Zhang, Xueting Wang, YanBin Ren, Haoran Gao, Xinhan Zheng, Zhenhong Zhou, Fanyu Meng, Li Sun, Sen Su

发表机构 * Beijing University of Posts and Telecommunications（北京邮电大学）； University of Science and Technology of China（中国科学技术大学）； JIUTIAN Research（JIUTIAN研究所）； Nanyang Technological University（南洋理工大学）； Chongqing University of Posts and Telecommunications（重庆邮电大学）

AI总结提出结构诱导引导中和（SIGN）框架，通过先验结构提取和动态引导中和实现轻量级、即插即用的对抗性防御，在仅0.5%像素修改和0.16秒每图下达到87%以上防御成功率。

详情

AI中文摘要

图像输入使大型视觉语言模型（LVLMs）能够感知细粒度的视觉信息，但也引入了一个像素级攻击面，通过该攻击面，对抗性扰动可以引发不安全的模型行为。然而，大多数现有防御是为传统计算机视觉场景设计的，因此常常忽略LVLMs所需的跨模态对齐，导致性能下降。同时，针对LVLMs的有限防御通常需要大量的图像修改并引入可观的计算开销，从而损害推理质量和效率。为解决这些限制，我们提出了结构诱导引导中和（SIGN），一个轻量级、即插即用的防御框架，通过先验结构提取提高LVLM兼容性，并通过动态引导中和实现高效的扰动抑制。大量实验表明，SIGN在仅0.5%像素修改和每张图像0.16秒的情况下实现了超过87%的防御成功率，同时几乎保留了原始视觉表示和良性任务性能。我们的工作为需要昂贵模型训练的防御提供了一种轻量级替代方案，并突显了利用视觉编码器进行高效对抗性保护的潜力。我们的代码已在 https://anonymous.4open.science/r/SIGN-BCB1 开源。

英文摘要

Image inputs enable Large Vision Language Models (LVLMs) to perceive fine-grained visual information, but also introduce a pixel-level attack surface through which adversarial perturbations can elicit unsafe model behaviors. However, most existing defenses are designed for traditional computer vision settings and thus often overlook the cross-modal alignment required by LVLMs, leading to degraded performance. Meanwhile, the limited defenses tailored to LVLMs often require substantial image modifications and introduce considerable computational overhead, thereby compromising inference quality and efficiency. To address these limitations, we propose Structure-Induced Guided Neutralization (SIGN), a lightweight, plug-and-play defense framework that improves LVLM compatibility via Prior Structural Extraction and achieves efficient perturbation suppression via Dynamic Guided Neutralization. Extensive experiments show that SIGN achieves over 87\% defense success rate with only 0.5\% pixel modification and 0.16 seconds per image, while nearly preserving original visual representations and benign task performance. Our work offers a lightweight alternative to defenses that require costly model training and highlights the potential of exploiting a vision encoder for efficient adversarial protection. Our code is open source on https://anonymous.4open.science/r/SIGN-BCB1.

URL PDF HTML ☆

赞 0 踩 0

2605.27924 2026-05-28 cs.CV

SIGMA: Semantic-Difference Instruction-Grounding Mask Annotator for Text-Driven Image Manipulation Localization

SIGMA: 基于语义差异的指令引导掩码标注器用于文本驱动图像操作定位

Peiyu Zhuang, Jianquan Yang, Haodong Li, Zhuoying Cai, Ruitao Xie, Jishen Zeng, Baoying Chen, Jiwu Huang, Xiaochun Cao

发表机构 * Shenzhen Campus of Sun Yat-sen University（中山大学深圳校区）； Guangdong Provincial Key Laboratory of Intelligent Information Processing and Shenzhen Key Laboratory of Media Security（广东省智能信息处理重点实验室和深圳媒体安全重点实验室）； Shenzhen University of Advanced Technology and Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences（深圳先进技术大学和深圳先进技术研究所，中国科学院）； Alibaba Group（阿里巴巴集团）； Shenzhen MSU-BIT University（深圳MSU-BIT大学）

AI总结提出SIGMA方法，通过视觉基础模型中的语义特征差异和指令引导的空间先验，自动从公开编辑数据集中生成像素级掩码，用于训练图像操作定位模型，在五个基准上F1提升12.20%，并生成约110万训练集使六个检测器平均F1提升18.34%。

详情

AI中文摘要

文本驱动的图像编辑发展迅速，但可靠地定位这些操作需要在大规模像素标注数据集上训练的图像操作定位（IML）模型，目前尚无低成本获取此类训练数据的方法。我们观察到这些数据实际上已经以伪装形式存在：公开编辑数据集包含数百万个与IML训练样本结构相同的（原始、编辑）图像对，仅缺少像素级掩码。自动恢复这些掩码并非易事：像素差异被扩散引起的所有像素扰动淹没，而仅基于指令的定位只能定位提示描述的内容，遗漏了意外的编辑副作用。我们提出SIGMA（语义差异指令引导掩码标注器），它在视觉基础骨干网络中进行语义特征差异计算，并通过双向跨模态精炼将指令导出的空间先验注入视觉流，在编辑器忠实实现用户意图时放大预期编辑区域的差异信号。SIGMA通过两个互补阶段训练：第一阶段在修复掩码上进行监督；第二阶段通过VAE往返噪声校准、EMA自训练和编辑噪声解耦损失来弥合扩散域偏移。SIGMA在五个基准上优于现有自动掩码生成器（F1提升12.20%，IoU提升11.16%）。当应用于公开编辑语料库时，它生成了约110万IML训练集，使六个不同检测器在五个数据集上平均F1提升18.34%，将以前未使用的编辑数据转化为IML的模型无关监督资源。论文被接收后我们将立即发布完整代码库。

英文摘要

Text-driven image editing has advanced rapidly, but reliably localizing these manipulations requires image manipulation localization (IML) models trained on large pixel-annotated datasets, and there is still no low-cost way to obtain such training data at scale. We observe that these data already exist in disguise: public editing datasets contain millions of structurally identical (original, edited) pairs to IML training samples, lacking only pixel-level masks. Recovering these masks automatically is non-trivial: pixel differencing is overwhelmed by diffusion-induced perturbations across all pixels, and instruction-only grounding localizes only what the prompt describes, missing unintended editor side-effects. We propose SIGMA (Semantic-difference Instruction-Grounding Mask Annotator), which performs semantic-feature differencing in a vision foundation backbone and injects an instruction-derived spatial prior into this visual stream via bidirectional cross-modal refinement, amplifying the difference signal at intended-edit regions when the editor faithfully realizes user intent. SIGMA is trained in two complementary stages: Stage I supervises on inpainting masks; Stage II closes the diffusion-domain shift via VAE-roundtrip noise calibration, EMA self-training, and an edit-noise disentanglement loss. SIGMA outperforms existing automatic mask generators on five benchmarks (+12.20% F1, +11.16% IoU). When applied to public editing corpora, it produces a ~1.1M IML training set that improves six diverse detectors by +18.34% F1 across five datasets, turning previously unused editing data into a model-agnostic supervisory resource for IML. We'll release the full codebase as soon as the paper is accepted.

URL PDF HTML ☆

赞 0 踩 0

AI 大模型

视觉与机器人

科学与医疗

Where Does Toxicity Live? Mechanistic Localization and Targeted Suppression in Language Models

Rethinking Visual Neglect: Steering via Context-Preference for MLLM Hallucination Mitigation

Patched-DeltaNet: Token-Level Event-Driven Memory for Linear-Time Anomaly Detection

Geometry-Correct Diffusion Posterior Sampling with Denoiser-Pullback Curvature Guidance and Manifold-Aligned Damping

Law of Neural Interaction: Depth-Width Shape, Interaction Efficiency, and Generalization

An Evolutionary Approach for Designing Stable and Highly Expressible Low-Immunogenicity Therapeutic mRNA Sequences

KVoiceBench, KOpenAudioBench, and KMMAU: Agent-Driven Korean Speech Benchmarks for Evaluating SpeechLMs

STAB: Specification-driven Testing for Algorithmic Bottlenecks

Periodic RoPE for Infinite Context LLMs

ABot-OCR Technical Report

VoiceGiraffe: A Benchmark for Extreme Long-Context Audio-Language Understanding

Semantic Flow Regularization: Teaching LLMs to Generate Diverse Yet Coherent Responses

Geometry of Human Perceptual Domains Emerges Transiently in LLM Representations

The Shape of Overthinking: Backtracking Bursts in Long Reasoning Traces

Bridging the Generalization Gap in Adverse Weather Segmentation: A Training Recipe Perspective

Mags-RL: Wearing Multimodal LLMs a Magnifying Glass via Agentic Reinforcement Learning For Complex Scene Reasoning

Pressure-Testing Deception Probes in LLMs: Scaling, Robustness, and the Geometry of Deceptive Representations

DisasterBench: Benchmarking LLM Planning under Typed Tool Interface Constraints

Cyclical Entropy Eruption: Entropy Dynamics in Agent Reinforcement Learning

Con-DSO: Learning Short-Horizon Consistency Priors for RGB-D Direct Sparse Odometry

Evaluating the Feasibility of Inferring Dietary Behavior Change Receptivity from Egocentric Images of Eating Environment

VLM-Based Advanced Rider Assistance System for Motorcycle Safety

SANTS: A State-Adaptive Scheduler for World Action Models

From Talking to Singing: A New Challenge for Audio-Visual Deepfake Detection

SEMAGIC: Learning Semantically Consistent Deformable 3D Representations from In-the-Wild Images

GeneralThinker: Domain-General Reasoning through Likelihood-Guided Answer-Conditioned Optimization

When Think-with-Image Meets Safety: What Determines Multimodal Jailbreak Robustness?

DiagramRAG: A Lightweight Framework to Retrieve Scientific Diagram for Figure Generation

Structure-Guided Visual Perturbation Neutralization for LVLMs

SIGMA: Semantic-Difference Instruction-Grounding Mask Annotator for Text-Driven Image Manipulation Localization