arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.06819 2026-06-08 cs.CV 新提交

VideoSEG-O3: A Multi-turn Reinforcement Learning Framework for Reasoning Video Object Segmentation

VideoSEG-O3：用于推理视频对象分割的多轮强化学习框架

Ming Dai, Sen Yang, Boqiang Duan, Boyuan Tong, Jiedong Zhuang, Wankou Yang, Jingdong Wang

AI总结提出VideoSEG-O3，首个多轮强化学习框架，通过多轮时空思维链和SEG感知逻辑校准，实现从粗到细的推理视频对象分割，解决复杂视频中的精确像素定位问题。

详情

Comments: ICML2026

AI中文摘要

推理视频对象分割（RVOS）需要时间动态、空间细节和语言推理的复杂集成，以实现精确的像素级定位。现有方法局限于对固定初始输入进行推理，缺乏主动获取更多视觉证据的能力，而这对于解决长或复杂视频中的复杂引用通常至关重要。为了解决这个问题，我们提出了\textbf{VideoSEG-O3}，这是第一个用于RVOS的多轮强化学习框架，模拟人类的“从粗到细”认知过程。它采用\textit{多轮时空思维链}，通过迭代定位关键区间和关键帧来捕获细粒度细节。此外，为了使策略在强化学习阶段能够感知超出\texttt{[SEG]}文本概率的分割质量，我们引入了\textit{SEG感知逻辑校准}，将像素级分割反馈直接集成到令牌级逻辑中。此外，我们设计了一个\textit{解耦思考轨迹}，将推理过程分层分解为时间、空间和语言维度，并构建了\textbf{VTS-CoT}，一个包含全面推理轨迹的专门冷启动数据集。代码和模型将在以下网址发布：this https URL。

英文摘要

Reasoning Video Object Segmentation (RVOS) demands a sophisticated integration of temporal dynamics, spatial details, and linguistic reasoning to achieve precise pixel-level localization. Existing methods are limited to reasoning over fixed initial inputs and lack the capacity to actively acquire further visual evidence, which is often essential for resolving complex references in long or intricate videos. To address this, we propose \textbf{VideoSEG-O3}, the first multi-turn reinforcement learning framework for RVOS that emulates the human \textit{``coarse-to-fine''} cognitive process. It employs a \textit{multi-turn temporal-spatial chain-of-thought} to capture fine-grained details by iteratively pinpointing critical intervals and keyframes. Additionally, to enable the policy to perceive segmentation quality beyond mere text probability of \texttt{[SEG]} during the RL stage, we introduce \textit{SEG-aware logit calibration}, which integrates pixel-wise segmentation feedback directly into the token-level logits. Furthermore, we design a \textit{decoupled thinking trace} to hierarchically decompose the reasoning process into temporal, spatial, and linguistic dimensions, and construct \textbf{VTS-CoT}, a specialized cold-start dataset featuring comprehensive reasoning trajectories. The code and models will be released at https://github.com/Dmmm1997/VideoSEG-O3.

URL PDF HTML ☆

赞 0 踩 0

2606.06813 2026-06-08 cs.CV cs.AI 新提交

Breaking the Lock-in: Diversifying Text-to-Image Generation via Representation Modulation

打破锁定：通过表示调制实现文本到图像生成的多样化

Dahee Kwon, Haeun Lee, Jaesik Choi

AI总结针对文本到图像模型在固定提示下生成样本过于相似的问题，提出无训练表示级干预方法DAVE，通过选择性衰减早期生成中的零频空间平均分量来增强多样性，保持图像质量且计算开销极小。

详情

Comments: Accepted to ICML 2026. Code is available at: https://github.com/daheekwon/DAVE

AI中文摘要

近期基于大规模Transformer骨干和流目标的文本到图像模型在文本-图像对齐和视觉质量方面表现出色，但在固定提示下常生成过于相似的样本。现有的多样性增强方法缓解了这一问题，但通常需要昂贵的采样或辅助优化，带来显著开销。为探究这种同质性的根本原因，我们检查了中间Transformer特征，观察到零频空间平均（DC）分量在生成早期快速收敛，导致早期轨迹锁定，限制了后续变化。基于此观察，我们提出DC衰减多样性增强（DAVE），一种无训练的表示级干预，选择性地在早期阶段衰减该分量。DAVE以可忽略的开销保留采样流程，在保持竞争性图像质量的同时，提高了提示一致性的多样性。

英文摘要

Recent text-to-image models built on large-scale Transformer backbones and flow-based objectives deliver strong text-image alignment and high visual quality, yet often produce overly similar samples under a fixed prompt. Existing diversity-enhancement methods alleviate this issue, but typically require expensive sampling or auxiliary optimization, incurring non-trivial overhead. To investigate the root cause of this homogeneity, we examine intermediate Transformer features and observe that the zero-frequency spatial average (DC) component rapidly converges across seeds early in generation, causing early trajectory lock-in that limits downstream variation. Building on this observation, we propose DC Attenuation for diVersity Enhancement (DAVE), a training-free representation-level intervention that selectively attenuates this component in the early regime. DAVE preserves the sampling pipeline with negligible overhead, improving prompt-consistent diversity while maintaining competitive image quality.

URL PDF HTML ☆

赞 0 踩 0

2606.06805 2026-06-08 cs.RO cs.AI cs.SY eess.SY 新提交

Lane Change Trajectory Planning for Personalized Driving Comfort and Mobility Efficiency

车道变更轨迹规划：个性化驾驶舒适性与移动效率

Haoxuan Dong, Dongjun Li, Ziyou Song

AI总结提出一种神经网络驱动的轨迹规划器，结合三阶多项式轨迹生成与学习模块，通过双头共享骨干和基于误差胜者逻辑回归的统计门控机制，实现个性化舒适性与移动效率的平衡。

详情

Comments: Accepted by the IEEE Intelligent Vehicles Symposium (IEEE IV 2026), Detroit, MI, United States, June 22_25, 2026

AI中文摘要

车道变更涉及同时的纵向和横向运动，这些运动影响驾驶舒适性和移动效率。由于这些运动紧密耦合且存在显著的车辆间差异，车道变更操作的轨迹规划具有高度个性化的特点。本研究提出了一种神经网络驱动的规划器，该规划器将三阶多项式轨迹生成器与学习模块相结合，该学习模块在不同驾驶条件下推断最优轨迹参数。使用具有双头的共享骨干网络，一个头确保全工况操作保障，而另一个头捕捉驾驶员对舒适性或移动效率的特定偏好。通过基于误差胜者逻辑回归的统计门控实现头门控切换机制，该机制在不同驾驶条件下自适应地选择适当的头，从而实现上下文感知的车道变更轨迹规划。代表性案例和蒙特卡洛模拟表明，所提出的规划器在车道变更过程中实现了个性化的舒适性和移动性，而基线则在个性化数据不足或不可用的驾驶条件下确保可行的轨迹。

英文摘要

Lane changing entails simultaneous longitudinal and lateral motions that affect driving comfort and mobility efficiency. Because these motions are tightly coupled and subject to substantial inter-vehicle variability, trajectory planning for lane-change maneuvers is characterized by a highly personalized nature. This study proposes a neural network-driven planner that integrates a third-order polynomial trajectory generator with a learning module that infers optimal trajectory parameters across diverse driving conditions. Using a shared backbone with dual heads, one head ensures all-condition operational guarantees, while the other captures driver-specific preferences for comfort or mobility efficiency. A head-gated switching mechanism, realized through a statistical gate based on error-winner logistic regression, adaptively selects the appropriate head under varying driving conditions, which enables context-aware lane-change trajectory planning. Representative cases and Monte Carlo simulations show that the proposed planner achieves personalized comfort and mobility during lane changes, while the baseline ensures feasible trajectories under driving conditions where personalized data are insufficient or inaccessible.

URL PDF HTML ☆

赞 0 踩 0

2606.06786 2026-06-08 cs.LG cs.NI 新提交

Federated Foundation Models over Vehicular Networks

车辆网络中的联邦基础模型

Kasra Borazjani, Fardis Nadimi, Payam Abdisarabshali, Owen Palinski, Allan Salihovic, Dinh Nguyen, Minghui Liwang, Seyyedali Hosseinalipour

AI总结本文提出将多模态多任务联邦基础模型（M3T FedFMs）集成到车辆网络中的愿景，结合基础模型的表达力与联邦学习的隐私保护分布式学习能力，并通过Waymo数据集案例验证其潜力。

详情

Comments: 8 pages, 4 figures

AI中文摘要

本文提出了一种前瞻性的愿景，将新兴的多模态多任务联邦基础模型（M3T FedFMs）集成到车辆网络中，旨在统一多模态多任务基础模型（M3T FMs）的表达能力与联邦学习（FL）的隐私保护和分布式学习能力。鉴于这一研究方向在很大程度上尚未被探索，我们首先介绍了M3T FedFMs的基本训练/微调原理。然后，我们讨论了它们在车辆网络中的一系列代表性用例，展示了M3T FedFMs在实现下一代车辆智能方面的巨大潜力。之后，我们识别了车辆环境固有的关键约束，这些约束挑战了M3T FedFMs的实际部署，并阐述了一系列前瞻性的研究方向以应对这些挑战。此外，通过在真实车辆数据集（即Waymo开放数据集）上进行的案例研究，我们展示了M3T FedFMs在车辆网络中的前景，并发布了我们的实现以促进可重复性和刺激这一新兴领域的研究（仓库：此 https URL）。

英文摘要

This paper presents a forward-looking vision for integrating the emerging multi-modal multi-task federated foundation models (M3T FedFMs) into vehicular networks, with the goal of unifying the expressive power of multi-modal multi-task foundation models (M3T FMs) with the privacy-preserving and distributed learning capabilities of federated learning (FL). Given the largely underexplored nature of this research direction, we first introduce the fundamental training/fine-tuning principles of M3T FedFMs. We then discuss a range of their representative use cases in vehicular networks, illustrating the significant potential of M3T FedFMs to enable next-generation vehicular intelligence. Afterwards, we identify key constraints inherent to vehicular environments that challenge the practical deployment of M3T FedFMs, and articulate a set of forward-looking research directions to address these challenges. Furthermore, through a case study conducted on a real-world vehicular dataset (i.e., Waymo Open Dataset), we demonstrate the promise of M3T FedFMs for vehicular networks and release our implementation to facilitate reproducibility and stimulate research in this emerging area (repository: https://github.com/KasraBorazjani/vehicular-fedfm)

URL PDF HTML ☆

赞 0 踩 0

2606.06760 2026-06-08 cs.CV 新提交

MedSIGHT: Towards Grounded Visual Comprehension in Medical Large Vision-Language Models

MedSIGHT：迈向医学大型视觉语言模型中的基础视觉理解

Aofei Chang, Le Huang, Alex James Boyd, Parminder Bhatia, Taha Kass-Hout, Fenglong Ma, Cao Xiao

AI总结提出MedSIGHT框架，通过区域感知器、医学区域码本和渐进训练策略，统一医学视觉语言模型的语义理解和像素级分割，在72K数据上达到多模态理解与分割的SOTA。

详情

Comments: Accepted at ICML 2026

AI中文摘要

医学大型视觉语言模型（Med-LVLMs）最近在视觉语言理解和医学图像分割方面取得了显著进展。然而，现有模型仍难以统一这两种能力，而这对于实现连接视觉发现与语义解释的临床推理至关重要。我们提出MedSIGHT，一个统一框架，赋予Med-LVLMs结构化的像素级理解能力，实现基础视觉理解。MedSIGHT引入了一个新颖的区域感知器模块，生成以区域为中心的标记，将空间信息直接编码到语言模型的表示空间中。我们进一步将医学区域码本引入LLM词汇表，使模型能够生成离散的区域代码，作为解剖和病理区域的符号表示。这些代码通过区域感知器解码以重建分割掩码，实现端到端的空间基础。最后，MedSIGHT使用我们提出的渐进训练策略，将区域感知器、码本和LLM组合起来，逐步稳定地对齐这些模块。仅在72K多模态指令对上训练，MedSIGHT在多种成像模态的医学理解和分割任务上均达到了最先进的性能。

英文摘要

Medical large vision-language models (Med-LVLMs) have recently achieved remarkable progress in vision-language comprehension and medical image segmentation. However, existing models still struggle to unify these two capabilities, which is essential for achieving clinically reasoning that connects visual findings with semantic interpretation. We present MedSIGHT, a unified framework that equips Med-LVLMs with structured, pixel-level understanding for grounded visual comprehension. MedSIGHT introduces a novel Region Perceiver module that produces region-centric tokens, encoding spatial information directly into representation space of the language model. We further propose a medical region codebook into the LLM vocabulary, allowing the model to generate discrete region codes as symbolic representations of anatomical and pathological regions. These codes are decoded through the Region Perceiver to reconstruct segmentation mask, achieving end-to-end spatial grounding. Lastly, MedSIGHT combines Region Perceiver, Codebook and LLM using our proposed progressive training strategy to gradually aligns these modules stably. Trained on only 72K multimodal instruction pairs, MedSIGHT achieves state-of-the-art performance across diverse imaging modalities on both medical comprehension and segmentation tasks.

URL PDF HTML ☆

赞 0 踩 0

2606.06748 2026-06-08 cs.CL cs.AI cs.LG 新提交

Evidence Graph Consistency in Retrieval-Augmented Generation: A Model-Dependent Analysis of Hallucination Detection

检索增强生成中的证据图一致性：基于模型的幻觉检测分析

Jianru Shen

AI总结提出证据图一致性（EGC）框架，通过构建局部证据图并计算五种结构一致性指标检测幻觉，发现不同模型族间一致性特征方向相反，表明嵌入图一致性不能作为模型无关的检测信号。

详情

Comments: Accepted at the International Conference on Advanced Machine Learning and Data Science; to appear in the IEEE Xplore proceedings

AI中文摘要

检索增强生成（RAG）减少了但并未消除大型语言模型中的幻觉。现有检测方法依赖于生成答案与检索段落之间的平面相似性，忽略了证据片段与答案声明之间的结构关系。我们提出了证据图一致性（EGC）框架，该框架为每个响应构建一个局部证据图，并计算五种结构一致性度量作为幻觉指标。在RAGTruth的完整问答拆分上，跨六个LLM（5,767个响应）进行评估，EGC揭示了一个一致的模型族分裂：图一致性特征在Llama-2模型中显示出预期的诊断方向，但在GPT-4、GPT-3.5和Mistral-7B中表现出系统性逆转。这种逆转表明不同模型族之间存在定性的不同幻觉模式，并表明基于嵌入的图一致性不能作为模型无关的幻觉检测信号。

英文摘要

Retrieval-Augmented Generation (RAG) reduces but does not eliminate hallucination in large language models. Existing detection methods rely on flat similarity between generated answers and retrieved passages, ignoring structural relationships among evidence pieces and answer claims. We propose Evidence Graph Consistency (EGC), a framework that constructs a local evidence graph per response and computes five structural consistency measures as hallucination indicators. Evaluated on the full question answering split of RAGTruth across six LLMs (5,767 responses), EGC reveals a consistent model-family split: graph consistency features show the expected diagnostic direction for hallucinations in Llama-2 models but exhibit systematic reversal in GPT-4, GPT-3.5, and Mistral-7B. This reversal suggests qualitatively different hallucination patterns across model families and indicates that embedding-based graph consistency cannot serve as a model-independent hallucination detection signal.

URL PDF HTML ☆

赞 0 踩 0

2606.06722 2026-06-08 cs.LG 新提交

Flatland: The Adventures of Gradient Descent with Large Step Sizes

平面国：大步长梯度下降的冒险

Leonardo Galli, Curtis Fox, Wiebke Bartolomaeus, Mark Schmidt, Holger Rauhut

AI总结针对非全局L-光滑的神经网络目标函数，提出一种只需梯度局部Lipschitz连续的定义，设计自适应一阶方法实现大步长并始终处于稳定性边缘，发现过早进入全局平坦区域会降低收敛速度和泛化能力。

详情

Comments: Accepted for the International Conference on Machine Learning (ICML 2026)

AI中文摘要

神经网络的训练通常涉及非全局$L$-光滑的目标函数。对于这些函数，从理论和实践上都很难回答这样一个问题：确保梯度下降（GD）收敛的最大可能步长是多少？我们通过提供“大”步长的统一定义来解决深度学习中长期存在的这个开放问题，该定义仅要求梯度的局部Lipschitz（甚至Hölder）连续性。我们设计了一阶自适应方法，这些方法可证明产生大步长，并表明它们从训练开始就处于稳定性边缘（EoS）。特别地，损失非单调下降，并且步长与锐度（即Hessian矩阵的最大特征值）的乘积在整个训练过程中保持在EoS阈值2以上。使用我们的方法，我们还能将锐度最小化到其全局最小值。与预期相反，我们发现训练过早遇到全局平坦区域可能会减慢收敛速度并损害网络的泛化能力。利用自稳定化论证，我们允许GD进入稍尖锐的谷底，并将不成功的训练运行转变为非常成功的运行。

英文摘要

The training of neural networks often entails objective functions that are not globally $L$-smooth. For these functions, it is both theoretically and practically difficult to reply to the question: what is the largest possible step size that ensures the convergence of gradient descent (GD)? We address this longstanding open question in deep learning by providing a unifying definition of "large" step sizes that requires only local Lipschitz (or even Hölder) continuity of the gradient. We design first-order adaptive methods that provably yield large step sizes and show that they operate at the edge of stability (EoS) right from the start of the training. In particular, the loss decreases nonmonotonically and the product between the step size and sharpness, i.e., the largest eigenvalue of the Hessian, stays above the EoS threshold of 2 throughout training. Using our method, we are also able to minimize the sharpness all the way down to its global minimum. Contrary to expectation, we find that encountering globally-flat regions too early in the training may both slow down convergence and jeopardize the generalization ability of the network. Exploiting a self-stabilization argument, we allow GD to enter slightly sharper valleys and turn unsuccessful training runs into very successful ones.

URL PDF HTML ☆

赞 0 踩 0

2606.06712 2026-06-08 cs.CL cs.AI 新提交

Data-Efficient Autoregressive-to-Diffusion Language Models via On-Policy Distillation

数据高效的自回归到扩散语言模型通过策略内蒸馏

Xingyu Su, Jacob Helwig, Shubham Parashar, Atharv Chagi, Lakshmi Jotsna, Degui Zhi, James Caverlee, Dileep Kalathil, Shuiwang Ji

AI总结提出策略内扩散语言模型（OPDLM），通过策略内蒸馏将自回归模型转换为扩散语言模型，解决分布偏移和训练-推理不匹配问题，实现15倍至7000倍更少训练数据下的强性能。

详情

AI中文摘要

我们研究将自回归模型（ARLM）转换为扩散语言模型（DLM）。与从头预训练不同，先前的工作将ARLM中的因果注意力替换为双向注意力，然后使用DLM目标训练得到的模型。然而，这些方法会导致两种分布偏移。首先，从下一个词预测目标过渡到DLM目标可能会丢弃ARLM在训练期间获得的知识。其次，标准DLM存在训练-推理不匹配，因为训练损失定义在随机掩码序列上，而不是推理时基于置信度解码产生的轨迹。为了解决这两个挑战，我们引入了策略内扩散语言模型（OPDLM），其中采用策略内蒸馏（OPD）进行ARLM到DLM的转换。具体来说，OPDLM通过自OPD训练，其中学生（具有双向注意力的ARLM）生成自己的轨迹，而教师（原始冻结的ARLM）通过在这些轨迹上提供目标logits来蒸馏其知识。通过直接以策略内方式训练，OPDLM消除了DLM中的训练-推理不匹配，同时从原始模型蒸馏增强了ARLM的知识保留。实验结果表明，OPDLM需要15倍到7000倍更少的训练token，并在各种任务上表现出强大的性能。OPDLM避免了DLM预训练的高昂成本，并将DLM转换定位为ARLM后训练的一种形式。

英文摘要

We study the transformation of autoregressive models (ARLMs) into diffusion language models (DLMs). Rather than pretraining from scratch, prior work replaces the causal attention in ARLMs with bidirectional attention and then trains the resulting model using a DLM objective. However, these approaches incur two distribution shifts. First, transitioning from a next-token prediction objective to a DLM objective can discard knowledge acquired by the ARLM during training. Second, standard DLMs suffer from a train-inference mismatch, as the training loss is defined on randomly masked sequences rather than the trajectories encountered at inference produced by confidence-based decoding. To address both challenges, we introduce an On-Policy Diffusion Language Model (OPDLM) in which On-Policy Distillation (OPD) is employed for ARLM-to-DLM transformation. Specifically, OPDLM is trained via self-OPD, where the student, an ARLM with bidirectional attention, generates its own trajectories, and the teacher, the original frozen ARLM, distills its knowledge by providing target logits on these trajectories. By training directly in an on-policy manner, OPDLM eliminates the train-inference mismatch in DLMs, while distillation from the original model enhances knowledge retention from the ARLM. Empirical results demonstrate that OPDLM requires 15x to 7,000x fewer training tokens with strong performance across a wide variety of tasks. OPDLM avoids the prohibitive cost of DLM pretraining and positions DLM transformation as a form of ARLM post-training.

URL PDF HTML ☆

赞 0 踩 0

2606.06708 2026-06-08 cs.CL 新提交

Signal-Driven Observation for Long-Horizon Web Agents

信号驱动观测：面向长程任务的Web智能体

Shubham Gaur, Ian Lane

AI总结提出信号驱动观测（SDO）方法，通过专用子调用读取完整DOM但仅返回任务相关元素，并由轻量信号检测器触发重新调用，解决长程Web智能体中上下文退化问题。

详情

Comments: 10 pages, 1 figure

AI中文摘要

在长程任务中运行的Web智能体在每个动作步骤中都会处理原始DOM和可访问性树——通常包含数万个token——导致上下文逐渐退化，在任务完成前推理能力就已受损。我们认为，将观测频率与动作频率耦合是一种架构性错误。借鉴递归语言模型中查询文档优于整体阅读的见解，我们提出信号驱动观测（SDO）：一个专用子调用读取完整DOM但仅返回任务相关元素及其选择器，并且仅在轻量信号检测器触发时重新调用——触发条件包括URL变化、新出现的可交互元素、动作失败或外部浏览器事件。我们概述了SDO引入的开放问题，并呼吁社区将观测压缩视为Web智能体设计中的核心架构决策。

英文摘要

Web agents operating over long horizons ingest raw DOM and accessibility trees -- routinely tens of thousands of tokens -- at every action step, causing progressive context degradation that erodes reasoning well before tasks complete. We argue that this coupling of observation frequency to action frequency is an architectural mistake. Drawing on the insight from Recursive Language Models that querying a document outperforms reading it wholesale, we propose Signal-Driven Observation (SDO): a dedicated sub-call reads the full DOM but returns only task-relevant elements and their selectors, and is re-invoked only when a lightweight signal detector fires -- triggered by URL transitions, newly visible interactive elements, action failures, or exogenous browser events. We outline the open problems SDO introduces and call on the community to treat observation compression as a core architectural decision in web agent design.

URL PDF HTML ☆

赞 0 踩 0

2606.06694 2026-06-08 cs.LG cs.AI cs.CY 新提交

The Geography of Algorithmic Judgment: LLM Intermediaries, Place Identity, and Racial Steering in Housing Search

算法判断的地理：LLM中介、地方身份与住房搜索中的种族引导

Hana Samad, Trung Lam, Christoph Mügge-Durum, Michael Akinwumi

AI总结通过行为审计七种LLM在四个美国城市的住房推荐，发现种族引导是模型解释性许可的涌现行为，而非静态属性，且城市并非中性测试单元。

详情

Comments: 13 pages with supplemental tables and figures, AIES '26 Submission

AI中文摘要

大型语言模型（LLM）正迅速在住房搜索中扮演中介角色，通过将列表平台集成到对话界面中，调解城市环境中的信息访问、搜索和推荐。我们扩展了先前关于LLM中种族引导的研究，对七个开放权重和闭源LLM在四个美国城市进行了行为审计，测试了三种迭代提示条件下的位置推荐，这些条件逐步添加生活方式偏好背景并反映公平住房配对测试方法。我们发现，引导是模型解释性许可的涌现行为，而非主要静态属性。引导源于用户身份、偏好表达以及模型内化的关于特定城市中地点、偏好和机会的学习表征的空间逻辑之间的相互作用，以及不同类型用户如何与之关联。虽然存在引导，但在评估条件下其方向和幅度并不一致。偏好条件测试通常增加或重新配置了相对于基线条件表现出引导行为的模型数量，表明LLM可能根据用户的种族身份对相同的住房偏好做出不同解释。我们的发现还表明，在基于地点的领域进行LLM评估时，城市并非中性测试单元，一个本地市场的结果不能假定推广到另一个市场。在住房领域，需要本地和领域专业知识，以确保法律和制度对公平住房的承诺不会因采用调解空间访问的AI工具而受到损害。

英文摘要

Large language models (LLMs) are rapidly assuming an intermediary role in housing search through the integration of listing platforms within conversational interfaces, mediating access to information, search, and recommendations within urban settings. We expand on prior work on racial steering in LLMs by conducting a behavioral audit of seven open-weight and closed-source LLMs across four U.S. cities, testing location recommendations across three iterative prompting conditions that progressively add lifestyle preference context and reflect fair housing paired-testing methodologies. We find that steering is an emergent behavior of the model's interpretive license rather than primarily a static property. Steering results from the interaction of a user's identity, preference articulation, and the spatial logic that a model has internalized about learned representations of place, preference, and opportunity in a given city, and how different types of users relate to it. While steering was present, it was not uniform in direction or magnitude across evaluated conditions. Preference-conditioned testing often increased or reconfigured the number of models that exhibited steering behaviors relative to baseline conditions, suggesting that LLMs may interpret what the same housing preference means differently depending on the racial identity of the user. Our findings also demonstrate that the city is not a neutral testing unit for LLM evaluation in place-based sectors, and results from one local market cannot be assumed to generalize to another. Local and domain expertise will be required in the housing sector to ensure that legal and institutional commitments to fair housing are not undermined while adopting AI tools that mediate spatial access.

URL PDF HTML ☆

赞 0 踩 0

2606.06682 2026-06-08 cs.LG 新提交

Spatiotemporal Imputation with Graph-Informed Flow Matching

基于图信息流匹配的时空插补

Zepeng Zhang, Aref Einizade, Jhony H. Giraldo, Olga Fink

AI总结提出GiFlow框架，利用图信息先验和混合向量场模型进行时空插补，优于现有方法。

详情

Comments: Accepted at ICML 2026

AI中文摘要

缺失数据是时空系统中的常见挑战，出现在空气质量监测和城市交通管理等应用中。传统的机器学习方法，如循环神经网络和图神经网络，依赖于迭代传播，这往往会在时间和空间上累积误差。最近的基于扩散的方法减轻了误差传播，但需要迭代采样，并且通常依赖于问题无关的高斯先验，限制了效率和有效性。为了解决这些局限性，我们提出了GiFlow，一种用于时空插补的图信息流匹配框架。GiFlow将典型的高斯先验替换为通过时空滤波可观测信号构建的图信息先验，这更好地使源分布与目标对齐，从而简化了生成轨迹。流场由一个混合向量场模型参数化，该模型整合了空间注意力、时间注意力和时空传播，能够联合建模空间和时间依赖性。在合成和真实世界数据集上的大量实验表明，所提出的GiFlow在时空插补中优于最先进的方法。代码可在该 https URL 获取。

英文摘要

Missing data is a common challenge in spatiotemporal systems, arising in applications such as air quality monitoring and urban traffic management. Traditional machine learning approaches, like recurrent and graph neural networks, rely on iterative propagation, which tends to accumulate errors over time and space. Recent diffusion-based methods mitigate error propagation but require iterative sampling and often depend on problem-agnostic Gaussian priors, limiting both efficiency and effectiveness. To address these limitations, we propose GiFlow, a Graph-Informed Flow Matching framework for spatiotemporal imputation. GiFlow replaces the typical Gaussian prior with a graph-informed prior constructed via spatiotemporal filtering of observable signals, which better aligns the source distribution to the target and thereby simplifies the generation trajectory. The flow field is parameterized by a hybrid vector field model that integrates spatial attention, temporal attention, and spatiotemporal propagation, enabling joint modeling of spatial and temporal dependencies. Extensive experiments on both synthetic and real-world datasets demonstrate that the proposed GiFlow outperforms the state-of-the-art approaches in spatiotemporal imputation. The code is available at https://github.com/zepengzhang/GiFlow.

URL PDF HTML ☆

赞 0 踩 0

2606.06666 2026-06-08 cs.CV 新提交

Architecture-Adaptive Uncertainty Fusion for Deepfake Detection

面向深度伪造检测的架构自适应不确定性融合

Ritesh Sharma, Mohammad Ghasemigol, Yuichi Motai

AI总结提出相关性优化融合（COF）框架，通过最大化融合不确定性分数与预测误差的皮尔逊相关性，自适应融合五种不确定性来源，无需模型修改且优化仅需42秒，在分布偏移下表现优于随机森林。

详情

AI中文摘要

深度伪造检测系统在基准测试中达到近乎完美的准确率，但法医部署需要可靠的预测不确定性。现有的不确定性量化（UQ）方法依赖单一来源，忽略了最优不确定性组合因架构而异。我们提出相关性优化融合（COF），这是一种架构自适应框架，通过概率单纯形上的约束优化最大化融合不确定性分数与预测误差之间的皮尔逊相关性，融合五种互补的不确定性来源——认知、偶然、校准、共形和分布。COF无需模型修改，权重优化仅需42秒，而5模型深度集成需要20-45小时。在FaceForensics++上对11种架构的评估揭示了一个基本权衡：在匹配的训练/评估协议下，非线性方法在域内相关性上比COF高约5-6%（平均r=0.438），但在分布偏移下情况反转。在CelebDF上，COF在11种架构中的9种上优于随机森林，相关性高出高达7.3倍（MaxViT-B: r=0.249 vs. 0.034）；RF跨域退化85%至r=0.071，而COF保留显著更多的信号（下降74%至r=0.116）。在CelebDF和DFDC上的跨数据集评估揭示了所有方法的灾难性泛化失败：域内相关性0.41-0.47在外部崩溃至接近零（平均退化90.7%），其中11种架构中有7种出现不确定性反转。这些结果确立了COF作为受控分布部署的实用、可解释框架，并指出域自适应UQ是法医部署的核心开放挑战。

英文摘要

Deepfake detection systems achieve near-perfect accuracy on benchmarks, yet forensic deployment demands reliable prediction uncertainty. Existing uncertainty quantification (UQ) methods rely on single sources and ignore that optimal uncertainty composition varies across architectures. We propose Correlation-Optimized Fusion (COF), an architecture-adaptive framework that fuses five complementary uncertainty sources -- epistemic, aleatoric, calibration, conformal, and distributional -- by maximizing Pearson correlation between fused uncertainty scores and prediction errors via constrained optimization on the probability simplex. COF requires no model modifications and only 42 s of weight optimization, compared to 20--45 h for a 5-model Deep Ensemble. Evaluation across eleven architectures on FaceForensics++ reveals a fundamental trade-off: under matched train/evaluation protocol, non-linear methods achieve approximately 5--6% higher in-domain correlation than COF (mean r = 0.438), but this reverses under distribution shift. On CelebDF, COF outperforms Random Forest in 9/11 architectures with up to 7.3x higher correlation (MaxViT-B: r = 0.249 vs. 0.034); RF degrades 85% cross-domain to r = 0.071, whereas COF retains substantially more signal (74% drop to r = 0.116). Cross-dataset evaluation on CelebDF and DFDC reveals catastrophic generalization failure across all methods: in-domain correlations of 0.41--0.47 collapse to near-zero externally (mean degradation 90.7%), with seven of eleven architectures exhibiting uncertainty inversion. These results establish COF as a practical, interpretable framework for controlled-distribution deployment and identify domain-adaptive UQ as the central open challenge for forensic deployment.

URL PDF HTML ☆

赞 0 踩 0

2606.06664 2026-06-08 cs.CV cs.AI cs.LG 新提交

Inside the Visual Mind: Neuroscience-Motivated Concept Circuits for Interpreting and Steering Vision Transformers

内在视觉：神经科学启发的概念电路用于解释和引导视觉变换器

Tang Li, Yanlin Chen, Mengmeng Ma, Xi Peng

AI总结提出ViSAE工具箱，通过神经科学启发的概念电路解释视觉变换器内部机制，包含高效概念集、自动电路追踪算法和概念编辑应用，在WaterBirds上最差组准确率提升48.2%。

详情

Comments: In Proceedings of the International Conference on Machine Learning, 2026. (acceptance rate 26.6%)

AI中文摘要

尽管视觉变换器（ViT）具有高准确率，但其预测可能受到虚假线索的驱动，因此在安全部署前需要理解其内部工作机制。稀疏自编码器（SAE）为将模型表示分解为人类可解释的概念提供了有前景的视角，但由于对概念覆盖范围的控制有限以及特征解释的主观性和不可扩展性，将基于SAE的解释方法应用于ViT仍然具有挑战性。为填补这些空白，受神经科学启发原理的驱动，我们提出了ViSAE，一个通过概念电路理解ViT内部工作机制的机械可解释性工具箱。ViSAE包含三个组成部分：（1）一个包含64K图像和16K视觉基础概念词汇的探测套件，与ImageNet相比，概念覆盖效率提高了20倍，与现有概念集相比，解释准确率提高了28.7%。（2）自上而下的概念读取和自下而上的电路追踪算法，通过概念电路自动恢复ViT内部工作机制。（3）用于审计和引导ViT行为的应用。通过概念编辑，ViSAE在WaterBirds上将最差组准确率提高了48.2%，比现有方法高出23.8%。我们的数据和代码：此 https URL。

英文摘要

Despite high accuracy, Vision Transformer (ViT) predictions can be driven by spurious cues, raising the need to understand their inner workings before safe deployment. Sparse autoencoders (SAEs) provide a promising lens for decomposing model representations into human-interpretable concepts, yet adapting SAE-based interpretation to ViTs remains challenging due to limited control over concept coverage and subjective, non-scalable feature interpretation. To fill the gaps, motivated by neuroscience-inspired principles, we propose ViSAE, a mechanistic interpretability toolbox for understanding ViT inner workings through concept circuits. ViSAE consists of three components: (1) A probing suite with 64K images and a 16K visually grounded concept vocabulary, improving concept coverage efficiency by 20x over ImageNet and interpretation accuracy by 28.7% over existing concept sets. (2) Top-down concept reading and Bottom-up circuit tracing algorithms that automatically recover ViT inner workings via concept circuits. (3) Applications for auditing and steering ViT behavior. Through concept editing, ViSAE improves the worst-group accuracy on WaterBirds by 48.2%, outperforming existing methods by 23.8%. Our data and code: https://github.com/deep-real/ViSAE.

URL PDF HTML ☆

赞 0 踩 0

2606.06663 2026-06-08 cs.LG 新提交

Explainable Runtime Dependency Tracking for AI-RAN Conflict Monitoring

面向AI-RAN冲突监控的可解释运行时依赖追踪

Christie Djidjev, Nicholas Kaminski

AI总结针对AI-RAN中参数-KPI依赖关系可能失效的问题，提出基于布尔矩阵的滑动窗口推理方法，通过事件流一致性检测实现轻量级可解释依赖追踪。

详情

AI中文摘要

未来集成AI的无线接入网络（AI-RAN）将结合开放可编程性与支持学习的xApps、rApps以及作用于共享参数和关键性能指标（KPI）的控制功能。对于冲突监控，仅知道部署了哪些应用是不够的；系统还必须知道运行时诊断所假设的参数-KPI依赖关系在当前运行状态下是否仍然有效。本文研究了一种轻量级的监控原语：从流式遥测事件中追踪可解释的依赖关系表示。我们将活跃依赖关系表示为布尔矩阵，并使用布尔矩阵乘法来检查最近的参数活动事件和KPI响应事件是否与当前估计一致。我们提出了一种滑动窗口推理过程，当估计一致时重复使用，当最近观测表明结构变化时重新计算。该追踪器旨在作为冲突诊断和慢循环模型刷新的可解释信号，而非自主缓解机制。在受控的布尔事件流上的实验表明，在依赖关系变化和布尔观测噪声下，该追踪器能够高效且准确地追踪。

英文摘要

Future AI-integrated Radio Access Networks (AI-RAN) will combine open programmability with learning-enabled xApps, rApps, and control functions that act on shared parameters and key performance indicators (KPIs). For conflict monitoring, it is not enough to know which applications are deployed; the system must also know whether the parameter--KPI dependencies assumed by runtime diagnosis remain valid under the current operating regime. This paper studies a lightweight monitoring primitive for that purpose: tracking an interpretable dependency representation from streaming telemetry events. We represent active dependencies by a Boolean matrix and use Boolean matrix multiplication to check whether recent parameter-activity and KPI-response events are consistent with the current estimate. We propose a sliding-window inference procedure that reuses the estimate when it remains consistent and recomputes it when recent observations indicate structural change. The tracker is intended as an explainable signal for conflict diagnosis and slow-loop model refresh, not as an autonomous mitigation mechanism. Experiments on controlled Boolean event streams show efficient and accurate tracking under dependency changes and Boolean observation noise.

URL PDF HTML ☆

赞 0 踩 0

2606.06574 2026-06-08 cs.LG 新提交

Skip a Layer or Loop It? Learning Program-of-Layers in LLMs

跳过一层还是循环它？学习LLM中的层程序

Ziyue Li, Yang Li, Tianyi Zhou

AI总结本文提出PoLar方法，通过轻量级预测网络学习为每个输入动态跳过或重复预训练层，在数学推理任务上以更少层数提升准确率。

详情

Comments: Accepted at ICML 2026. Substantially extends arXiv:2507.07996. Code: https://github.com/tianyi-lab/PoLar

AI中文摘要

大型语言模型（LLM）通过遵循所有层的固定深度和顺序（非循环执行）进行推理。我们揭示了无训练、灵活、动态的层程序（PoLar）的广泛存在，其中预训练层可以作为模块打包，然后跳过或循环以形成针对每个输入的自定义程序。对于大多数输入，显著更短的程序执行可以达到相同或更好的准确性，而原始LLM的错误预测可以通过具有更少层的替代程序来纠正。这些观察表明，推理允许超出标准前向传递的多种有效潜在计算。为了在实践中高效实现PoLar，我们提出了一个轻量级的PoLar预测网络，该网络学习生成执行程序，为每个输入动态跳过或重复预训练层。在数学推理基准上的实验表明，PoLar在通常执行更少层的情况下，持续提高了标准推理和先前动态深度方法的准确性，并且这些增益在分布外评估中仍然存在。我们的结果表明，固定深度执行仅捕获了LLM潜在推理能力的一小部分。

英文摘要

Large language models (LLMs) perform inference by following a fixed depth and order, non-recurrent execution of all layers. We reveal the wide existence of training-free, flexible, dynamic program-of-layers (PoLar), where pretrained layers can be packed as modules and then skipped or looped to form a customized program for each input. For most inputs, substantially shorter program executions can achieve the same or better accuracy, while incorrect predictions of the original LLM can be corrected by alternative programs with fewer layers. These observations indicate that inference admits multiple valid latent computations beyond the standard forward pass. To efficiently achieve PoLar in practice, we propose a lightweight PoLar prediction network, which learns to generate execution programs that dynamically skip or repeat pretrained layers for each input. Experiments on mathematical reasoning benchmarks demonstrate that PoLar consistently improves accuracy over standard inference and prior dynamic-depth methods, often while executing fewer layers, and that these gains persist under out-of-distribution evaluation. Our results suggest that fixed-depth execution captures only a narrow subset of an LLM's latent reasoning capacity.

URL PDF HTML ☆

赞 0 踩 0

2606.06572 2026-06-08 cs.LG cs.AI cs.CY econ.GN q-fin.EC 新提交

Generative Models Erode Human Temporal Learning Through Market Selection

生成模型通过市场选择侵蚀人类时间学习

Wenjun Cao

AI总结本文论证现代生成模型在亚AGI能力水平上通过市场选择机制侵蚀人类时间学习，提出价值崩溃路径并用昂贵检验框架形式化，跨领域证据显示验证侵蚀四阶段。

详情

Journal ref: Forty-third International Conference on Machine Learning Position Paper Track (2026)
Comments: Accepted at ICML 2026

AI中文摘要

我们认为，现代生成模型在当前亚AGI能力水平上对知识和文化生产造成了结构性风险。我们将人类时间学习（HTL）定义为通过长期持续参与问题而形成的路径依赖的知识积累。生成输出在表面特征上越来越像HTL密集型工作，因此验证给定输出是否反映真正的人类学习的成本相对于其预期收益变得高昂。一旦验证失去经济合理性，评估者就会奖励输出而不论其生产模式，而投入多年学习的生产者则在与几乎零成本生成的输出的价格竞争中处于劣势。我们将这一路径称为价值崩溃，并通过一个昂贵检验框架将其形式化。来自学术出版、法律实践、内容平台和软件安全的跨领域证据映射出验证侵蚀的四个阶段。对齐成功是正交的。更好的对齐模型缩小了人类与AI输出之间的可观察差距，使得来源验证更加困难，并加剧了对HTL密集型工作的竞争压力，即使单个AI输出有所改进。

英文摘要

We argue that modern generative models create structural risks for knowledge and cultural production at current, sub-AGI capability levels. We define Human Temporal Learning (HTL) as path-dependent knowledge accumulation through sustained engagement with problems over time. Generative outputs increasingly resemble HTL-intensive work in surface features, so verifying whether a given output reflects genuine human learning grows costly relative to its expected benefit. Once verification loses economic justification, evaluators reward outputs regardless of production mode, and producers who invested years of learning compete on price against outputs that cost almost nothing to generate. We call this pathway value collapse and formalize it through a costly-inspection framework. Cross-domain evidence from academic publishing, legal practice, content platforms, and software security maps onto four stages of verification erosion. Alignment success is orthogonal. Better-aligned models narrow observable gaps between human and AI outputs, making source verification harder and intensifying competitive pressure against HTL-intensive work even when individual AI outputs improve.

URL PDF HTML ☆

赞 0 踩 0

2606.06567 2026-06-08 cs.LG 新提交

Are you sure? A Comprehensive and Comprehensible Survey of Uncertainty Quantification in Symbolic Regression

你确定吗？符号回归中不确定性量化的全面且可理解的综述

Julia Reuter, Fabricio Olivetti de Franca

AI总结综述符号回归中的不确定性量化方法，涵盖频率派、贝叶斯和模型选择三个研究方向，指出该领域尚待探索。

2606.06547 2026-06-08 cs.LG cs.AI 新提交

FAIR-Calib: Frontier-Aware Instability-Reweighted Calibration for Post-Training Quantization of Diffusion Large Language Models

FAIR-Calib：面向扩散大语言模型训练后量化的前沿感知不稳定重加权校准

Haoyu Huang, Linlin Yang, Sheng Xu, Boyu Liu, Guodong Guo, Zhongqian Fu, Hang Zhou, Baochang Zhang

AI总结针对扩散大语言模型训练后量化中前沿决策易翻转并永久锁定放大的问题，提出两阶段PTQ框架FAIR-Calib，通过前沿命中与掩码阶段可靠性估计位置先验，并利用重加权隐状态MSE校准优先保护脆弱前沿状态，理论证明其作为输出KL散度代理，实验显著优于基线。

详情

Comments: Accepted as a poster at the 43rd International Conference on Machine Learning (ICML 2026)

AI中文摘要

扩散大语言模型（dLLMs）迭代地精炼token，但不可逆地提交它们，导致“稳定性滞后”，即早期决策即使在写入后仍然脆弱。我们揭示训练后量化（PTQ）误差容易在写入前沿翻转这些边界决策，然后这些决策被永久锁定并放大。为了解决这个问题，我们提出了前沿感知不稳定重加权校准（FAIR-Calib），一个针对dLLMs的两阶段PTQ框架。第一阶段探测全精度教师模型以估计结合前沿命中与掩码阶段可靠性的位置先验。第二阶段通过最小化重加权隐状态MSE执行离策略、逐层校准，有效优先保护脆弱前沿状态，而无需昂贵的端到端扩散展开。我们进一步从理论上证明我们的加权目标是输出KL散度的代理。实验上，FAIR-Calib在LLaDA和Dream（W4A4）上持续优于最先进的基线，显著减少前沿决策翻转并抑制跨不同基准的提交后不匹配。

英文摘要

Diffusion Large Language Models (dLLMs) refine tokens iteratively but commit them irreversibly, leading to a "stability lag" where early decisions remain fragile even after being written. We reveal that Post-Training Quantization (PTQ) error easily flips these borderline decisions at the write frontier, which are then permanently locked in and amplified. To address this, we propose Frontier-Aware Instability-Reweighted Calibration (FAIR-Calib), a two-stage PTQ framework for dLLMs. Stage I probes a full-precision teacher to estimate a position prior that combines frontier hits and masked-stage reliability. Stage II performs off-policy, layer-wise calibration by minimizing a reweighted hidden-state MSE, effectively prioritizing the protection of fragile frontier states without requiring expensive end-to-end diffusion rollouts. We further theoretically justify our weighted objective as a surrogate for output KL divergence. Empirically, FAIR-Calib consistently outperforms state-of-the-art baselines on LLaDA and Dream (W4A4), significantly reducing frontier decision flips and suppressing post-commit mismatches across diverse benchmarks.

URL PDF HTML ☆

赞 0 踩 0

2606.06536 2026-06-08 cs.CV cs.AI cs.LG 新提交

Attention-Guided Autoencoder Fusion for Insulator Defect Detection Using UAV Transmission-Line Imaging

基于注意力引导自编码器融合的无人机输电线路绝缘子缺陷检测

Malak Allam, Khaled Shaban, Ali Hamdi

AI总结提出AE-YOLO框架，通过注意力引导自编码器融合与方差最大化正则化，解决无人机图像中绝缘子缺陷检测的类别不平衡和尺度变化问题，在mAP@0.5上达95.10%，优于YOLO基线5个百分点。

详情

AI中文摘要

高压输电线路绝缘子的自动缺陷检测仍然具有挑战性，原因在于无人机（UAV）图像中严重的类别不平衡、尺度变化大以及缺陷实例的空间范围小。为了解决这些问题，本文提出了AE-YOLO，一种注意力引导的自编码器增强型YOLO框架，用于鲁棒的绝缘子缺陷检测。该架构在特征金字塔网络-路径聚合网络（FPN-PAN）颈部集成了轻量级瓶颈自编码器，在多尺度特征融合过程中保留了异常敏感信息。整个骨干网络使用卷积块注意力模块（CBAM），增强了特征辨别能力并抑制了背景干扰。该框架还引入了一种方差最大化的自编码器正则化策略，鼓励生成多样化、缺陷判别性的潜在表示。网络使用统一目标进行训练，该目标结合了焦点损失、完全IoU（CIoU）损失和自编码器正则化，以解决前景-背景不平衡问题并提高定位精度。在推理过程中，加权框融合（WBF）结合了YOLOv8、YOLOv10和YOLO11的预测结果。自编码器引导的置信度提升机制提高了对罕见缺陷类别的敏感性。在绝缘子缺陷检测数据集上的实验表明，采用EfficientNetV2骨干网络的AE-YOLO在mAP@0.5上达到95.10%，精度为96.40%，召回率为93.80%。这一性能在mAP@0.5上比最强的YOLO系列基线高出5.0个百分点，在召回率上高出6.7个百分点。这些结果证实了该框架的有效性和适应性。该模型是基于UAV的输电线路巡检和缺陷监测的实用且可扩展的解决方案。

英文摘要

Automated defect detection in high-voltage transmission-line insulators remains challenging due to severe class imbalance, large scale variation, and the small spatial extent of defect instances in Unmanned Aerial Vehicle (UAV) imagery. To address these challenges, this paper proposes AE-YOLO, an Attention-Guided AutoEncoder-Enhanced YOLO framework for robust insulator defect detection. The architecture integrates lightweight bottleneck autoencoders within a Feature Pyramid Network-Path Aggregation Network (FPN-PAN) neck. This preserves anomaly-sensitive information during multi-scale feature fusion. Convolutional Block Attention Modules (CBAM) are used throughout the backbone, enhancing feature discrimination and suppressing background interference. The framework also introduces a variance-maximizing autoencoder regularization strategy, which encourages diverse, defect-discriminative latent representations. The network trains using a unified objective that combines focal loss, Complete IoU (CIoU) loss, and autoencoder regularization to address foreground-background imbalance and improve localization accuracy. During inference, Weighted Boxes Fusion (WBF) combines predictions from YOLOv8, YOLOv10, and YOLO11. An autoencoder-guided confidence boosting mechanism improves sensitivity to rare defect categories. Experiments on the Insulator-Defect Detection dataset show that AE-YOLO with an EfficientNetV2 backbone achieves 95.10 percent mAP at 0.5, 96.40 percent precision, and 93.80 percent recall. This performance surpasses the strongest YOLO-family baseline by 5.0 points in mAP at 0.5 and 6.7 points in recall. These results confirm the effectiveness and adaptability of the framework. The model is a practical and scalable solution for UAV-based transmission-line inspection and defect monitoring.

URL PDF HTML ☆

赞 0 踩 0

2606.06533 2026-06-08 cs.AI cs.CL 新提交

Position: Don't Just "Fix it in Post": A Science of AI Must Study Training Dynamics

立场：不要仅仅“在后期修复它”：AI科学必须研究训练动态

Stella Biderman, Mohammad Aflah Khan, Niloofar Mireshghallah, Catherine Arnett, Fazl Barez, Naomi Saphra

AI总结本文主张AI科学应超越事后分析，研究训练动态以预测、干预和设计模型行为，并指出当前在可解释性、公平性等领域的进展及开放问题。

详情

Comments: Accepted as an oral to the ICML: https://icml.cc/virtual/2026/poster/67142

AI中文摘要

拥有对AI的科学理解意味着什么？模型不是静态对象：它们是由数据、目标、架构和优化动态塑造的时间演化过程的快照。然而，许多AI研究将模型视为固定的人工制品，分析训练后的行为，而不追问它们为何出现。这篇立场论文认为，AI科学必须超越事后修复，研究产生模型行为的训练动态。这样的科学应该支持逐渐增强的理解形式：从早期训练信号预测结果，在轨迹出错时进行干预，并最终设计出更可靠地产生期望属性的训练程序。缩放定律已使损失预测成为常规；挑战在于将这一成功扩展到能力、偏见、鲁棒性和安全相关行为。我们基于科学史和科学哲学阐述了此类理论的要求，考察了在机械可解释性、公平性、记忆化和简单性偏差方面的进展，并确定了具体的开放问题。

英文摘要

What would it mean to have a scientific understanding of AI? Models are not static objects: they are snapshots of time-evolving processes shaped by data, objectives, architectures, and optimization dynamics. Yet much of AI research treats models as fixed artifacts, analyzing behaviors after training rather than asking why they emerge. This position paper argues that a science of AI must move beyond post-hoc fixes and study the training dynamics that produce model behavior. Such a science should support progressively stronger forms of understanding: predicting outcomes from early training signals, intervening when trajectories go wrong, and ultimately designing training procedures that more reliably produce desired properties. Scaling laws have made prediction routine for loss; the challenge is extending this success to capabilities, biases, robustness, and safety-relevant behaviors. We articulate requirements for such theories grounded in the history and philosophy of science, examine progress in mechanistic interpretability, fairness, memorization, and simplicity bias, and identify concrete open problems.

URL PDF HTML ☆

赞 0 踩 0

2606.06531 2026-06-08 cs.AI quant-ph 新提交

CARVE-Q: Quantum-Proposed, Classically Certified Interactive Driving Repair

CARVE-Q：量子提议、经典认证的交互式驾驶修复

Yifan Wang

AI总结针对被否决的驾驶操作，提出CARVE-Q架构，通过量子最小搜索加速修复格搜索，同时保持安全认证的经典性，实现可审计的交互修复。

详情

Comments: 9 pages, 3 figures

AI中文摘要

在正确的驾驶否决之后，关键问题不仅在于某个操作是否不安全，还在于被阻止的交互是否允许合法的、可审计的且责任有限的修复。预测和博弈论规划器可以提出合理的合作，但它们不能提供修复符合硬性规则、路权、成本分配和自车后备的证明。我们引入了CARVE（通过包络线对被否决操作进行认证的可负担修复），一种无预测的交互式修复证书架构。给定一个被否决的操作，CARVE构建一个有限的修复格并发出一个结构化证书，记录绑定规则、选定的联合修复、按路权缩放的合作包络线、按责任加权的成本分配以及仅自车的后备。这个证书视图揭示了算法瓶颈：多主体修复产生一个乘积格 $M = \prod_j |\mathcal{A}_j|$。因此，我们引入了CARVE-Q，一个验证器屏蔽的量子AI搜索层，它仅对此黑盒格应用量子最小值查找，同时将所有安全权限保留在经典侧。在保守的验证器-预言机模型中，精确的经典最小值查找在最坏情况下需要 $\Theta(M)$ 次查询，而Durr-Hoyer/Grover最小值查找以高概率使用 $O(\sqrt{M})$ 次预言机查询。我们证明了验证器屏蔽的证书可靠性、优先级非泄露、黑盒查询分离以及有限精度可逆预言机的可构造性。然后，我们在最多65,536个分配的CARVE修复预言机上展示了状态向量最小值查找，并在基于Lanelet2的INTERACTION回放中验证了证书保留，实现了100%的路权尊重、100%的责任一致性以及零优先级误报。结果是一种用于认证自主性的信任有界量子AI模式：量子提议；CARVE认证。

英文摘要

The critical question after a correct driving veto is not only whether a maneuver is unsafe, but whether the blocked interaction admits a lawful, auditable, and responsibility-bounded repair. Prediction and game-theoretic planners can suggest plausible cooperation, yet they do not return a proof that the repair respects hard rules, right-of-way, cost allocation, and ego fallback. We introduce CARVE, Certified Affordable Repair of Vetoed maneuvers via Envelopes, a certificate architecture for prediction-free interactive repair. Given a vetoed maneuver, CARVE constructs a finite repair lattice and emits a structured certificate recording the binding rule, selected joint repair, right-of-way-scaled cooperation envelope, responsibility-weighted cost split, and ego-only fallback. This certificate view reveals the algorithmic bottleneck: multi-owner repair induces a product lattice $M = \prod_j |\mathcal{A}_j|$. We therefore introduce CARVE-Q, a verifier-shielded quantum-AI search layer that applies quantum minimum finding only to this black-box lattice while leaving all safety authority classical. In the conservative verifier-oracle model, exact classical minimum finding requires $Θ(M)$ queries in the worst case, whereas Durr-Hoyer/Grover minimum finding uses $O(\sqrt{M})$ oracle queries with high probability. We prove verifier-shielded certificate soundness, priority non-elicitation, black-box query separation, and finite-precision reversible-oracle constructibility. We then demonstrate state-vector minimum finding on CARVE repair oracles up to 65,536 assignments and validate certificate preservation on Lanelet2-grounded INTERACTION replay with 100% right-of-way respect, 100% blame consistency, and zero priority false positives. The result is a trust-bounded quantum-AI pattern for certified autonomy: quantum proposes; CARVE certifies.

URL PDF HTML ☆

赞 0 踩 0

2606.06529 2026-06-08 cs.AI cs.LG 新提交

Attack Selection in Agentic AI Control Evaluations Meaningfully Decreases Safety

智能体AI控制评估中的攻击选择显著降低安全性

Catherine Ge-Wang, Tyler Crosse, Benjamin Hadad, Joachim Schaeffer, Ram Potham, Tyler Tracy

AI总结本文研究攻击者策略性选择攻击时机对AI控制安全性的影响，通过分解攻击决策为开始和停止策略，实验表明两者均显著降低安全性，现有评估可能高估安全性。

详情

AI中文摘要

一个策略性选择攻击时机的攻击者比无差别攻击者更难被捕获。AI控制是一种安全框架，用于在较弱但可信的监控者和有限的人工审计预算下部署能力强但不可信的AI智能体。控制评估通过让红队攻击策略对抗蓝队监控者来压力测试这些协议，但当前评估通常假设攻击者不会策略性地选择攻击时机。我们通过将攻击决策分解为开始策略（决定攻击者何时应开始攻击）和停止策略（决定攻击者何时应中止正在进行的攻击），在智能体环境中研究了这种能力——攻击选择。在两个智能体环境BashArena和LinuxArena中，两种策略都在不改变底层攻击能力的情况下显著降低了测量的经验安全性。在1%的审计预算下，我们的开始策略在BashArena和LinuxArena上均将安全性降低了20个百分点，停止策略在BashArena上降低了20个百分点，在LinuxArena上降低了28个百分点。这些降低应被解释为攻击选择效果的上限。因此，现有的控制评估可能对选择性攻击者产生过于乐观的安全性估计。我们建议未来的评估、系统卡和安全案例应引出攻击选择，以产生更现实的安全性估计。

英文摘要

An attacker that strategically chooses when to attack is much harder to catch than one that attacks indiscriminately. AI control is a safety framework for deploying capable but untrusted AI agents under the oversight of a weaker, trusted monitor and a limited human audit budget. Control evaluations stress-test these protocols by pitting a red-team attack policy against the blue-team monitor, but current evaluations typically assume attackers that do not strategically select when to attack. We study this capability, attack selection, in agentic settings by decomposing attack decisions into a start policy, which decides when an attacker should attack, and a stop policy, which decides when an attacker should abort an ongoing attack. Across two agentic settings, BashArena and LinuxArena, both policies substantially lower measured empirical safety without changing the underlying attack capability. At a 1% audit budget, our start policy reduces safety by 20pp on both BashArena and LinuxArena, and our stop policy reduces safety by 20pp on BashArena and 28pp on LinuxArena. These reductions should be interpreted as upper bounds on the effect of attack selection. Existing control evaluations may therefore yield overly optimistic safety estimates against selective attackers. We recommend that future evaluations, system cards, and safety cases elicit attack selection to produce more realistic safety estimates.

URL PDF HTML ☆

赞 0 踩 0

2606.06520 2026-06-08 cs.CV cs.GR 新提交

Applying Deep Learning for cockpit segmentation in the context of mixed reality

应用深度学习进行混合现实场景中的驾驶舱分割

Alexandre Leles Sousa, Pedro de Oliveira Nielson, Erick Oliveira Rodrigues, Rafael Francisco dos Santos, Giovani Bernardes Vitor

AI总结本文提出利用U-net和DeepLabV3+卷积神经网络对驾驶舱图像进行前景与背景分割，以促进混合现实中的虚实融合，实现了约90%的准确率。

详情

DOI: 10.20906/CBA2024/4844
Comments: XXV Congresso Brasileiro de Automática - CBA 2024

AI中文摘要

计算机视觉是一个持续发展的领域。随着第一人称视角技术的进步，该领域内出现了新的发展机遇。混合现实通过实时显示物理世界中的物体来促进虚拟环境。为此，必须关注用户在此模拟环境中的沉浸感，不断寻求使其更接近可能的期望现实。本文提出开发图像处理，以执行图像分割，识别前景和背景，从而便于虚拟和真实图像的融合。因此，本研究通过摄像头获取用户使用CAT793F非公路卡车模拟器的真实图像，利用人工智能对这些图像进行分割。应用了卷积神经网络架构“U-net”和“DeepLabV3+”来执行图像分割。结果显示，准确率约为90%，并确定了最佳模型。

英文摘要

Computer vision is an area that has been growing continuously. With the advance of technologies with a first-person view, new development opportunities have emerged inside the area. Mixed reality promotes virtual environments with objects from the physical world shown in real time. For that, it's necessary to be concerned with the immersion of the user in this simulated environment, increasingly seeking to bring it closer to a possible desired reality. This paper proposes the development of image processing in order to perform the segmentation of images to identify what is foreground and background in order to facilitate the union of virtual and real images. Thus, the present work obtain real images of the user using the off-highway truck simulator CAT793F, through a camera, to be able to perform the segmentation of such images with artificial intelligence techniques.The convolutional neural network architectures "U-net" and "DeepLabV3+" are applied to perform image segmentation. As a result, metrics with around 90% accuracy were presented and and the best model was determined.

URL PDF HTML ☆

赞 0 踩 0

2606.06518 2026-06-08 cs.AI cs.LG 新提交

DiBS: Diffusion-Informed Branch Selection

DiBS: 扩散模型引导的分支选择

Bo Liu, Yuan Xie, Yuan Gao, Xiaolong Luo, Peng Ye, Tao Chen, Fujun Han

AI总结针对数独求解中学习型求解器缺乏正确性保证而符号求解器存在长尾搜索的问题，提出扩散模型引导的分支选择方法DiBS，在保持符号求解器完备性的同时，利用扩散模型排序候选值，显著降低搜索成本。

详情

Comments: 12 pages, 6 figures, 3 tables

AI中文摘要

数独是一个典型的约束满足问题，需要在严格的离散约束下进行全局结构推理。现有的数独求解方法主要集中于两种主流方法，即传统启发式方法和深度学习求解器。然而，它们存在互补的局限性：基于学习的求解器缺乏硬正确性保证，而完全符号求解器仍容易陷入长尾搜索。为了解决这些不足，我们提出了一种新颖的扩散模型引导方法，称为DiBS，用于分支选择搜索过程。具体来说，DiBS保持符号求解器的完备性，并使用扩散模型作为分支排序指南。核心方法是在当前部分赋值和轻量级一致性信号下对候选值进行排序。此外，我们提供了深入的理论证明来揭示其工作原理和原因。在具有挑战性的Royle 17线索数独基准上的实验表明，与强启发式基线相比，我们的DiBS显著降低了搜索成本，特别是在节点、回溯和长尾百分位数方面。此外，这些结果证实，在分支顺序错误代价最高的困难实例上，学习到的全局指导是有效的。所有代码均可在此https URL获取。

英文摘要

Sudoku is a representative constraint satisfaction problem that requires global structural reasoning under strict discrete constraints. The existing works of solving Sudoku mainly focus on two dominant approaches, i.e., traditional heuristic and deep learning solver. However, they suffer from two complementary limitations: learning-based solvers lack hard correctness guarantees, while complete symbolic solvers are still prone to long-tail search. To address these shortcomings, we propose a novel diffusion model-guided approach, termed as DiBS, for the branch selection search process. Specifically, DiBS keeps the symbolic solver complete and uses the diffusion model as a branch-ordering guide. The core method is ranking candidate values under the current partial assignment and lightweight consistency signal. Furthermore, we provide an in-depth theoretical proof to reveal how it works and why it works. Experiments on the challenging Royle 17-clue Sudoku benchmark show that our DiBS substantially reduces search cost relative to strong heuristic baselines, especially in nodes, backtracks, and long-tail percentiles. Besides, these results confirm that learned global guidance is effective on hard instances where branch-order mistakes are most expensive. All codes are available at https://github.com/shanxierdan/DiBS.

URL PDF HTML ☆

赞 0 踩 0

2606.06514 2026-06-08 cs.AI cs.LG 新提交

Detecting and Mitigating Bias by Treating Fairness as a Symmetry Operation

通过将公平性视为对称操作来检测和缓解偏见

Nishit Singh

AI总结将偏见形式化为对称性破缺操作，通过损失正则化恢复对称性，在合成数据集上实现90%以上的违规减少，准确率成本约5%。

2606.05510 2026-06-08 cs.AI cs.CL 交叉投稿

Severity-Aware Curriculum Learning with Multi-Model Response Selection for Medical Text Generation

基于严重性感知的课程学习与多模型响应选择用于医疗文本生成

Ahmed Alansary, Molham Mohamed, Ali Hamdi

AI总结提出一种结合课程学习策略和相关性响应选择的多模型框架，通过三阶段课程训练和五个大语言模型独立训练，在MAQA数据集上实现医疗文本生成性能提升。

详情

Comments: 6 pages, 3 figures, IMSA2026

AI中文摘要

远程医疗系统在提供可获取且及时的医疗信息方面变得越来越重要。现有的大语言模型通常难以在不同病例严重程度下提供一致且上下文恰当的医疗响应。这一局限性凸显了需要能够有效适应医疗查询渐进复杂性的模型。为了解决这一挑战，我们引入了一个严重性感知的多模型框架，该框架将课程训练策略与基于相关性的响应选择相结合。所提出的框架采用三阶段课程学习策略，每个模型依次在轻度、中度和危重病例上进行训练，以逐步获取领域知识。该方法利用五个大语言模型，每个模型在相同的课程方案下独立训练。在推理过程中，所有模型生成候选响应，并选择最合适的响应作为最终输出。该框架在MAQA数据集上进行训练和评估，该数据集提供带注释的医疗问答对。使用BERTScore评估的实验结果表明，与基线和微调模型相比，所提出的方法取得了优越的性能，在基线设置下达到86.71%，微调后达到90.30%。这些结果凸显了将课程学习与多模型响应选择相结合在提高医疗文本生成中的响应质量和相关性方面的有效性。

英文摘要

Telehealth systems have become increasingly important for delivering accessible and timely medical information. Existing large language models often struggle to provide consistent and contextually appropriate medical responses across varying levels of case severity. This limitation highlights the need for models that can effectively adapt to the progressive complexity in medical queries. To address this challenge, we introduce a severity-aware multi-model framework that integrates curriculum training strategy with relevance-based response selection. The proposed framework employs a three-stage curriculum learning strategy, where each model is trained sequentially on mild, moderate, and critical cases to progressively acquire domain knowledge. The approach utilizes five large language models, each independently trained under the same curriculum scheme. During inference, all models generate candidate responses, and the most appropriate response is selected as the final output. The framework is trained and evaluated on the MAQA dataset, which provides annotated medical question-answer pairs. Experimental results evaluated using BERTScore demonstrate that the proposed method achieves superior performance compared to both baseline and fine-tuned models, attaining 86.71% in the baseline setting and 90.30% after fine-tuning. These results highlight the effectiveness of combining curriculum learning with multi-model response selection in improving response quality and relevance in medical text generation.

URL PDF HTML ☆

赞 0 踩 0

2606.05152 2026-06-08 cs.LG cs.AI cs.CL 版本更新

Reinforcement Learning from Rich Feedback with Distributional DAgger

利用丰富反馈的强化学习与分布式DAgger

Rishabh Agrawal, Jacob Fein-Ashley, Paria Rashidinejad

AI总结提出DistIL算法，通过分布式DAgger利用丰富反馈（如执行轨迹、工具输出等）进行前向交叉熵优化，实现单调策略改进和更好的Pass@N性能。

详情

AI中文摘要

推理模型发展迅速，但主流的基于可验证奖励的强化学习（RLVR）方法仍然非常狭窄：采样多个响应，并用单个比特奖励每个响应，指示最终答案是否正确。然而，许多设置提供了丰富的反馈，包括执行轨迹、工具输出、专家修正和模型自我评估。我们研究如何通过经典模仿学习算法DAgger的分布式变体来使用这种反馈，其中学习器可以局部访问当前策略所访问状态上的专家分布。这产生了一个简单的前向交叉熵目标，该目标接受黑盒专家，并且其序列级梯度通过将未来的专家-学生分歧传播回早期决策来进行丰富的信用分配。我们表明，基于反向KL或Jensen-Shannon的先前具有自蒸馏目标的强化学习无法保证单调策略改进：即使专家具有更高的奖励，它们的更新也可能增加更差动作的概率。相比之下，我们证明前向交叉熵允许单调策略改进并享有遗憾保证。我们进一步表明，我们的目标优化了教师加权的成功可能性的下界，从而改进了Pass@N。实验上，我们的方法DistIL在科学推理、编程和解决困难数学问题等多个领域优于RLVR和基于自蒸馏的强化学习基线。

英文摘要

Reasoning models have advanced rapidly, but the dominant reinforcement learning from verifiable rewards (RLVR) recipe remains surprisingly narrow: sample many responses and reward each with a single bit indicating whether the final answer is correct. Yet many settings provide rich feedback, including execution traces, tool outputs, expert corrections, and model self-evaluations. We study how to use such feedback through a distributional variant of the classic imitation learning algorithm DAgger, where the learner has local access to an expert distribution on states visited by the current policy. This yields a simple forward cross-entropy objective that admits a blackbox expert and whose sequence-level gradient {conduct rich credit assignment by propagating} future expert-student disagreement back to earlier decisions. We show that prior RL with self-distillation objectives based on reverse KL or Jensen-Shannon fail to guarantee monotonic policy improvement: even when the expert has higher reward, their updates may increase probability on worse actions. In contrast, we show that forward cross-entropy admits monotonic policy improvement and enjoys guarantees on regret. We further show that our objective optimizes a lower bound on teacher-weighted likelihood of success, leading to improved Pass@N. Empirically, our approach, DistIL, improves over RLVR and RL with self-distillation baselines across a variety of domains: scientific reasoning, coding, and solving hard mathematical problems.

URL PDF HTML ☆

赞 0 踩 0

2606.01120 2026-06-08 cs.AI 版本更新

Diagnosing LLM Arbitration Behavior over Pre-evidence Epistemic States in RAG-based Fact-Checking

诊断基于RAG的事实核查中LLM对证据前认知状态的仲裁行为

Yuxi Sun, Wenbo Shang, Wei Gao, Xin Huang, Jing Ma

AI总结提出PAVE测试平台，通过将LLM验证器分为四种认知状态，评估其在检索增强生成事实核查中仲裁参数知识与检索证据的能力，发现不可靠且高度依赖模型的仲裁行为，并提出轻量级JSD测试时仲裁方法。

详情

Comments: Accepted to ACL-2026 Findings (voluntarily withdraw)

AI中文摘要

在基于RAG的事实核查中，LLM越来越多地被用作验证器，以根据检索到的证据检查给定声明。它们的参数知识可能引发与检索上下文冲突的证据前倾向，然而现有的评估框架并未描述这种先验-上下文差异，也未衡量验证器如何在参数信号和上下文信号之间进行仲裁。我们引入了 extsc{PAVE}（\emph{先验感知验证器评估}），这是一个诊断测试平台，根据LLM验证器在证据前的先验正确性和置信度将其分为四种认知状态，并在这个新基准上评估其仲裁行为，即：在误导性证据下是否坚持正确的先验，以及在提供准确证据时是否纠正错误的先验。对七个LLM的实验揭示了不可靠且高度依赖模型的先验-上下文仲裁，突显了在真实世界的基于RAG的事实核查应用中选择验证器的重要性。基于这些发现，我们提出了一种轻量级的基于JSD的测试时仲裁方法，该方法在不修改底层模型的情况下提高了事实可靠性，并在不同的LLM家族中实现了竞争性能。

英文摘要

In RAG-based fact-checking, LLMs are increasingly used as verifiers to check given claims against retrieved evidence. Their parametric knowledge can induce pre-evidence tendencies that may conflict with the retrieved context, yet existing evaluation frameworks do not characterize such prior-context discrepancy or measure how verifiers arbitrate between parametric and contextual signals. We introduce \textsc{PAVE} (\emph{Prior-Aware Verifier Evaluation}), a diagnostic testbed that stratifies an LLM verifier into four epistemic states based on the correctness and confidence of its pre-evidence prior and evaluates its arbitration behavior on this new benchmark, i.e., whether it persists in correct prior under misleading evidence, and whether it corrects wrong prior when accurate evidence is provided. Experiments across seven LLMs reveal unreliable and highly model-dependent prior-context arbitration, highlighting the importance of verifier selection for real-world RAG-based fact-checking applications. Based on these findings, we propose a lightweight JSD-based test-time arbitration method that improves factual reliability without modifying the underlying model, achieving competitive performance across diverse LLM families.

URL PDF HTML ☆

赞 0 踩 0

2605.19611 2026-06-08 cs.CV cs.ET 版本更新

Physics Guided Conditional Diffusion Framework for Generative Inverse Design of Manufacturable Metasurface based Absorbers

基于物理引导的条件扩散模型的超材料吸收体逆向设计

Vineetha Joy, Jamshed Palai, Satwik Sahu, Anshuman Kumar, Amit Sethi, Hema Singh

AI总结本文提出了一种基于物理引导的条件扩散框架，用于设计具有特定电磁响应的超材料吸收体，通过特征线性调制和预训练的替代电磁模拟器，提高了设计效率和条件准确性，实验表明该方法在2-18GHz频率范围内能够快速生成实用的超材料结构。

详情

AI中文摘要

针对特定电磁响应的超材料逆向设计需要生成满足严格频谱约束且可制造的几何结构。传统设计方法依赖于全波仿真进行迭代优化，对于大设计空间来说非常耗时且计算密集。此外，常用的生成方法往往条件保真度有限，生成的设计通常包含精细或不规则特征，难以制造。为此，我们提出了一种物理引导的条件质量增强扩散框架，用于超材料吸收体的逆向设计。在这里，由目标反射特性构成的条件信息通过特征线性调制（FiLM）整合到模型中。此外，为了确保符合目标频谱，嵌入了预训练的替代电磁模拟器，通过频谱级损失函数引入物理感知的正则化。通过在2至18GHz频率范围内生成不同类型的反射特性实用的超材料结构，证明了所提模型的有效性。该框架实现了目标频谱与生成设计频谱之间的平均频谱均方误差为0.0006，频段对齐精度为0.958，显示出高条件准确性。此外，模型为相同条件生成多种几何结构，从而为工程师提供多样化的设计选择。所提模型在约30秒内生成合适的设计，而传统方法在同等计算资源下需要数月时间。模型的效率还通过实验测量得到验证。

英文摘要

Inverse design of metasurfaces under continuous electromagnetic constraints requires generation of geometries that simultaneously satisfy stringent spectral specifications and remain manufacturable. Conventional approaches based on iterative full wave simulations are computationally prohibitive for large design spaces, while existing generative models often suffer from poor conditional controllability and limited fabrication awareness. In this regard, we propose a physics guided condition quality enhanced diffusion framework for the inverse design of metasurface based absorbers. Fabrication-aware constraints are incorporated to ensure practical realizability of the generated designs. The framework introduces a conditioning mechanism for continuous spectral specifications, wherein feature-wise linear modulation propagates the condition across the denoising hierarchy, enabling stable and accurate generation with improved spectral controllability. Further, to embed EM consistency directly into the generative learning process, a pre trained surrogate EM simulator is integrated within the diffusion training pipeline. The proposed framework generated physically realizable metasurface designs for diverse reflection characteristics in the frequency range of 2 to 18 GHz, achieving a very low average spectral mean squared error of 0.0006 and a high band alignment accuracy of 0.958. The framework also addresses the fundamentally non-unique nature of inverse EM design by enabling structured multimodal generation of geometrically distinct yet spectrally consistent metasurface designs for the same target response. The proposed model produces the suitable design in approximately 30 seconds, whereas the conventional approach can take several months under comparable computational resources. The efficiency of the model is also established via experimental measurements.

URL PDF HTML ☆

赞 0 踩 0

2605.00737 2026-06-08 cs.AI 版本更新

To Call or Not to Call: A Framework to Assess and Optimize LLM Tool Calling

调用还是不调用：评估和优化LLM工具调用的框架

Qinyuan Wu, Soumi Das, Mahsa Amani, Arijit Nag, Seungeon Lee, Krishna P. Gummadi, Abhilasha Ravichander, Muhammad Bilal Zafar

AI总结提出一个基于决策理论的框架，从必要性、效用和可负担性三个关键因素评估LLM的网页搜索工具调用决策，并训练轻量级估计器优化调用，提升任务性能。

详情

Comments: Preprint, under review

AI中文摘要

智能体AI架构通过外部工具增强LLM，解锁了强大的能力。然而，工具使用并不总是有益的；某些调用可能是冗余甚至有害的。因此，有效的工具使用取决于LLM的核心决策：在执行任务时是否调用工具。这一决策对于网页搜索工具尤其具有挑战性，因为外部信息的收益取决于模型的内部知识及其整合潜在噪声工具响应的能力。我们引入了一个受决策理论启发的原则性框架，用于评估网页搜索工具使用决策的三个关键因素：必要性、效用和可负担性。我们的分析结合了两种互补视角：规范性视角，从工具调用的最优分配推断真实需求和效用；描述性视角，从模型观察到的行为推断其自我感知的需求和效用。我们在四种网页搜索工具和三种任务上，评估了六种开源和一种闭源前沿模型，采用两种测试环境：一种仅基于当前轮次及其搜索结果，另一种基于完整执行轨迹。在每个设置中，我们发现模型的感知需求和效用经常与真实需求和效用不一致。基于此框架，我们从模型的隐藏状态训练轻量级需求和效用估计器。这些估计器驱动简单的控制器，提高了决策质量，并为大多数开源模型带来了比自我感知基线更强的任务性能。

英文摘要

Agentic AI architectures augment LLMs with external tools, unlocking strong capabilities. However, tool use is not always beneficial; some calls may be redundant or even harmful. Effective tool use, therefore, hinges on a core LLM decision: whether to call or not call a tool when performing a task. This decision is particularly challenging for web search tools, where the benefits of external information depend on the model's internal knowledge and its ability to integrate potentially noisy tool responses. We introduce a principled framework inspired by decision-making theory to evaluate web search tool-use decisions along three key factors: necessity, utility, and affordability. Our analysis combines two complementary lenses: a normative perspective that infers true need and utility from an optimal allocation of tool calls, and a descriptive perspective that infers the model's self-perceived need and utility from their observed behaviors. We evaluate six open and one closed-source frontier models under two harnesses, one conditioning on only the current turn and its search results, the other on the full execution traces, across four web-search tools and three tasks. In every setting, we find that a model's perceived need and utility are frequently misaligned with the true need and utility. Building on this framework, we train lightweight estimators of need and utility from the models' hidden states. These estimators drive simple controllers that improve decision quality and yield stronger task performance than the self-perceived baseline for most of the open-source models.

URL PDF HTML ☆

赞 0 踩 0