arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2119
2606.11634 2026-06-11 cs.AI 新提交

Architecture-Aware Reinforcement Learning Makes Sliding-Window Attention Competitive in Math Reasoning

架构感知强化学习使滑动窗口注意力在数学推理中具有竞争力

Kai Liu, Peijie Dong, Xinchen Xie, Jianfei Gao, Qipeng Guo, Xiaowen Chu, Shaoting Zhang, Kai Chen

AI总结 提出SWARR方法,通过监督微调将预训练自注意力模型高效转换为滑动窗口注意力,并利用强化学习策略适应,缩小了与自注意力的性能差距,同时保持线性复杂度的高效性。

详情
AI中文摘要

推理和智能体大型语言模型的快速进展增加了对长上下文推理的需求,但自注意力的计算复杂度随上下文长度呈二次增长。为了解决这个问题,我们研究了SWARR(用于数学推理的滑动窗口注意力强化适应),这是一种将SWA模型适应数学推理的实用方案。SWARR包含两个阶段:(1)从预训练的SA模型高效转换为SWA,并通过监督微调(SFT)避免重新训练基础模型;(2)使用强化学习(RL)进行策略适应。我们发现,在SFT后SWA的性能仍低于SA,我们假设这一差距部分由数据-架构不匹配导致:大多数SFT数据是为SA模型准备的,可能包含SWA难以建模的长距离依赖。由于在策略RL在SWA约束下优化自生成轨迹,它可以使轨迹更好地匹配SWA。在数学推理基准上的实验表明,该方案显著缩小了SWA与SA之间的差距,恢复了SWA转换过程中丢失的大部分准确性,同时保持了线性复杂度注意力的效率优势。我们的核心贡献是实证发现,RL改变了仅通过转换和SFT得出的关于SWA在数学推理中可行性的结论。

英文摘要

The rapid progress of reasoning and agentic large language models (LLMs) has increased the demand for long-context inference, but self-attention (SA) scales quadratically with context length. To address this, we study SWARR (Sliding-Window Attention with Reinforced Adaptation for Math Reasoning), a practical recipe for adapting SWA models to mathematical reasoning. SWARR has two stages: (1) efficient conversion from a pretrained SA model to SWA with supervised fine-tuning (SFT), which avoids pretraining a new base model, and (2) policy adaptation with reinforcement learning (RL). We find that SWA still underperforms SA after SFT, and we hypothesize that this gap is caused in part by a data-architecture mismatch: most SFT data are prepared for SA models and may contain long-range dependencies that are difficult for SWA to model. Because on-policy RL optimizes self-generated trajectories under the SWA constraint, it can adapt trajectories to better match SWA. Experiments on mathematical reasoning benchmarks show that this recipe substantially narrows the gap between SWA and SA, recovering much of the accuracy lost during SWA conversion while preserving the efficiency benefits of linear-complexity attention. Our central contribution is the empirical finding that RL changes the conclusion one would draw from conversion and SFT alone about SWA's viability for math reasoning.

2606.11614 2026-06-11 cs.LG cs.AI cs.CV 新提交

Information-Theoretic Decomposition for Multimodal Interaction Learning

多模态交互学习的信息论分解

Zequn Yang, Yake Wei, Haotian Ni, Zhihao Xu, Di Hu

AI总结 提出基于信息论的多模态交互分解方法DMIL,通过变分分解架构和微调策略学习样本特定的冗余、独特和协同交互,提升多模态学习性能。

详情
Comments
Accepted to CVPR 2026
AI中文摘要

多模态学习依赖于捕获跨模态的冗余、独特和协同信息,这些信息共同构成多模态交互。一个关键但尚未充分探索的挑战是,这些隐式交互在不同样本间动态变化。在这项工作中,我们首次进行了系统的信息论分析,强调了学习这些动态的、样本特定的交互对于有效多模态学习的重要性。我们的分析进一步揭示了传统范式在学习这些不同交互类型方面的缺陷:模态集成方法难以捕获协同,而联合学习范式往往未能充分利用冗余信息。这突显了对一种能够基于每个样本自适应地从不同交互类型中学习的方法的需求。为此,我们提出了基于分解的多模态交互学习(DMIL),一种显式建模并学习样本特定交互的新范式。首先,我们设计了一个变分分解架构来分离组成交互组件。其次,我们采用了一种新的学习策略,在微调过程中利用这些显式交互组件来实现全面的交互学习。跨不同任务和架构的大量实验表明,DMIL通过适应整体的样本特定交互,始终实现了优越的性能。我们的框架灵活且广泛适用,建立了一个以交互为中心的多模态学习范式。代码可在以下网址获取:此 https URL。

英文摘要

Multimodal learning hinges on capturing redundant, unique, and synergistic information across modalities, which collectively constitute multimodal interactions. A critical yet underexplored challenge is that these implicit interactions vary dynamically across samples. In this work, we present the first systematic, information-theoretic analysis highlighting why learning these dynamic, sample-specific interactions is critical for effective multimodal learning. Our analysis further reveals deficits in conventional paradigms at learning these distinct interaction types: modality ensemble approaches struggle to capture synergy, while joint learning paradigms often under-utilize redundant information. This highlights the need for an approach that can adaptively learn from different interaction types on a per-sample basis. To this end, we propose Decomposition-based Multimodal Interaction Learning (DMIL), a novel paradigm that explicitly models and learns from sample-specific interactions. First, we design a variational decomposition architecture to isolate the constituent interaction components. Second, we employ a new learning strategy that leverages these explicit interaction components in a fine-tuning process to achieve comprehensive interaction learning. Extensive experiments across diverse tasks and architectures demonstrate that DMIL consistently achieves superior performance by adapting to holistic sample-specific interactions. Our framework is flexible and broadly applicable, establishing an interaction-centric paradigm for multimodal learning. The code is available at https://github.com/GeWu-Lab/DMIL.

2606.11605 2026-06-11 cs.LG cs.AI 新提交

Physics-Distilled Neural Network enabled by Large Language Models for Manufacturing Process-Property Predictive Modeling

基于大语言模型的物理蒸馏神经网络用于制造过程-性能预测建模

Ge Song, Kiarash Naghavi Khanghah, Anandkumar Patel, Rajiv Malhotra, Hongyi Xu

AI总结 提出一种知识蒸馏框架,利用大语言模型从文献中提取物理先验,通过图掩码注意力层捕获变量依赖,蒸馏至轻量学生模型,在数据稀缺下实现高精度预测与实时部署。

详情
Comments
Under review, Journal of Computing and Information Science in Engineering
AI中文摘要

预测制造过程中的过程-性能关系常面临高实验成本和复杂'黑箱'模型可解释性有限的挑战。本文提出一种新颖的知识蒸馏框架,旨在数据稀缺场景下实现高精度预测。该框架将分析性物理先验(通过大语言模型从科学文献中系统提取)集成到特权教师模型中。我们采用图掩码注意力层来捕获输入变量间复杂的物理依赖关系,这些变量表现为严格设定点或静态与高频时间特征的组合。这种特权知识被蒸馏到轻量级学生预测器中进行推理。通过在五种不同制造过程中的综合实验,评估了该框架的可行性和鲁棒性。为确保统计可靠性,鉴于数据集规模较小,采用重复K折交叉验证技术来量化模型稳定性和泛化能力。结果表明,所提框架在所有评估领域均持续实现高预测精度。最重要的是,该架构表现出显著的容错性,即使在LLM推导的分析先验次优或不完整的情况下,也能保持稳健的预测性能。此外,学生预测器的推理频率超过6000 Hz,便于在标准工业硬件上进行实时边缘部署。这项工作为在数据受限环境下弥合理论物理与实时工业监测之间的差距提供了可扩展的解决方案。

英文摘要

Predicting process-property relationships in manufacturing is often challenged by high experimental costs and the limited interpretability of complex 'black-box' models. This paper proposes a novel knowledge distillation framework designed to achieve high-accuracy predictions in data-scarce scenarios. The framework integrates analytical physics priors, which are systematically extracted from scientific literature via Large Language Models, into a privileged teacher model. We employ a Graph-Masked Attention layer to capture the complex physical dependencies among input variables showing strict setpoints or a combination of static and high-frequency temporal signatures. This privileged knowledge is distilled into a lightweight student predictor for inference. The feasibility and robustness of the framework are evaluated through a comprehensive experiment across five diverse manufacturing processes. To ensure statistical reliability, given the small dataset sizes, a repeated K-fold cross-validation technique is employed to quantify model stability and generalization. Results indicate that the proposed framework consistently achieves high predictive accuracy across all evaluated domains. Most importantly, the architecture demonstrates significant fault tolerance by maintaining robust predictive performance even in scenarios where LLM-derived analytical priors are suboptimal or incomplete. Furthermore, the student predictor achieves an inference frequency exceeding 6000 Hz, which facilitates real-time edge deployment on standard industrial hardware. This work provides a scalable solution for bridging the gap between theoretical physics and real-time industrial monitoring in data-limited environments.

2606.11578 2026-06-11 cs.CV 新提交

Contactless 3D Human Body Measurement Using Depth Cameras for Smart Health Monitoring

基于深度相机的非接触式3D人体测量用于智能健康监测

Martha Asare, Xuan Wang, Juan Lopez Alvarenga, Lois Akosua Serwaa, Jinghao Yang

AI总结 提出一种基于深度相机和3D点云的非接触式人体测量框架,通过空间滤波、地标选择及体素/网格分析实现身高、臂展、体积和表面积等关键指标的准确估计。

详情
Comments
6 pages, 4 figures. Depth camera-based framework for contactless anthropometric measurement and geometric analysis using 3D point clouds
AI中文摘要

非接触式人体测量技术对于智能健康监测、数字健康应用和远程患者评估日益重要。传统的人体测量通常需要物理接触和训练有素的人员,这可能限制其在远程医疗环境中的可扩展性。在本研究中,我们介绍了一种基于深度相机的框架,利用3D点云数据估计人体测量值。使用Orbbec Astra 2深度相机捕获参与者的RGB图像、深度图和3D点云。利用基于Python的工具(包括Open3D、NumPy和OpenCV)处理捕获的点云,将人体从背景中分割出来。计算关键的人体测量值,如身高和臂展。通过3D点云上的空间滤波和地标选择组合获得测量值,然后利用相机内参将计算出的测量值投影到对应的RGB图像上。除了线性测量外,还使用基于体素的占用分析和基于网格的表面重建方法估计了近似身体体积和可见表面积。单次深度捕获的实验结果表明,无需物理接触即可从深度相机数据中获得准确的人体测量值和几何估计。本研究为未来将深度感知与智能健康监测和生成式AI模型相结合的实时系统奠定了基础,用于智能医疗应用。

英文摘要

Contactless body measurement technologies are becoming increasingly significant for smart health monitoring, digital health applications, and remote patient assessment. Traditional anthropometric measurements typically necessitate physical contact and trained personnel, which may constrain scalability in remote healthcare settings. In this study, we introduce a depth camera-based framework for estimating human body measurements utilizing 3D point cloud data. An Orbbec Astra 2 depth camera was employed to capture RGB images, depth maps, and 3D point clouds of participants. The captured point cloud was processed using Python-based tools, including Open3D, NumPy, and OpenCV, to segment the human body from the background. Key anthropometric measurements, such as height and arm span, were computed. The measurements were obtained through a combination of spatial filtering and landmark selection on the 3D point cloud, followed by the projection of the computed measurements onto the corresponding RGB image using camera intrinsic parameters. In addition to linear measurements, the approximate body volume and visible surface area were estimated using voxel-based occupancy analysis and mesh-based surface reconstruction methods. The experimental results from a single depth capture demonstrated that accurate body measurements and geometric estimates could be obtained from depth camera data without physical contact. This study provides a foundation for future real-time systems that integrate depth sensing with intelligent health monitoring and generative AI models for smart healthcare applications.

2606.11563 2026-06-11 cs.CV cs.RO 新提交

Cross-Modal Benchmarking for Robotic Perception in Natural Environments

自然环境中机器人感知的跨模态基准测试

David Hall, Joshua Knights, Mark Cox, Peyman Moghadam

AI总结 针对自然环境中机器人感知的挑战,提出WildCross跨模态基准,用于大规模自然场景下的地点识别和度量深度估计,并扩展了度量深度估计实验。

详情
Comments
Accepted to the IEEE ICRA Workshop on Open Challenges for Rigorous Robot Perception 2026
AI中文摘要

自然环境对机器人感知系统提出了复杂挑战。当前模型,特别是视觉基础模型,主要在有结构的城市环境中训练,导致其在野外机器人任务的感知中存在弱点。我们利用最近发布的WildCross基准展示了当前模型的局限性,这是一个用于大规模自然环境中地点识别和度量深度估计的新型跨模态基准。WildCross包含超过476K个顺序RGB帧,带有半稠密深度和表面法线标注,每个帧都与准确的6DoF姿态和同步的稠密激光雷达子地图对齐。在这项工作中,我们提供了对最近WildCross基准结果的扩展分析,特别强调扩展的度量深度估计实验。本工作的代码仓库和数据集可在https://csiro-robotics.github.io/WildCross获取。

英文摘要

Natural environments present a complex challenge to robotics perception systems. Current models, particularly vision foundation models, are largely trained on structured, urban environments leading to weaknesses in their perception for field robotics tasks. We showcase the limitations of current models using our recently released WildCross benchmark, a new cross-modal benchmark for place recognition and metric depth estimation in large-scale natural environments. WildCross comprises over 476K sequential RGB frames with semi-dense depth and surface normal annotations, each aligned with accurate 6DoF pose and synchronized dense lidar submaps. In this work, we provide an expanded analysis of the benchmark results from the recent WildCross benchmark, with particular emphasis on expanded metric depth estimation experiments. Access to the code repository and dataset for this work can be found at https://csiro-robotics.github.io/WildCross.

2606.11552 2026-06-11 cs.CL cs.LG 新提交

Teaching Diffusion to Speculate Left-to-Right

教导扩散模型从左到右推测

Lexington Whalen, Yuki Ito, Ryo Sakamoto

AI总结 针对自回归解码的推理瓶颈,提出三种训练时干预方法(位置加权、首次错误焦点损失、链损失)来弥合块扩散草稿模型的双向生成与自回归目标模型从左到右验证之间的不对称性,显著提升接受草稿长度。

详情
Comments
13 pages, technical report
AI中文摘要

大型语言模型(LLMs)在广泛任务中表现出色,但其自回归解码过程由于固有的顺序令牌生成而带来大量推理成本。推测解码通过使用轻量级草稿模型提出多个未来令牌,随后由更大的目标模型并行验证,从而解决这一瓶颈。近期工作表明,扩散语言模型非常适合此设置,因为它们可以并行生成整个草稿令牌块,从而缓解自回归草稿的顺序约束。该机制的一个微妙之处在于,块扩散草稿生成器在块内双向生成令牌,而验证由自回归目标模型以严格从左到右的方式评估令牌,导致对称的训练目标与非对称的验证奖励之间存在差距。在本工作中,我们对三种缩小这一差距的训练时干预措施进行了实证分析:令牌位置加权、针对每个块内破坏已接受前缀位置的首次错误焦点损失,以及用可微替代项替代期望接受长度的链损失项。这三种干预措施沿正交轴(位置、块条件首次错误、联合前缀)起作用,并且可加性组合;它们同样与测试时对齐机制(如多草稿自选)正交,原则上可以与之结合。在四个目标模型和六个推理、代码及对话基准测试中,与位置均匀基线相比,这三种干预措施使每个基准测试的接受草稿长度提高了21-76%,且无需增加额外前向传递,也无需改变推理流程或拒绝采样精确性约束。

英文摘要

Large language models (LLMs) achieve remarkable performance across a wide range of tasks, but their autoregressive decoding process incurs substantial inference costs due to inherently sequential token generation. Speculative decoding addresses this bottleneck by employing a lightweight draft model to propose multiple future tokens that are subsequently verified in parallel by a larger target model. Recent work has demonstrated that diffusion language models are well suited for this setting, as they can generate entire blocks of draft tokens in parallel and thereby alleviate the sequential constraints of autoregressive drafting. A subtlety of this regime is that block-diffusion drafters generate tokens bidirectionally within a block, whereas verification is performed by an autoregressive target model that evaluates tokens in a strictly left-to-right manner, leaving a gap between the symmetric training-time objective and the asymmetric verification-time reward. In this work, we offer an empirical analysis of three training-time interventions that narrow this gap: token positional weighting, a first-error focal loss that targets the position that breaks the accepted prefix within each block, and a chain loss term that substitutes a differentiable surrogate for the expected accepted length. The three interventions act along orthogonal axes (position, block-conditional first error, joint prefix) and compose additively; they are likewise orthogonal to test-time alignment mechanisms such as multi-draft self-selection, with which they can in principle be combined. Across four target models and six reasoning, code, and dialogue benchmarks, the three interventions raise accepted draft length by 21-76% per benchmark over a position-uniform baseline, without adding additional forward passes and without changing the inference pipeline or the rejection-sampling exactness contract.

2606.11520 2026-06-11 cs.CL cs.AI cs.LG 新提交

ISE: An Execution-Grounded Recipe for Multi-Turn OS-Agent Trajectories

ISE:一种基于执行的多轮操作系统代理轨迹合成方法

Siyuan Luo, Nairong Zheng, Lin Zhou, Tiankuo Yao, Shengyou Yuan, Haojia Yu, Cong Pang, Jiapeng Luo, Lewei Lu

AI总结 提出ISE三阶段范式,通过结构化意图构建、角色锁定用户模拟和真实执行环境,生成多轮代理轨迹,微调后显著提升代理工具使用性能。

详情
Comments
13 pages, 6 figures. Dataset and code: https://github.com/Valiere01/ISE-Trace
AI中文摘要

训练有能力的操作系统代理需要同时捕获结构化用户意图、多轮任务委派和基于工具执行的数据——这些属性在现有数据集中缺失。我们提出ISE(意图->模拟->执行),一种三阶段合成范式,联合解决这些差距。阶段1通过4D框架(人物角色x领域x任务x复杂度)构建约50000个结构化意图;去重后池中包含43956个唯一意图,并在mpnet-base-v2嵌入(余弦核,q=1)上获得61.57的Vendi分数。阶段2通过角色锁定的用户模拟器驱动多轮用户-代理交互,将每轮用户交互基于实际执行结果,生成23132条完整轨迹,平均8.12轮用户交互和68.24轮总对话。阶段3在实时、隔离的操作系统工作空间中执行每个工具调用,生成真实的故障恢复动态而非模拟响应。在ISETrace上微调后,使用Qwen3-8B在标准协议下的代理工具使用任务中,ClawEval pass@1从19.3提升至37.7。该结果优于零样本GPT-4o和四倍大的Qwen3-32B基础模型。对阶段2的消融实验证明多轮模拟带来了大部分性能提升。我们在该https URL发布所有源代码和数据集。

英文摘要

Training capable OS agents requires data that simultaneously captures structured user intents, multi-turn task delegation, and grounded tool execution--properties absent from existing datasets. We propose ISE (Intent -> Simulate -> Execute), a three-stage synthesis paradigm that addresses these gaps jointly. Stage 1 constructs roughly 50000 structured intents via a 4D framework (Persona x Domain x Task x Complexity); after deduplication the pool contains 43956 unique intents and attains a Vendi Score of 61.57 over the entire pool on mpnet-base-v2 embeddings (cosine kernel, q=1). Stage 2 drives multi-turn user-agent interaction through a role-locked user simulator that grounds each user turn in actual execution outcomes, producing 23132 complete trajectories averaging 8.12 user turns and 68.24 total dialogue turns. Stage 3 runs every tool call inside a live, isolated OS workspace, generating authentic failure-recovery dynamics instead of simulated responses. Fine-tuning on ISETrace improves ClawEval pass@1 from 19.3 to 37.7 using Qwen3-8B on agent tool-use tasks with a standard protocol. This result outperforms zero-shot GPT-4o and the larger Qwen3-32B base model which is four times bigger. An ablation on Stage 2 proves multi-turn simulation brings a large portion of the performance gain. We release all source code and dataset at https://github.com/Valiere01/ISE-Trace.

2606.11505 2026-06-11 cs.CV cs.AI cs.CR 新提交

On the Study of Biometric Spoofing Detection using Deep Learning

基于深度学习的生物特征欺骗检测研究

Kumar Kartikey, Nikos Komninos

AI总结 评估MobileNetV2、DenseNet-121、Inception-v3和STD模型在面部识别系统欺骗检测中的性能,MobileNetV2以92%准确率最优,适合实际应用。

详情
AI中文摘要

生物特征系统越来越多地部署在安全应用中;然而,它们仍然容易受到欺骗攻击,攻击者利用伪造的生物特征数据获取未经授权的访问。本研究评估了最先进的机器学习模型MobileNetV2、DenseNet-121、Inception-v3和欺骗痕迹解缠(STD)在面部识别系统中检测欺骗攻击的有效性。使用CelebA-Spoof数据集,研究通过准确率、精确率、召回率和F1分数等指标评估模型有效性。在MSU-MFSD数据集上进行跨数据集验证以评估泛化能力。结果表明MobileNetV2是最有效的模型,在平衡计算效率的同时达到92%的准确率,使其适用于实际应用。Inception-v3表现出中等鲁棒性,而DenseNet-121和STD在泛化方面存在困难。研究结果强调了在领域自适应和混合架构方面取得进展以增强生物特征安全系统的必要性。

英文摘要

Biometric systems are increasingly deployed in security applications; however, they remain vulnerable to spoofing attacks, in which attackers exploit counterfeit biometric data to gain unauthorized access. This research evaluates the effectiveness of state-of-the-art machine learning models, MobileNetV2, DenseNet-121, Inception-v3, and Spoof Trace Disentanglement (STD) in detecting spoofing attacks within facial recognition systems. Using the CelebA-Spoof dataset, the study evaluates model effectiveness using metrics such as accuracy, precision, recall, and F1 Score. Cross-dataset validation is carried out on the MSU-MFSD dataset to assess generalizability. The results show MobileNetV2 as the most efficient model, achieving 92% accuracy while balancing computational effectiveness, making it appropriate for real-life applications. Inception-v3 shows moderate robustness, while DenseNet-121 and STD struggle with generalization. The findings highlight the need for advances in domain adaptation and hybrid architectures to enhance biometric security systems.

2606.11490 2026-06-11 cs.LG cs.SY eess.SY 新提交

OmniLoc: A Geometry-Aware Foundation Model for Anchor-Free UE Localization Across Diverse Indoor Environments

OmniLoc: 一种几何感知的基础模型,用于跨多样室内环境的无锚点用户设备定位

Lei Chu, Yuning Zhang, Omer Gokalp Serbetci, Anushka Katiyar, Bassel Abou Ali Modad, Andreas F. Molisch

AI总结 提出OmniLoc,首个基于无线测量的基础模型,通过统一输入分词、几何感知Transformer和几何感知位置估计模块,实现跨室内环境的鲁棒无锚点定位,显著优于现有方法。

详情
AI中文摘要

由于建筑几何形状、可检测接入点(AP)集合以及接收信号异质性的显著变化,基于无线测量的室内定位在大规模部署中仍然具有挑战性。现有的基于学习的方法通常仅在有限环境下表现良好,并在环境变化下性能下降,使得在多样室内环境中进行鲁棒的无锚点定位变得极其困难。本文提出OmniLoc,一种环境交互式基础模型,用于跨多样室内环境的无锚点用户设备定位。据我们所知,OmniLoc是首个直接基于无线测量构建的用于此任务的基础模型。OmniLoc基于三个关键设计。首先,统一输入分词模块将异构无线测量转换为更易于学习的通用表示。其次,几何感知Transformer通过强调主导AP同时聚合来自辅助AP的互补证据,执行AP感知特征提取。第三,几何感知位置估计模块根据几何嵌入进行回归,以生成几何一致的位置预测。我们在大规模内部数据集和公共基准数据集上评估OmniLoc。结果表明,OmniLoc显著优于现有方法,当其设计组件集成时能持续改进现有骨干网络,并在跨环境评估中展现出强大的泛化能力。

英文摘要

Indoor localization from wireless measurements remains challenging in large-scale deployments due to substantial variation in building geometry, the set of detectable access points (APs), and the heterogeneity of received signals. Existing learning-based methods often perform well only in limited settings and degrade under environmental shifts, making robust anchor-free localization across diverse indoor environments notoriously difficult. In this paper, we present OmniLoc, an environment-interactive foundation model for anchor-free user equipment localization across diverse indoor environments. To the best of our knowledge, OmniLoc is the first foundation-model-based approach built directly on wireless measurements for this task. OmniLoc is built on three key designs. First, a unified input tokenization module converts heterogeneous wireless measurements into a common representation that is more amenable to learning. Second, a geometry-aware Transformer performs AP-aware feature extraction by emphasizing dominant APs while aggregating complementary evidence from supporting APs. Third, a geometry-aware location estimation module conditions regression on geometric embeddings to produce geometrically consistent location predictions. We evaluate OmniLoc on both a large-scale in-house dataset and a public benchmark dataset. Results show that OmniLoc significantly outperforms existing methods, consistently improves existing backbones when its design components are integrated, and demonstrates strong generalization in cross-environment evaluations.

2606.11474 2026-06-11 cs.LG cs.SY eess.SY physics.acc-ph 新提交

Mahalanobis-Guided Latent OOD Detection for Hybrid ES-DRL Control in Time-Varying Systems

基于马氏距离的潜在分布外检测用于时变系统中混合ES-DRL控制

Shaifalee Saxena, Alexander Scheinker

AI总结 针对时变系统中强化学习控制器性能下降问题,提出基于变分自编码器潜在空间马氏距离的分布外检测方法,实现与极值搜索控制器的自适应切换,并在粒子加速器控制中验证有效性。

详情
AI中文摘要

本文研究了非线性时变系统中基于马氏距离的潜在分布外(OOD)检测,用于测试时RL控制器切换。RL控制器可以在训练分布内快速控制高维系统,但当时间变化动力学产生未见过的观测时,其性能可能下降。我们考虑一个组合的ES-DRL控制器,其中RL提供快速的分布内动作,而有界极值搜索(ES)在OOD操作下提供鲁棒的模型无关控制。关键挑战在于决定何时切换。我们在分布内束流剖面观测上训练变分自编码器(VAE),并使用VAE潜在空间中的马氏距离在测试时检测OOD束流剖面。此OOD决策设置一个二元开关,选择RL控制器或ES控制器。我们在安全关键的粒子加速器控制中评估该方法。在此设置中,空间磁体运动产生RL训练期间未见过的OOD束流剖面。VAE潜在空间的可视化表明,所提方法识别出此OOD场景,并为组合控制器中RL和ES之间的切换提供可解释信号。

英文摘要

In this paper, we study Mahalanobis-guided latent out-of-distribution (OOD) detection for test-time RL controller switching in nonlinear time-varying systems. RL controllers can quickly control high-dimensional systems within the training distribution, but their performance can degrade when time-varying dynamics produce unseen observations. We consider a combined ES--DRL controller, where RL provides fast in-distribution actions and bounded extremum seeking (ES) provides robust model-independent control under OOD operation. The key challenge is deciding when to switch. We train a variational autoencoder (VAE) on in-distribution beam-profile observations and use Mahalanobis distance in the VAE latent space to detect OOD beam profiles at test time. This OOD decision sets a binary switch that selects either the RL controller or the ES controller. We evaluate the approach in safety-critical particle accelerator control. In this setting, spatial magnet motion creates OOD beam profiles that were not seen during RL training. Visualization of the VAE latent space shows that the proposed method identifies this OOD scenario and provides an interpretable signal for switching between RL and ES in the combined controller.

2606.11417 2026-06-11 cs.LG cs.AI stat.ML 新提交

Signed Compression Progress on a Sealed Audit is Goodhart-Resistant

密封审计上的有符号压缩进展是古德哈特抵抗的

Ayush Mittal, Dhruv Gupta

AI总结 提出有符号压缩进展作为内在动机,证明其累积奖励等于审计改进,且对有限审计面板具有假阳性预算,抵抗古德哈特定律。

详情
Comments
16 pages, 7 figures. Lean 4 (Mathlib) mechanized core and ARC-TGI experiment code: https://github.com/Zetetic-Dhruv/audit-compression-progress
AI中文摘要

压缩进展是一个长期提出的内在动机方案:当智能体的世界模型在预测或压缩经验方面变得更好时给予奖励。民间声称这种奖励是“可信的”,因为它只在学习时支付。我们使这一点精确化并证明它。如果内在奖励是固定密封审计损失的有符号减少,即 r_t = E(theta_{t-1}) - E(theta_t),那么累积奖励恰好望远镜式地归结为端点审计改进,因此没有策略可以在真实审计性能停滞或下降时无限推高奖励。对于有限审计面板,同样的结果成立,并带有尖锐的假阳性预算:累积经验奖励最多为真实审计改进加上 2 Delta_n(F, delta),即模型类的均匀审计偏差。这是无水平依赖的:一旦密封面板均匀控制该类,随时间变化的适应性无需付出代价。该定理还识别了失败模式:如果进展被截断、在智能体自身流上评分、暴露于可重用面板上的高容量模型,或应用于使 Delta_n 无效的神经类,则保证消失。我们给出了结构核心(望远镜式、有限审计界、有限吉布斯和熵下限)的 Lean 4 机械化,以及在 ARC-TGI 网格变换生成器上带有自适应保留攻击的实验套件。实验证实了理论:有限审计偏差按 n^{-0.527} 缩放;有符号进展抵抗截断农场、流泄漏和噪声电视好奇心;朴素的可重用审计可被黑盒标量反馈利用,而标准发布防御将攻击保持在 2 Delta_n 阈值以下。密封审计上的有符号压缩进展是真正改进的会计信号。

英文摘要

Compression progress is a long-standing proposal for intrinsic motivation: reward an agent when its world model becomes better at predicting or compressing experience. The folk claim is that this reward is "credible" because it is paid only for learning. We make this precise and prove it. If intrinsic reward is the signed decrease of a fixed sealed-audit loss, r_t = E(theta_{t-1}) - E(theta_t), then cumulative reward telescopes exactly to endpoint audit improvement, so no policy can push reward up indefinitely while true audit performance stagnates or degrades. For finite audit panels the same result holds with a sharp false-positive budget: cumulative empirical reward is at most true audit improvement plus 2 Delta_n(F, delta), the uniform audit deviation of the model class. This is horizon-free: adaptivity over time costs nothing once the sealed panel uniformly controls the class. The theorem also identifies the failure modes: the guarantee disappears if progress is clipped, scored on the agent's own stream, exposed to a high-capacity model on a reusable panel, or applied to a neural class that makes Delta_n vacuous. We give a Lean 4 mechanization of the structural core (telescoping, the finite-audit bound, finite Gibbs, and the entropy floor) and an experiment suite on ARC-TGI grid-transformation generators with adaptive holdout attacks. Experiments confirm the theory: finite-audit deviation scales as n^{-0.527}; signed progress resists clip-farming, stream leakage, and noisy-TV curiosity; naive reusable audits are exploitable by black-box scalar feedback, while standard release defenses keep the attack below the 2 Delta_n threshold. Signed compression progress on a sealed audit is an accounting signal of genuine improvement.

2606.11400 2026-06-11 cs.SD cs.AI eess.AS 新提交

Steering Where to Listen: Instruction-Based Activation Steering Redirects Temporal Attention in Large Audio-Language Models

引导听哪里:基于指令的激活操控重定向大型音频语言模型中的时间注意力

Tsung-En Lin, Hung-Yi Lee

AI总结 提出基于指令的向量操控方法,通过对比不同指令下的激活来重定向音频令牌的时间注意力,实现无需训练的声音事件定位,显著优于直接提示和随机基线。

详情
AI中文摘要

大型音频语言模型(LALMs)在音频理解方面表现出色,但很少揭示它们关注音频信号的哪个部分。我们引入了基于指令的向量操控,该方法通过对比不同指令提示下的激活来构建操控向量,同时保持音频不变。通过对LALM注意力的系统探测,我们发现——与标准提示或基于音频的操控不同——这种干预显著重新分配了分配给音频令牌的时间注意力,将其集中在声学相关的区域。然后我们展示了这种注意力转移在行为上是有意义的:在受控的三事件设置中,读取由操控引起的最大注意力变化的时间位置,可以恢复查询声音事件的位置,而无需任何训练,在Qwen2-Audio和Audio Flamingo 3上分别达到60.87%和68.72%与真实区间的重叠,远高于直接提示(31.84%,46.75%)和随机基线(27.74%)。我们的结果表征了LALMs中基于指令的操控的机制特性,并为这些模型编码的潜在时间结构提供了一种无需训练的探测方法。

英文摘要

Large Audio-Language Models (LALMs) excel at audio understanding but expose little about where in an audio signal they attend. We introduce instruction-based vector steering, which constructs a steering vector by contrasting activations from differently instructed prompts while keeping the audio fixed. Through a systematic probe of LALM attention, we find that - unlike standard prompting or audio-based steering - this intervention significantly redistributes the temporal attention allocated to audio tokens, concentrating it on acoustically relevant regions. We then show that this attention shift is behaviorally meaningful: in a controlled three-event setting, reading out the temporal position of maximal steering-induced attention change recovers the location of a queried sound event without any training, attaining 60.87% and 68.72% overlap with ground-truth intervals on Qwen2-Audio and Audio Flamingo 3, far above direct prompting (31.84%, 46.75%) and random baselines (27.74%). Our results characterize a mechanistic property of instruction-based steering in LALMs and provide a training-free probe for the latent temporal structure these models encode.

2606.11399 2026-06-11 cs.CL 新提交

Scenario-based Probing and Steering Cultural Values in Large Language Models--Extended Version

基于场景的大型语言模型文化价值观探测与引导——扩展版

Trung Duc Anh Dang, Tung Kieu, Sarah Masud

AI总结 提出基于场景的行为困境方法,通过令牌级概率和激活引导探测并调整LLM在英格尔哈特-韦尔泽尔文化轴上的潜在价值观,发现不同文化维度的引导存在耦合效应。

详情
Comments
18 pages
AI中文摘要

大型语言模型(LLM)被部署在不同文化背景下,但往往反映出从训练数据中继承的同质化价值观。对文化一致性的评估通常依赖于直接提示调查式问题,这常常引发中性或安全对齐的回应,无法捕捉模型的潜在偏好。我们提出了一个框架,用于沿着世界价值观调查(WVS)的英格尔哈特-韦尔泽尔两个轴探测和引导LLM中的潜在文化表征。通过将社会价值观问题转化为基于场景的行为困境,我们提取令牌级概率来测量隐含价值观,并应用激活引导(可选地与基于国家的提示结合),无需重新训练即可改变模型行为。在三个开源LLM和四种目标文化中,我们发现引导能力存在显著差异,并识别出潜在纠缠,即沿着一个文化维度的干预会引发另一个维度的变化。这种耦合反映了人类WVS数据中的相关性,并在激活、提示和混合引导中持续存在。它限制了轴独立的对齐,尽管一般任务性能基本保持。

英文摘要

Large Language Models (LLMs) are deployed across cultural contexts but often reflect homogenized values inherited from training data. Evaluations of cultural alignment typically rely on direct prompting with survey-style questions, which frequently elicit neutral or safety-aligned responses and fail to capture underlying model preferences. We propose a framework for probing and steering latent cultural representations in LLMs along the two Inglehart--Welzel axes of the World Values Survey (WVS). By translating social value questions into scenario-based behavioral dilemmas, we extract token-level probabilities to measure implicit values and apply activation steering, optionally combined with country-conditioned prompting, to shift model behavior without retraining. Across three open-source LLMs and four target cultures, we find substantial variation in steerability and identify latent entanglement, where interventions along one cultural dimension induce shifts along another. This coupling mirrors correlations in human WVS data and persists across activation, prompt, and hybrid steering. It constrains axis-independent alignment, though general task performance is largely preserved.

2606.11379 2026-06-11 cs.AI 新提交

Automated Mediator for Human Negotiation: Pre-Mediation via a Structured LLM Pipeline

人类谈判的自动调解器:通过结构化LLM流水线进行预调解

Jamie Bergen, Sarit Kraus

AI总结 提出一种结构化LLM流水线作为自动调解器,在整合性谈判中支持预调解,通过分解准备任务为专用模块,在短期自我报告结果上与人类调解员相当,并在偏好推理任务上误差降低36%。

详情
Comments
12 pages, 7 figures
AI中文摘要

预调解是直接人类谈判前的准备阶段,在达成互利协议中起着关键作用,但由于成本、时间和缺乏训练有素的调解员而常被省略。我们引入了一种用于人类谈判的自动调解器,实现为结构化LLM模块流水线,在整合性谈判环境中支持预调解。该流水线将准备分解为对话、偏好预测、响应级批评和结构化总结的专用模块,分离推理、生成和评估,以解决单一提示方法的局限性。我们按照常见的LLM系统术语将每个模块称为“智能体”,但组件并非自主且不进行点对点交互;输出按固定顺序向前传递。我们在两个受控人类受试者实验中评估该系统,在多议题谈判场景中将基于AI的预调解与专业人类调解员进行比较。在短期自我报告测量中,自动调解器在准备结果上与人类调解员大致相当,包括对调解员的信任和达成互利协议的信心,同时在我们场景和提示下,偏好推理任务的误差显著降低(RMSE降低36%)。第二项研究表明,有针对性的提示优化将过度肯定模式从36.6%降至16.8%,与人类调解员基线匹配。我们的发现表明,结构化LLM流水线可以在短期自我报告准备结果上提供与人类调解员大致相当的可扩展、低投入的预调解支持。该流水线的单方设计反映了当前人类调解员进行预调解的方式,并支持在争议各方之间并行部署,从而实现可扩展性。

英文摘要

Pre-mediation, the preparatory phase preceding direct human negotiation, plays a critical role in achieving mutually beneficial agreements, yet is often omitted due to cost, time, and limited access to trained mediators. We introduce an automated mediator for human negotiation, implemented as a structured pipeline of LLM modules, that supports pre-mediation in integrative negotiation settings. The pipeline decomposes preparation into specialized modules for dialogue, preference prediction, response-level critique, and structured summarization, separating inference, generation, and evaluation to address limitations of monolithic single-prompt approaches. We use the term "agent" for each module following common LLM-systems terminology, but the components are not autonomous and do not interact peer-to-peer; outputs are passed forward in a fixed sequence. We evaluate the system in two controlled human-subject experiments comparing AI-based pre-mediation with professional human mediators in a multi-issue negotiation scenario. On short-term self-reported measures, the automated mediator achieves preparation outcomes broadly comparable to human mediators, including trust in the mediator and confidence in reaching mutually beneficial agreements, while achieving substantially lower error on the preference-inference task under our scenario and prompts (36% lower RMSE). A second study shows that targeted prompt refinements reduce excessive affirmation patterns from 36.6% to 16.8%, matching human mediator baselines. Our findings suggest that structured LLM pipelines can provide scalable, low-effort pre-mediation support broadly comparable to human mediators on short-term self-reported preparation outcomes. The pipeline's single-party design mirrors how human mediators run pre-mediation today and enables parallel deployment across all parties to a dispute, supporting scalability.

2606.11371 2026-06-11 cs.CL cs.AI eess.AS eess.SP 新提交

The Dynamics of Human and AI-Generated Language: How Semantics Fluctuates across Different Timescales

人类与AI生成语言的动态:语义如何在不同时间尺度上波动

Han-Jen Chang, Yasir Çatal, Angelika Wolman, Agustín Ibáñez, David Smith, I-Wen Su, Kai-Yuan Cheng, Georg Northoff

AI总结 提出语义时间尺度分析流程,通过自相关窗口度量(ACW-0)量化人类与AI生成语音中语义特异性与上下文相似性的时间组织,发现ACW-0长度与词汇通用性相关,且该关联在随机化后被削弱。

详情
Journal ref
Computer Speech & Language (2026) 102013
Comments
45 pages, 4 figures, 4 tables. Accepted manuscript; published in Computer Speech & Language
AI中文摘要

口语,无论是人类还是大型语言模型(LLM)产生的,都会随时间展开,具有变化的语义内容。然而,我们仍然缺乏简单、可解释的时间序列特征来捕捉通用与特定内容如何随时间分布,并可用于比较人类和AI生成的语音。我们引入了一个语义时间尺度分析流程,将带有时间戳的词级转录转换为语义时间序列。对于每个口语叙述,我们计算(i)基于WordNet词深度的语义特异性,以及(ii)基于SBERT嵌入的上下文相似性,并使用自相关窗口度量(ACW-0及相关指标)量化其时间依赖性。然后,我们将原始语音与多种随机化对照进行比较,这些对照选择性地破坏词汇身份、时间顺序和词时长。在人类朗读的自传叙述、TTS朗读和LLM生成的文本(通过TTS渲染)中,我们发现语义时间序列中ACW-0较长的片段往往包含更多通用词汇,而ACW-0较短的片段则富含更具体的词汇。当词序和计时被随机化时,这些关联被强烈削弱或消除,表明基于ACW的度量捕捉了语义内容超越静态词汇分布的非平凡时间组织。我们的结果表明,基于ACW的语义时间尺度是分析和比较人类与AI生成语音时间结构的有用特征系列。

英文摘要

Spoken language, whether produced by humans or large language models (LLM), unfolds over time with varying semantic content. However, we still lack simple, interpretable time-series features that capture how generic versus specific content is distributed over time, and that can be used to compare human and AI-generated speech. We introduce a semantic-timescale analysis pipeline that turns word-level transcripts with timestamps into semantic time-series. For each spoken narrative, we compute (i) semantic specificity using WordNet-based word depth and (ii) contextual similarity using SBERT embeddings and quantify their temporal dependence using autocorrelation-window measures (ACW-0 and related metrics). We then compare original speech to multiple shuffled controls that selectively disrupt lexical identity, temporal order, and word duration. Across human-read autobiographical narratives, TTS readings, and LLM-generated texts rendered with TTS, we find that segments with longer ACW-0 in the semantic time-series tend to contain more generic vocabulary, whereas segments with shorter ACW-0 are enriched in more specific words. These associations are strongly attenuated or abolished when word order and timing are randomized, indicating that ACW-based measures capture non-trivial temporal organization of semantic content beyond static lexical distributions. Our results suggest that ACW-based semantic timescales are a useful family of features for analyzing and comparing the temporal structure of human and AI-generated speech.

2606.11348 2026-06-11 cs.LG 新提交

SwiftCTS: Fast Cross-Design Prediction and Pareto Optimization of Clock Tree Metrics via Few-Shot Calibration

SwiftCTS: 通过少样本校准实现时钟树指标的快速跨设计预测与帕累托优化

Barsat Khadka, Kawsher Roxy, Md Rubel Ahmed

AI总结 提出SwiftCTS框架,利用物理信息代理模型和K-shot乘法校准机制,在数秒内训练、亚毫秒推理,实现跨设计时钟树指标的准确预测与帕累托优化。

详情
AI中文摘要

时钟树综合(CTS)是物理设计流程中计算成本高昂的阶段,需要迭代调用EDA工具以探索庞大的配置空间,从而优化功耗、线长和时序偏差。现有的机器学习方法需要昂贵的重新训练或微调周期来适应未见过的宏架构,并且在架构上与穷举组合搜索所需的数百万次评估不匹配。我们提出了SwiftCTS,一个物理信息代理框架,同时解决了这两个局限性。通过将轻量级、基于物理的统计特征与梯度提升集成相结合,SwiftCTS在CPU上训练时间不到五秒,且无需GPU支持即可实现亚毫秒级推理。为了处理分布外(OOD)设计而无需重新训练或微调,我们引入了一种K-shot乘法校准机制,该机制仅需一到两次物理参考运行即可锚定预测,将未见过的宏上的功耗预测误差从24.5%降低到3.3%,线长误差从56.6%降低到1%以下。将该引擎与进化优化器集成,SwiftCTS在十秒内评估了100,000个CTS配置,生成了在OpenROAD流程中经过物理验证的帕累托最优前沿。闭环验证确认了功耗和线长的预测误差低于0.5%,时序偏差预测在OOD基准上在五皮秒以内,在所有目标指标上始终优于默认工具启发式方法。代码公开于:\href{this https URL}{this https URL}

英文摘要

Clock Tree Synthesis (CTS) is a computationally expensive stage in the physical design flow, requiring iterative EDA tool invocations to navigate a vast configuration space for optimal power, wirelength, and timing skew. Existing machine learning approaches require computationally expensive retraining or fine-tuning cycles to adapt to unseen macro architectures and are architecturally mismatched to the millions of evaluations demanded by exhaustive combinatorial search. We present SwiftCTS, a physics-informed surrogate framework that addresses both limitations simultaneously. By coupling lightweight, physics-grounded statistical features with gradient-boosted ensembles, SwiftCTS trains in under five seconds on a CPU and delivers sub-millisecond inference without GPU support. To handle out-of-distribution (OOD) designs without retraining or fine-tuning, we introduce a K-shot multiplicative calibration mechanism that anchors predictions to just one or two physical reference runs, reducing power prediction error from 24.5\% to 3.3\% and wirelength error from 56.6\% to under 1\% on unseen macros. Integrating this engine with an evolutionary optimizer, SwiftCTS evaluates 100,000 CTS configurations in under ten seconds, yielding Pareto-optimal frontiers that are physically validated within the OpenROAD flow. Closed-loop validation confirms prediction errors below 0.5\% for power and wirelength, and timing skew predictions within five picoseconds on an OOD benchmark, consistently outperforming default tool heuristics across all target metrics. Code publicly available at: \href{https://anonymous.4open.science/r/SwiftCTS-7E6E}{https://github.com/BarsatKhadka/SwiftCTS}

2606.11316 2026-06-11 cs.CL 新提交

Schützen: Evaluating LLM Safety in Bulgarian and German Contexts

Schützen: 在保加利亚语和德语语境中评估LLM安全性

Kiril Georgiev, Yuxia Wang, Dimitar Iliyanov Dimitrov, Preslav Nakov, Ivan Koychev

AI总结 针对现有安全评估数据集以英语和中文为主的问题,构建了覆盖低资源语言保加利亚语和高资源语言德语的Schützen安全数据集,实验揭示多语言LLM在安全行为上的显著跨语言差异,强调了区域特定评估资源的必要性。

详情
Comments
19 pages, 13 tables, 12 figures
AI中文摘要

大型语言模型越来越多地部署在专业领域,带来了难以预测的风险,包括生成有害或不尊重的内容。尽管在开发安全评估数据集方面取得了实质性进展,但现有资源仍然 overwhelmingly 以英语和中文为中心。这种限制在评估共享社会文化、法律和伦理背景下的语言时尤为明显。为了解决这一差距,我们引入了Schützen:一个德语-保加利亚语安全数据集,旨在评估模型在风险下的可回答性,涵盖低资源语言(保加利亚语)和高资源语言(德语)。使用多语言和特定语言LLMs的实验揭示了安全行为中显著的跨语言差异,强调了需要定制的、特定区域的评估资源,以支持在德国和保加利亚负责任地部署LLMs。数据集和代码可在以下网址获取:https://this URL。警告:本文包含可能具有冒犯性、有害性或偏见性的示例。

英文摘要

Large language models are increasingly deployed across professional domains, bringing hard-to-predict risks, including the generation of harmful or disrespectful content. Although substantial progress has been made in developing safety evaluation datasets, existing resources remain overwhelmingly English- and Chinese-centric. This limitation is particularly pronounced when evaluating languages that operate within shared sociocultural, legal, and ethical contexts. To address this gap, we introduce Schützen: a German--Bulgarian safety dataset designed to assess model answerability under risk, covering both a low-resource language (Bulgarian) and a high-resource language (German). Experiments with multilingual and language-specific LLMs reveal pronounced cross-language differences in safety behavior, highlighting the necessity of tailored, region-specific evaluation resources to support the responsible deployment of LLMs in Germany and Bulgaria. Datasets and code are available at https://github.com/xnlp-lab/Schutzen. Warning: this paper contains examples that may be offensive, harmful, or biased.

2606.11290 2026-06-11 cs.LG cs.AI cs.CL 新提交

FlowBank: Query-Adaptive Agentic Workflows Optimization through Precompute-and-Reuse

FlowBank: 通过预计算与复用实现查询自适应智能体工作流优化

Lingzhi Yuan, Chenghao Deng, Fangxu Yu, Souradip Chakraborty, Mohammad Rostami, Furong Huang

AI总结 提出FlowBank框架,通过预计算多样化工作流并压缩为紧凑组合,在推理时自适应选择最优工作流,平衡性能与成本,在五个基准上平均得分最高且成本可控。

详情
AI中文摘要

基于大型语言模型的多智能体系统日益强大,但当前的智能体工作流优化范式存在令人不满意的权衡。任务级方法花费大量离线计算却只部署单个工作流,导致互补候选未被使用;而查询级方法为每个查询合成新工作流,推理成本高昂。我们的动机分析表明,这些范式更多是互补而非竞争:离线搜索中发现的工作流通常解决不同子集的查询,许多由昂贵查询级生成处理的查询已经可以通过更便宜的预计算工作流解决。这暗示了一个不同的目标:与其寻找一个普遍最佳的工作流或为每个实例重新生成,不如构建一个紧凑的、可复用的互补工作流库,并在推理时自适应地选择。为此,需要解决三个耦合问题:生成互补而非冗余的候选、压缩成小型可部署组合、在性能-成本权衡下为每个查询分配正确的工作流。我们提出FlowBank,一个基于组合的智能体工作流优化的三阶段框架。多样化阶段提出DiverseFlow,引导搜索覆盖未充分覆盖的查询,产生高覆盖率的候选池。精炼阶段提出CuraFlow,将候选池压缩为冗余最小的紧凑组合。匹配阶段将部署建模为查询-工作流二分图上的边值预测,将每个传入查询路由到预测效用最佳的组合成员。在五个基准上,FlowBank在评估方法中实现了最高平均得分,同时保持成本竞争力,相比最强的自动和手工基线分别相对提升4.26%和14.92%。

英文摘要

Large Language Model (LLM)-based multi-agent systems are increasingly powerful, but current agentic workflow optimization paradigms make an unsatisfying trade-off. Task-level methods spend substantial offline compute yet deploy only a single workflow, leaving complementary candidates unused, while query-level methods synthesize a new workflow per query at substantial inference cost. Our motivating analysis shows these paradigms are more complementary than competing: workflows discovered during offline search often solve different subsets of queries, and many queries handled by expensive query-level generation can already be solved by cheaper precomputed workflows. This suggests a different objective: rather than searching for one universally best workflow or regenerating one per instance, we should build a compact bank of reusable, complementary workflows and select among them adaptively at inference time. Doing so requires solving three coupled problems: generating complementary rather than redundant candidates, compressing them into a small deployable portfolio, and assigning each query to the right workflow under a performance-cost trade-off. To this end, we present FlowBank, a three-stage framework for portfolio-based agentic workflow optimization. Diversifying proposes DiverseFlow to steer search toward under-covered queries and produce a high-coverage candidate pool. Curating proposes CuraFlow to compress this pool into a compact portfolio with minimal redundancy. Matching casts deployment as edge-value prediction on a query-workflow bipartite graph and routes each incoming query to the portfolio member with the best predicted utility. Across five benchmarks, FlowBank achieves the highest average score among the evaluated methods while remaining cost-competitive, improving over the strongest automated and handcrafted baselines by 4.26% and 14.92% relative, respectively.

2606.11285 2026-06-11 cs.CV 新提交

EventRadar: Long-Range Visual UAV Discovery through Spatiotemporal Event Sensing

EventRadar:通过时空事件感知实现远程视觉无人机发现

Zhiting Zhou, Xingchen Liu, Xinglin Yu, Jiashen Chen, Haoyang Wang, Jingao Xu, Yunhao Liu, Xinlei Chen

AI总结 针对远程小目标无人机检测难题,提出EventRadar方法,利用事件相机捕捉螺旋桨引起的时域周期性,结合场景锚定几何证据(SAGE)和梳状引导谐波组学习迭代收缩阈值算法(CHG),在700-1500米距离上实现高精度检测。

详情
AI中文摘要

机场、公共场所及其他敏感区域周围的未经授权无人机活动使得受保护空域监测日益重要。一个实用的感知系统必须搜索广阔的角度区域,发现小型远程目标,并在限制周界被突破前返回方位支持和无人机特定证据。现有的无人机检测路径通常依赖空间组织的证据,如身体范围、轮廓或轨迹连续性。然而,在远距离上,随着目标足迹减弱和图像平面支撑缩小,这些线索变得难以保持和验证。EventRadar遵循一种互补线索:螺旋桨引起的时域周期性,最近的事件相机感知研究表明,在目标外观变弱后,这种周期性可以揭示无人机特有的运动。我们将这一线索扩展到千米级主动感知,使用事件相机原型。场景锚定几何证据(SAGE)将扫描事件与IMU姿态融合,维护一个方位索引的场景记忆,将瞬态候选支撑与持久背景杂波分离。然后,梳状引导谐波组学习迭代收缩阈值算法(CHG)将每个候选视为一个弱的高速率定时信号,并以固定计算量恢复相位不敏感的谐波证据。与相关事件相机基线在700-1500米无人机事件记录上的比较,EventRadar实现了0.990 mAP$_{.3}$和0.949 F1$_{.3}$,将FN$_{.3}$降低到0.009,并在原型分析中展示了实时可行性。

英文摘要

Unauthorized unmanned aerial vehicle (UAV) activity around airports, public venues, and other sensitive sites has made protected-airspace monitoring increasingly important. A practical sensing system must search a wide angular region, find small long-range targets, and return both bearing support and UAV-specific evidence before a restricted perimeter is breached. Existing UAV detection paths often rely on spatially organized evidence, such as body extent, silhouette, or track continuity. At long range, however, these cues become difficult to preserve and verify as the target footprint weakens and its image-plane support shrinks. EventRadar follows a complementary cue: propeller-induced temporal periodicity, which recent event-camera sensing studies have shown can reveal UAV-specific motion after appearance becomes weak. We extend this cue to kilometer-scale active sensing with an event-camera prototype. Scene-Anchored Geometry Evidence (SAGE) fuses scanning events with IMU pose to maintain a bearing-indexed scene memory, separating transient candidate support from persistent background clutter. Comb-guided Harmonic-Group Learned Iterative Shrinkage and Thresholding Algorithm (CHG) then treats each candidate as a weak high-rate timing signal and recovers phase-insensitive harmonic evidence with fixed compute. Compared with related event-camera baselines on 700-1500 m UAV event recordings, EventRadar achieves 0.990 mAP$_{.3}$ and 0.949 F1$_{.3}$, reduces FN$_{.3}$ to 0.009, and shows real-time feasibility in prototype profiling.

2606.11270 2026-06-11 cs.LG cs.AI cs.CL 新提交

Quantifying Subliminal Behavioral Transfer Ratios in Language Model Distillation

量化语言模型蒸馏中的潜意识行为迁移比率

Uwe Konig, Hamza Kazmi, Ruizhe Li, Maheep Chaudhary

AI总结 通过控制教师模型行为强度并蒸馏学生模型,量化了潜意识行为迁移比率,发现迁移具有鲁棒性且呈现不同缩放行为。

详情
AI中文摘要

旨在将良性行为迁移到学生模型的语言模型蒸馏,也可能迁移教师模型中存在的不良特征,这种现象称为潜意识学习。虽然定性证据支持该效应的存在,但其程度尚未被系统表征。本研究通过控制两个教师模型(Llama-2-7B-Chat 和 Qwen2.5-7B-Instruct)在不同引导强度下,并仅使用良性数据蒸馏学生模型,量化了潜意识行为迁移比率。使用 GPT-4.1 作为评估器对 100 个 JailbreakBench 提示进行评估,结果表明迁移是鲁棒的,但表现出不同的缩放行为。Llama-2 表现出一个尖锐的阈值($\tau = {0.25,0.32} \ \text{beyond} \ \alpha = -0.15$),而 Qwen2.5 表现出连续且更高水平的迁移($\tau$ 高达 $0.61$)。

英文摘要

Distillation of a language model intended to transfer benign behavior to a student model may also transfer undesirable characteristics, if they are present in the teacher model, a phenomenon known as subliminal learning. While qualitative evidence supports the existence of this effect, its magnitude has not been systematically characterized. This study quantifies subliminal behavioral transfer ratios by steering two teacher models (Llama-2-7B-Chat and Qwen2.5-7B-Instruct) at varying steering strengths and distilling student models using only benign data. Evaluation on 100 JailbreakBench prompts with GPT-4.1, serving as the evaluator, indicates that transfer is robust but exhibits distinct scaling behaviors. Llama-2 demonstrates a sharp threshold ($τ= {0.25,0.32} \ \text{beyond} \ α= -0.15$), whereas Qwen2.5 displays continuous and higher levels of transfer ($τ$ up to $0.61$).

2606.11258 2026-06-11 cs.LG nlin.PS physics.comp-ph 新提交

Loss Landscape Diagnosis for Gradient-Based Gray-Scott System Inversion: Disentangling the Roles of PINN Components

基于梯度的Gray-Scott系统反演的损失景观诊断:解构PINN各组件的角色

Yan Yang

AI总结 通过直接反向传播稳态损失至未折叠的Gray-Scott模拟,发现优化因损失景观中的平坦高原和陡峭悬崖而失败,而PINN中的残差损失通过隐式编码完整PDE动力学避免了该病理现象。

详情
Comments
Accepted at the AI4Physics Workshop, ICML 2026 (non-archival). 14 pages, 10 figures
AI中文摘要

反应扩散系统的梯度基反演通常通过代理模型或物理信息神经网络(PINN)进行,而最直接的路径——通过PDE结构本身进行反向传播——在很大程度上被避免。我们将这条直接路径作为诊断探针,通过未折叠的Gray-Scott模拟反向传播稳态损失以恢复其参数,无需代理或神经网络增强。优化未能收敛,直接绘制损失景观将其失败定位于其几何结构——平坦高原无梯度信号,被与分岔边界对齐的陡峭悬崖所包围——这种结构在损失函数中重复出现,并且无论梯度如何路由到参数都会继承。将这一最小设置视为PINN的消融实验,我们解构了每个组件的作用:在神经网络固定的情况下,残差损失是PDE参数的二次函数,产生平滑的损失景观,因此仅凭它就能避免病理现象,通过隐式编码所有初始条件下的完整PDE动力学。而神经网络无法修复不适定的参数子空间,因此仅用于完成观测数据——这种分工此前未被明确。这些发现对PINN类方法具有具体的设计意义,并提供了关于何时添加维度实际上有帮助的更广泛启发。

英文摘要

Gradient-based inversion of reaction-diffusion systems is typically approached via surrogate models or physics-informed neural networks (PINNs), while the most direct route, backpropagation through the PDE's structure itself, has largely been avoided. We pursue this direct route as a diagnostic probe, backpropagating a steady-state loss through unrolled Gray-Scott simulation to recover its parameters, with no surrogate or neural-network augmentation. Optimization fails to converge, and plotting the landscape directly locates the failure in its geometry -- flat plateaus with no gradient signal, bounded by sharp cliffs that align with bifurcation boundaries -- a structure that recurs across loss functions and is inherited however the gradients are routed to parameters. Reading this minimal setup as an ablation of PINN, we disentangle each component's role: with the neural network fixed, the residual loss is quadratic in the PDE parameters and yields a smooth landscape, so it alone already avoids the pathology, by implicitly encoding the full PDE dynamics across all initial conditions. The neural network, for its part, cannot repair an ill-posed parameter subspace, and so serves only to complete the observed data -- a division of labor not previously made explicit. These findings carry concrete design implications for PINN-type methods and a broader heuristic on when added dimensions actually help.

2606.11247 2026-06-11 cs.LG cs.AI cs.AR 新提交

Physics-informed generative AI for semiconductor manufacturing: Enforcing hard physical constraints in generative models by construction

物理信息驱动的生成式AI在半导体制造中的应用:通过构造强制生成模型中的硬物理约束

Yaser Mike Banad, Sarah Sharif

AI总结 针对半导体制造中生成模型必须满足硬物理约束的问题,本文提出通过构造集成物理信息(如物理信息扩散、PDE约束变分模型等)来强制约束,而非事后过滤,并给出四种集成模式和未来研究方向。

详情
AI中文摘要

生成模型越来越多地被用于为物理系统提出设计、数据和控制动作,然而许多此类系统受硬物理约束而非感知合理性支配。半导体制造提供了一个严苛的测试案例:生成的掩模、布局、合成缺陷数据和工艺配方必须遵守光刻、传输、反应和器件物理约束,因为物理无效的样本不仅质量低劣,而且无法使用。本文认为,半导体制造揭示了一个更广泛的计算科学挑战,即用于受约束物理领域的生成式AI必须通过构造实现物理信息驱动,而非仅通过事后过滤来纠正。我们调查了新兴的架构工具包,包括物理信息扩散、PDE约束变分模型、神经算子先验和守恒律尊重生成网络,并展示了它如何与可微分光刻、TCAD、工艺仿真和自主实验相联系。我们识别了生成模型与基于物理的模拟器之间的四种集成模式,并提出了一个以物理保真度基准、可微分模拟器基础设施以及面向物理设计和制造的多模态基础模型为中心的研究议程。核心主张是分析性的而非修辞性的:在物理有效性是成功的关键标准的情况下,通过构造强制约束的架构应被期望优于事后过滤的架构,而晶圆厂正是这种区别最鲜明的环境。

英文摘要

Generative models are increasingly used to propose designs, data, and control actions for physical systems, yet many such systems are governed by hard physical constraints rather than by perceptual plausibility. Semiconductor manufacturing provides a demanding test case: generated masks, layouts, synthetic defect data, and process recipes must obey lithography, transport, reaction, and device-physics constraints, because physically invalid samples are not merely low quality but unusable. This Perspective argues that semiconductor manufacturing exposes a broader computational-science challenge, namely that generative AI for constrained physical domains must be physics-informed by construction, not corrected only through post-hoc filtering. We survey the emerging architectural toolkit, including physics-informed diffusion, PDE-constrained variational models, neural-operator priors, and conservation-law-respecting generative networks, and show how it connects to differentiable lithography, TCAD, process simulation, and autonomous experimentation. We identify four integration patterns between generative models and physics-based simulators, and we propose a research agenda centered on physics-fidelity benchmarks, differentiable simulator infrastructure, and multimodal foundation models for physical design and manufacturing. The central claim is analytical rather than rhetorical: where physical validity is the binding criterion of success, architectures that enforce it by construction should be expected to outperform those that filter for it after the fact, and the fab is the setting where this distinction is sharpest.

2606.11245 2026-06-11 cs.AI cs.NE q-bio.NC 新提交

Position: Hippocampal Explicit Memory Is the Cornerstone for AGI

立场:海马体显式记忆是通用人工智能的基石

Sangjun Park

AI总结 本文主张,将显式记忆整合到大语言模型中是迈向通用人工智能的关键,因为LLM的学习机制类似人类内隐记忆,而高阶认知功能依赖海马体显式记忆。

详情
Comments
Accepted to ICML 2026 (Position Paper Track)
AI中文摘要

大语言模型(LLM)在各种任务中展现了卓越的能力,提升了人们对通用人工智能(AGI)的期望。这篇立场论文认为,整合显式记忆是推动LLM迈向AGI的基石。关键原因在于,LLM的底层学习机制与人类内隐记忆高度相似。然而,AGI所需的高阶认知功能,如长期战略规划、元认知和符号推理,严重依赖海马体显式记忆,无法仅从内隐统计学习中产生。借鉴神经科学的发现,我提出这一观点,并辅以人工显式记忆系统的计算要求,希望促进进一步研究,为显式记忆整合奠定基础。

英文摘要

Large Language Models (LLMs) have demonstrated remarkable capabilities across various tasks, raising expectations for Artificial General Intelligence (AGI). This position paper argues that integrating explicit memory is the cornerstone for advancing LLMs toward AGI. The key reason is that the underlying learning mechanism of LLMs is highly analogous to human implicit memory. However, higher-order cognitive functions necessary for AGI, such as long-term strategic planning, metacognition, and symbolic reasoning, heavily rely on hippocampal explicit memory and cannot arise solely from implicit statistical learning. Drawing on findings from neuroscience, I advance this perspective and complement it with computational requirements for artificial explicit memory systems, hoping to foster further research and lay the groundwork for explicit memory integration.

2606.11221 2026-06-11 cs.CV 新提交

LAST: Bridging Vision-Language and Action Manifolds via Gromov-Wasserstein Alignment

LAST: 通过Gromov-Wasserstein对齐连接视觉-语言与动作流形

Huaihai Lyu, Chaofan Chen, Yuheng Ji, Xiansheng Chen, Pengwei Wang, Shanghang Zhang, Changsheng Xu

AI总结 提出LAST方法,通过李代数线性化和局部度量离散化,对齐视觉-语言语义几何与动作流形,解决异构空间不兼容问题,提升VLA模型收敛性和泛化性。

详情
AI中文摘要

我们从Gromov-Wasserstein视角研究视觉-语言-动作(VLA)学习,目标是使动作表征的关系几何与VL嵌入的语义几何兼容。然而,由于领域间的数学异质性,这种对齐并非易事:视觉-语言的语义空间在拓扑上是线性和各向同性的,而机器人动作的物理流形是非欧几里得和各向异性的。它们不兼容的度量结构使得直接回归不适定。为了解决这种不兼容性,我们引入了LAST(李代数动作空间分词器),它通过两阶段变换重建动作空间以建立与VL模态的局部度量兼容性:(1)全局拓扑线性化:通过李代数映射线性化动作流形,将轨迹转换为固定长度、物理可加的表示。(2)局部度量离散化:将表示分层离散化为模式和白化残差,生成近似各向同性的局部图表,这些图表在统计上与语义度量对齐。通过在全局和局部层面解决结构不匹配问题,LAST使VLA模型具有更优的收敛性和泛化性。

英文摘要

We take a Gromov-Wasserstein perspective on Vision-Language-Action (VLA) learning, where the goal is to make the relational geometry of action representations compatible with the semantic geometry of VL embeddings. However, this alignment is non-trivial due to the mathematical heterogeneity between the domains: the semantic space of vision-language is topologically linear and isotropic, whereas the physical manifold of robotic action is non-Euclidean and anisotropic. Their disjoint metric structures render direct regression ill-posed. To resolve this incompatibility, we introduce LAST (Lie-algebraic Action Space Tokenizer), which reconstructs the action space to establish local metric compatibility with the VL modality via a two-stage transformation: (1) Global Topological Linearization: linearizing the action manifold via Lie-algebraic mapping, converting trajectories into a fixed-length, physically additive representation. (2) Local Metric Discretization: hierarchically discretizing the representation into schemas and whitened residuals, yielding approximately isotropic local charts that are statistically aligned with the semantic metric. By resolving the structural mismatch at both global and local levels, LAST enables VLA models with superior convergence and generalizability.

2606.11220 2026-06-11 cs.CL 新提交

LifeSentence: Language models can encode human life course trajectories from longitudinal panel data

LifeSentence: 语言模型可以从纵向面板数据编码人类生命历程轨迹

Samuel Liu, Muchen Xi, William Yeoh, Joshua J. Jackson

AI总结 提出LifeSentence模型,将大型语言模型与纵向面板数据结合,通过结构化自然语言记录生命事件并微调预训练模型,在少样本条件下超越传统方法,实现生命事件预测与时间顺序重建。

详情
AI中文摘要

预测人类生命结果对于理解个体如何获得长寿健康的生活至关重要。传统的统计方法准确度有限,可能是因为忽略了生命历程的序列结构。现代方法如Transformer架构需要大规模训练数据,而大多数纵向面板研究缺乏此类数据。本文介绍LifeSentence,一种将大型语言模型与纵向面板数据相结合的生命历程推理模型。通过将每个生命事件表示为结构化的自然语言记录,并在一个包含预测、鲁棒性和推理的18任务评估分类体系上对预训练的240亿参数语言模型进行指令微调,LifeSentence利用预训练期间已编码的分布知识补充面板数据。该模型在来自德国社会经济面板的约65,000名个体上训练——比之前基于Transformer的方法少约45倍——在所有任务族上均优于经典和深度学习基线,在联合事件与时间预测上相比最佳基线实现三倍改进,并在从去除时间戳的事件集重建时间顺序时达到91.2%的Kendall tau系数。在没有显式监督的情况下,该模型仅从离散事件序列中恢复出记录的社会分层模式,包括教育溢价、性别工资差距和母亲惩罚。自然语言接口进一步支持定性新研究查询,例如将早期生活史连接到指定的晚年终点,使LifeSentence成为预测工具和对人类传记进行反事实探索的探针。

英文摘要

Forecasting human life outcomes is important to gain insights into how individuals attain long and healthy lives. Conventional statistical approaches yield limited accuracy, potentially due to discarding the sequential structure of the life course. Modern methods such as transformer architectures require large scale training data that most longitudinal panel studies lack. Here we introduce LifeSentence, a model for life-course reasoning that bridges large language models with longitudinal panel data. By representing each life event as a structured natural-language record and instruction-tuning a pretrained 24-billion-parameter language model across an 18-task evaluation taxonomy spanning prediction, robustness and reasoning, LifeSentence supplements panel data with distributional knowledge already encoded during pretraining. Trained on approximately 65,000 individuals from the German Socio-Economic Panel - roughly 45 times fewer than prior transformer-based approaches - LifeSentence outperforms classical and deep learning baselines across all task families, achieving a threefold improvement in joint event-and-timing prediction from best baselines and 91.2% Kendall's tau when reconstructing chronological order from timestamp-stripped event sets. Without explicit supervision, the model recovers documented patterns of social stratification, including the education premium, the gender wage gap and the motherhood penalty, from discrete event sequences alone. A natural-language interface further enables qualitatively new research queries, such as connecting an early-life history to a specified late-life endpoint, establishing LifeSentence as both a predictive tool and a probe for counterfactual exploration of human biographies.

2606.11205 2026-06-11 cs.LG cs.AI cs.CL 新提交

Dual-Stance Evaluation of Sycophancy: The Structure of Agreement and the Limits of Intervention

谄媚的双立场评估:同意的结构与干预的局限

Matthew James Buchan

AI总结 提出双立场评估方法,发现激活引导在减少谄媚时也会抑制对事实正确陈述的同意,揭示了表示可读但不可写的普遍差距。

详情
Comments
18 pages, 9 figures, accepted to TAIS 2026
AI中文摘要

激活引导可以改变LLM的行为,但标准评估通常不测试减少谄媚的方向是否也抑制对事实正确陈述的同意。我们引入了双立场评估,测试每个话题的两个立场,并将其应用于Llama-3-8B-Instruct上的质心差引导。我们发现一种分离:模型在几何上不同的子空间中表示谄媚和事实同意,但引导方向在两者上的投影相等,无法差异化地针对任一。因此,该方向同样减少对事实正确陈述(例如地球是圆的)和谄媚陈述的同意。两个激活组的所有其他静态属性都匹配,表明行为分离源于生成动态或残差流分析无法解析的更细粒度结构。该模式说明了一个普遍差距:从激活中可读的表示可能无法通过它们写入。

英文摘要

Activation steering can shift LLM behaviour, but standard evaluations do not typically test whether a sycophancy-reduction direction also suppresses agreement with factually correct statements. We introduce dual-stance evaluation, which tests both stances of each topic, and apply it to centroid-difference steering on Llama-3-8B-Instruct. We find a dissociation: the model represents sycophantic and factual agreement in geometrically distinct subspaces, yet the steering direction projects equally onto both and cannot differentially target either. The direction accordingly reduces agreement with factually correct statements (e.g. that the Earth is round) as well as sycophantic ones. All other static properties of the two activation groups are matched, suggesting the behavioural dissociation arises from generation dynamics or from finer-grained structure that residual-stream analysis cannot resolve. The pattern illustrates a general gap: representations that are readable from activations may not be writable through them.

2606.11199 2026-06-11 cs.CL cs.AI cs.IR cs.LG 新提交

NightFeats @ MMU-RAGent NeurIPS 2025: A Context-Optimized Multi-Agent RAG System for the Text-to-Text Track

NightFeats @ MMU-RAGent NeurIPS 2025: 面向文本到文本轨道的上下文优化多智能体RAG系统

Quentin Fever, Naziha Aslam

AI总结 提出一种结构化多智能体RAG系统NightFeats,通过检索、策展和组合三阶段分解知识合成,引入时序语义重排序、矛盾协调和引用保留架构,在MMU-RAGent竞赛中超越商业基线。

详情
Comments
5 pages, 1 figure, 1 table. NeurIPS 2025 Competition Track (MMU-RAGent). System developed October 2025
AI中文摘要

我们提出NightFeats,一个结构化的多智能体检索增强生成(RAG)系统,提交至NeurIPS 2025的MMU-RAGent竞赛,并在文本到文本轨道中获得最佳动态评估奖。本文并非以基准最大化目标,而是提出一个原则性流水线,将知识合成为三个协调阶段:检索、策展和组合,每个阶段由显式的中间表示和交接契约控制。受智能体上下文工程(ACE)启发,该系统引入时序语义重排序、有界矛盾协调和保留引用的组合作为核心架构原语。竞赛结果表明,NightFeats在LLM-as-a-Judge和人类Likert评估中超越了包括Claude-SonnetV2和Nova-Pro在内的商业基线,证实了架构透明性和可验证证据基础比单纯优化自动相似度指标的系统更符合人类偏好。

英文摘要

We present NightFeats, a structured multi-agent retrieval-augmented generation (RAG) system submitted to the MMU-RAGent competition at NeurIPS 2025, where it was awarded Best Dynamic Evaluation in the text-to-text track. Rather than targeting benchmark maximization, this work proposes a principled pipeline that decomposes knowledge synthesis into three coordinated phases: retrieval, curation, and composition, each governed by explicit intermediate representations and handoff contracts. Inspired by Agentic Context Engineering (ACE), the system introduces temporal-semantic reranking, bounded contradiction reconciliation, and citation-preserving composition as core architectural primitives. Competition results show that NightFeats surpasses proprietary baselines including Claude-SonnetV2 and Nova-Pro on LLM-as-a-Judge and Human Likert evaluations, confirming that architectural transparency and verifiable evidence grounding are better aligned with human preferences than systems optimizing narrowly for automatic similarity metrics.

2606.07537 2026-06-11 cs.CL cs.AI cs.LG 交叉投稿

From Architecture to Output: Structural Origins of Hallucination in Large Language Models and the Amplifying Role of Data

从架构到输出:大语言模型中幻觉的结构性起源及数据的放大作用

Md. Rejaul Korim Sadi, Toufiqur Rahman Tasin, Golam Mostofa Naeem

AI总结 本文分析大语言模型幻觉的结构性根源,指出自注意力、最大似然估计训练目标和自回归解码三个架构决策构成复合失效系统,并揭示数据病理如何放大这些脆弱性。

详情
Comments
11 pages, 7 figures, 15 references
AI中文摘要

大语言模型会产生幻觉——生成流畅、自信但事实错误的输出——这种一致性跨越代际和规模。现有分类法按输出类型对幻觉进行分类,区分内在与外在失败以及忠实性与事实性偏差。这些框架在描述上严谨,但未能识别产生特定实例的内部机制。本文将幻觉分析为三个架构决策的结构性后果,这些决策共同构成一个复合失效系统。自注意力的共现学习用统计邻近性替代语义含义,导致实体混淆、事实错误归因和语义漂移。最大似然估计训练目标在无事实约束下优化下一个词元概率,奖励统计上合理的输出,无论其真值如何。自回归解码在暴露偏差下的永久从左到右承诺确保单个错误词元级联向前传递整个输出序列而无法修正。数据集病理——长尾缺陷、训练偏差和合成污染——放大了这些脆弱性,但并非独立导致它们。我们做出三项贡献。首先,我们将每个机制映射到Alansari和Luqman分类法中的特定输出类别,将内在幻觉定位于自注意力,外在幻觉定位于MLE,逻辑不一致定位于自回归解码。其次,我们表明每个常被引用的数据集病理利用这些机制之一,而非独立产生幻觉。第三,我们识别出仅基于输出类型分类的诊断局限性,并将其与推理层缓解方法进行对比。

英文摘要

Large language models hallucinate--producing fluent, confident, factually wrong outputs--with a consistency that persists across generations and scales. Existing taxonomies classify hallucination by output type, distinguishing intrinsic from extrinsic failures and faithfulness from factuality divergence. These frameworks are descriptively rigorous but do not identify which internal mechanism produced a given instance. This paper analyses hallucination as a structural consequence of three architectural decisions that together form a compound failure system. Self-attention's co-occurrence learning substitutes statistical proximity for semantic meaning and produces entity confusion, fact misattribution, and semantic drift. The maximum likelihood estimation training objective optimises next-token probability without factual constraint, rewarding statistically plausible outputs regardless of their truth value. Autoregressive decoding's permanent left-to-right commitment under exposure bias ensures that a single wrong token cascades forward through the entire output sequence without revision. Dataset pathologies--long-tail deficiencies, training bias, and synthetic pollution--amplify these vulnerabilities but do not independently cause them. We make three contributions. First, we map each mechanism to a specific output category in the Alansari and Luqman taxonomy, locating intrinsic hallucination in self-attention, extrinsic hallucination in MLE, and logical inconsistency in autoregressive decoding. Second, we show that each commonly cited dataset pathology exploits one of these mechanisms rather than originating hallucination independently. Third, we identify the diagnostic limitation of output-type-only classification and contrast it with inference-layer mitigation approaches.

2605.04893 2026-06-11 cs.LG cs.CL stat.ML 版本更新

Self-Attention as Transport: Limits of Symmetric Spectral Diagnostics

自注意力作为传输:对称谱诊断的极限

Dominik Dahlem, Diego Maniloff, Mac Misiura

AI总结 研究语言模型注意力路由的两种失效形状(过度集中或过度分散),证明对称谱诊断对方向不敏感,并揭示因果注意力中传输容量的理论下限,提出基于容量和方向的双轴诊断方法。

详情
Comments
48 pages, 6 figures, 7 tables; 81-page online supplement (proofs, additional experiments, dataset statistics) as an ancillary file
AI中文摘要

当语言模型处理幻觉响应时,其注意力路由往往以两种形状之一失效:过度集中在狭窄的位置集合上,或者分散得如此广泛以至于相关性被稀释,而失效的形状携带诊断信号。我们研究这些形状作为诊断特征,从在基准标记响应的\emph{强制评分}下计算的注意力矩阵中得出,而不是在实时生成期间。一类广泛使用的谱方法分析度归一化注意力算子的对称分量,该算子控制传输\emph{容量};我们证明该算子的每个转置不变谱诊断在结构上是\emph{方向盲的}(它无法区分算子与其转置,因此无法检测信息流方向),并且盲定理的逆定理将任何Lipschitz诊断的转置敏感性限制为不对称系数$G$。将其与规范因果架构的闭式二分-Cheeger景观配对,我们证明均匀因果注意力满足一个与$n$无关的下界$\phi \ge 1/5$,而窗口注意力以$O(w/n)$穿透下界;失效模式在形状上不同,而不仅仅在数值上不同。这个下界是一个理想化架构的基准,而不是经验吸引子:穿透它的真实注意力头的比例本身就是一个架构特征。由此产生的双轴诊断($\phi$表示容量,$G$表示方向)产生一个可证伪的极性预测:瓶颈主导和分散主导的基准应表现出相反的极性。在长度控制评估下,传输特征在测试的仅解码器、仅编码器和编码器-解码器模型中保持可解释的信号(0.62-0.84 LC-AUROC),极性在HaluEval和MedHallu之间如预测般反转。

英文摘要

When a language model processes a hallucinated response, its attention routing tends to fail in one of two shapes: over-concentrating on a narrow set of positions, or spreading so diffusely that relevance is diluted, and the shape of the failure carries diagnostic signal. We study these shapes as a diagnostic characterization, computed from attention matrices under \emph{forced scoring} of benchmark-labeled responses rather than during live generation. A widely used family of spectral methods analyzes the symmetric component of the degree-normalized attention operator, which governs transport \emph{capacity}; we prove that every transpose-invariant spectral diagnostic of this operator is structurally \emph{orientation-blind} (it cannot distinguish an operator from its transpose, and therefore cannot detect information-flow direction), with a converse to the blindness theorem bounding any Lipschitz diagnostic's transpose sensitivity by the asymmetry coefficient $G$. Pairing this with a closed-form bipartite-Cheeger landscape for canonical causal architectures, we show that uniform causal attention satisfies an $n$-independent floor $ϕ\ge 1/5$, while window attention pierces the floor as $O(w/n)$; failure modes are shape-different, not just value-different. This floor is an idealized-architecture benchmark, not an empirical attractor: the fraction of real attention heads that pierce it is itself an architectural signature. The resulting two-axis diagnostic ($ϕ$ for capacity, $G$ for direction) yields a falsifiable polarity prediction: bottleneck- and diffuse-dominated benchmarks should exhibit opposite polarity. Under length-controlled evaluation, transport features retain interpretable signal (0.62-0.84 LC-AUROC) across the tested decoder-only, encoder-only, and encoder-decoder models, with polarity reversing as predicted between HaluEval and MedHallu.

2605.04221 2026-06-11 cs.CL cs.AI 版本更新

Self-Prompting Small Language Models for Privacy-Sensitive Clinical Information Extraction

面向隐私敏感的临床信息抽取的自提示小型语言模型

Yao-Shun Chuang, Tushti Mody, Uday Pratap Singh, Shirindokht Shiraz, Chun-Teh Lee, Ryan Brandon, Muhammad F Walji, Xiaoqian Jiang, Bunmi Tokede

AI总结 针对牙科病历中非结构化、领域特定且隐私敏感的命名实体识别挑战,提出一种本地可部署的自提示框架,通过多提示集成推理和基于QLoRA的微调及直接偏好优化,使小型语言模型在Qwen2.5-14B-Instruct上达到微宏F1分数0.864/0.837。

详情
AI中文摘要

从牙科病程记录中进行临床命名实体识别具有挑战性,因为文档高度非结构化、领域特定且通常涉及隐私敏感信息。我们开发了一个本地可部署的框架,使小型语言模型能够自行生成、验证、完善和评估实体特定提示,以从牙科记录中提取多个临床实体。利用1,200份标注记录,我们通过多提示集成推理评估了候选开放权重模型,并进一步使用基于QLoRA的监督微调和直接偏好优化对选定模型进行调整。模型性能差异显著,凸显了需要针对特定任务进行评估而非依赖通用基准。Qwen2.5-14B-Instruct取得了最强的基线性能。经过DPO后,Qwen2.5-14B-Instruct和Llama-3.1-8B-Instruct分别达到了0.864/0.837和0.806/0.797的微/宏F1分数。这些发现表明,自动提示优化结合轻量级基于偏好的后训练可以支持使用本地部署的小型语言模型进行可扩展的临床信息抽取。

英文摘要

Clinical named entity recognition from dental progress notes is challenging because documentation is highly unstructured, domain-specific, and often privacy-sensitive. We developed a locally deployable framework that enables small language models to self-generate, verify, refine, and evaluate entity-specific prompts for extracting multiple clinical entities from dental notes. Using 1,200 annotated notes, we evaluated candidate open-weight models with multi-prompt ensemble inference and further adapted selected models using QLoRA-based supervised fine-tuning and direct preference optimization. Model performance varied substantially, highlighting the need for task-specific evaluation rather than reliance on generic benchmarks. Qwen2.5-14B-Instruct achieved the strongest baseline performance. After DPO, Qwen2.5-14B-Instruct and Llama-3.1-8B-Instruct achieved micro/macro F1 scores of 0.864/0.837 and 0.806/0.797, respectively. These findings suggest that automated prompt optimization combined with lightweight preference-based post-training can support scalable clinical information extraction using locally deployed small language models.