arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.11295 2026-06-11 astro-ph.CO cs.LG 新提交

Interpretable Neural Marked Statistics for Cosmological Inference

可解释的神经标记统计用于宇宙学推断

Federico Semenzato, Benjamin D. Wandelt, Michele Liguori, Alvise Raccanelli

AI总结 提出一种神经标记方案,通过可解释的物理变换从形态学层面提取宇宙学信息,在对比学习目标下优化标记统计,显著提高对σ₈和Ωₘ的约束精度。

详情
Comments
11 pages, 6 figures. Accepted to the Workshop on AI for Physics (ICML 2026)
AI中文摘要

恢复超出功率谱的宇宙学信息是即将进行的宇宙学调查的核心目标,因为物质密度中的晚期非高斯信号无法仅通过两点统计获得。标记统计通过使用非线性函数对场进行重新加权,将部分信息折叠回两点水平。我们提出了一种神经标记方案,通过一组可解释的、物理驱动的变换来推广这一过程,这些变换直接允许在形态学层面解释宇宙学信息的增益。我们采用对比学习目标将可学习的标记摘要与底层宇宙学参数对齐。在$k_{\max}=0.2\\,h\mathrm{Mpc}^{-1}$处,与经典标记相比,我们的神经标记将$\sigma_8$的边缘化约束提高了$2.9\times$,将$\Omega_m$提高了$1.8\times$,在Fisher信息层面打破了$\Omega_m-\sigma_8$简并。它进一步将参数MSE在整个宇宙学参数先验上比最佳经典标记降低了$1.45\times$。学习到的潜在几何与参数空间中的$\Omega_m$和$\sigma_8$方向对齐,表明对比目标恢复了宇宙学信息的主导轴。我们的方法为更强大、可解释的宇宙学推断摘要统计打开了大门。

英文摘要

Recovering cosmological information beyond the power spectrum is a central goal for upcoming cosmological surveys, since late-time non-Gaussian signal in the matter density cannot be accessed through two-point statistics alone. Marked statistics fold part of this information back into the two-point level by reweighting the field with non-linear functions. We propose a neural marking scheme to generalize this process through a set of interpretable, physically motivated transformations that directly allow to interpret the gain in cosmological information at the morphological level. We employ a contrastive learning objective to align learnable marked summaries with the underlying cosmological parameters. At $k_{\max}=0.2\,h\mathrm{Mpc}^{-1}$, our neural mark tightens the marginalized constraint on $\sigma_8$ by $2.9\times$ and on $\Omega_m$ by $1.8\times$ compared to classical marks, breaking the $\Omega_m-\sigma_8$ degeneracy at the Fisher information level. It further reduces the parameter MSE across our cosmological parameter prior by $1.45\times$ over the best classical mark. The learned latent geometry aligns with the $\Omega_m$ and $\sigma_8$ directions in parameter space, indicating that the contrastive objective recovers the dominant axes of cosmological information. Our approach opens the door to more powerful, interpretable summary statistics for cosmological inference.

2606.11290 2026-06-11 cs.LG cs.AI cs.CL 新提交

FlowBank: Query-Adaptive Agentic Workflows Optimization through Precompute-and-Reuse

FlowBank: 通过预计算与复用实现查询自适应智能体工作流优化

Lingzhi Yuan, Chenghao Deng, Fangxu Yu, Souradip Chakraborty, Mohammad Rostami, Furong Huang

AI总结 提出FlowBank框架,通过预计算多样化工作流并压缩为紧凑组合,在推理时自适应选择最优工作流,平衡性能与成本,在五个基准上平均得分最高且成本可控。

详情
AI中文摘要

基于大型语言模型的多智能体系统日益强大,但当前的智能体工作流优化范式存在令人不满意的权衡。任务级方法花费大量离线计算却只部署单个工作流,导致互补候选未被使用;而查询级方法为每个查询合成新工作流,推理成本高昂。我们的动机分析表明,这些范式更多是互补而非竞争:离线搜索中发现的工作流通常解决不同子集的查询,许多由昂贵查询级生成处理的查询已经可以通过更便宜的预计算工作流解决。这暗示了一个不同的目标:与其寻找一个普遍最佳的工作流或为每个实例重新生成,不如构建一个紧凑的、可复用的互补工作流库,并在推理时自适应地选择。为此,需要解决三个耦合问题:生成互补而非冗余的候选、压缩成小型可部署组合、在性能-成本权衡下为每个查询分配正确的工作流。我们提出FlowBank,一个基于组合的智能体工作流优化的三阶段框架。多样化阶段提出DiverseFlow,引导搜索覆盖未充分覆盖的查询,产生高覆盖率的候选池。精炼阶段提出CuraFlow,将候选池压缩为冗余最小的紧凑组合。匹配阶段将部署建模为查询-工作流二分图上的边值预测,将每个传入查询路由到预测效用最佳的组合成员。在五个基准上,FlowBank在评估方法中实现了最高平均得分,同时保持成本竞争力,相比最强的自动和手工基线分别相对提升4.26%和14.92%。

英文摘要

Large Language Model (LLM)-based multi-agent systems are increasingly powerful, but current agentic workflow optimization paradigms make an unsatisfying trade-off. Task-level methods spend substantial offline compute yet deploy only a single workflow, leaving complementary candidates unused, while query-level methods synthesize a new workflow per query at substantial inference cost. Our motivating analysis shows these paradigms are more complementary than competing: workflows discovered during offline search often solve different subsets of queries, and many queries handled by expensive query-level generation can already be solved by cheaper precomputed workflows. This suggests a different objective: rather than searching for one universally best workflow or regenerating one per instance, we should build a compact bank of reusable, complementary workflows and select among them adaptively at inference time. Doing so requires solving three coupled problems: generating complementary rather than redundant candidates, compressing them into a small deployable portfolio, and assigning each query to the right workflow under a performance-cost trade-off. To this end, we present FlowBank, a three-stage framework for portfolio-based agentic workflow optimization. Diversifying proposes DiverseFlow to steer search toward under-covered queries and produce a high-coverage candidate pool. Curating proposes CuraFlow to compress this pool into a compact portfolio with minimal redundancy. Matching casts deployment as edge-value prediction on a query-workflow bipartite graph and routes each incoming query to the portfolio member with the best predicted utility. Across five benchmarks, FlowBank achieves the highest average score among the evaluated methods while remaining cost-competitive, improving over the strongest automated and handcrafted baselines by 4.26% and 14.92% relative, respectively.

2606.11287 2026-06-11 eess.IV cs.CV 新提交

Intelligent Skin Cancer Detection Using a Multispectral Metasurface and a Hybrid

基于多光谱超表面和混合深度学习的智能皮肤癌检测

Afsane Saee Arezoomand

AI总结 提出结合多光谱超表面成像与CNN-ViT混合深度学习架构,实现皮肤癌高精度检测,准确率达98%,灵敏度95%,特异性99%。

详情
Comments
8 pages
AI中文摘要

皮肤癌是全球最常见的恶性肿瘤之一,早期检测对于提高患者生存率和降低治疗成本至关重要。传统的皮肤镜和视觉成像技术主要局限于可见光谱,通常无法捕捉与早期恶性肿瘤相关的细微光谱特征。本研究提出了一种创新框架,将多光谱超表面成像与基于卷积神经网络和视觉Transformer的混合深度学习架构相结合。设计的超表面能够非侵入性地获取对组织变化高度敏感的丰富光谱信息,而混合CNN-ViT模型同时提取局部和全局特征,以稳健地对皮肤病变进行分类。基于模拟的评估表明,所提方法实现了约98%的准确率、95%的灵敏度和99%的特异性,优于传统的基于RGB和单一架构的方法。使用注意力图进行的定性分析显示,模型关注临床相关的病变区域,提高了可解释性。总体而言,结果表明,将基于超表面的多光谱成像与混合深度学习相结合,可以引入新一代皮肤病学诊断工具,并为便携、快速且高精度的临床系统铺平道路。

英文摘要

Skin cancer is among the most prevalent malignancies worldwiAdbe satnradcitts early detection is essential for improving patient survival and reducing treatment costs Conventional dermoscopic and visual imaging techniques are primarily limited to the visible spectrum and often fail to capture subtle spectral signatures associated with early stage malignancies This study proposes an innovative framework that integrates a multispectral metasurface for imaging with a hybrid deep learning architecture based on Convolutional Neural Networks and Vision Transformers The designed metasurface enables noninvasive acquisition of rich spectral information highly sensitive to tissue alterations while the hybrid CNN ViT model simultaneously extracts local and global features to robustly classify skin lesions Simulation-based evaluations demonstrate that the proposed method achieves approximately 98 accuracy 95 percentages sensitivity and 99 perentage specificity surpassing conventional RGB-based and single-architecture approaches Qualitative analyses using attention maps reveal that the model focuses on clinically relevant lesion regions improving interpretability Overall the results indicate that combining metasurface based multispectral imaging with hybrid deep learning can introduce a new generation of diagnostic tools in dermatology and pave the way for portable fast and highly accurate clinical systems

2606.11285 2026-06-11 cs.CV 新提交

EventRadar: Long-Range Visual UAV Discovery through Spatiotemporal Event Sensing

EventRadar:通过时空事件感知实现远程视觉无人机发现

Zhiting Zhou, Xingchen Liu, Xinglin Yu, Jiashen Chen, Haoyang Wang, Jingao Xu, Yunhao Liu, Xinlei Chen

AI总结 针对远程小目标无人机检测难题,提出EventRadar方法,利用事件相机捕捉螺旋桨引起的时域周期性,结合场景锚定几何证据(SAGE)和梳状引导谐波组学习迭代收缩阈值算法(CHG),在700-1500米距离上实现高精度检测。

详情
AI中文摘要

机场、公共场所及其他敏感区域周围的未经授权无人机活动使得受保护空域监测日益重要。一个实用的感知系统必须搜索广阔的角度区域,发现小型远程目标,并在限制周界被突破前返回方位支持和无人机特定证据。现有的无人机检测路径通常依赖空间组织的证据,如身体范围、轮廓或轨迹连续性。然而,在远距离上,随着目标足迹减弱和图像平面支撑缩小,这些线索变得难以保持和验证。EventRadar遵循一种互补线索:螺旋桨引起的时域周期性,最近的事件相机感知研究表明,在目标外观变弱后,这种周期性可以揭示无人机特有的运动。我们将这一线索扩展到千米级主动感知,使用事件相机原型。场景锚定几何证据(SAGE)将扫描事件与IMU姿态融合,维护一个方位索引的场景记忆,将瞬态候选支撑与持久背景杂波分离。然后,梳状引导谐波组学习迭代收缩阈值算法(CHG)将每个候选视为一个弱的高速率定时信号,并以固定计算量恢复相位不敏感的谐波证据。与相关事件相机基线在700-1500米无人机事件记录上的比较,EventRadar实现了0.990 mAP$_{.3}$和0.949 F1$_{.3}$,将FN$_{.3}$降低到0.009,并在原型分析中展示了实时可行性。

英文摘要

Unauthorized unmanned aerial vehicle (UAV) activity around airports, public venues, and other sensitive sites has made protected-airspace monitoring increasingly important. A practical sensing system must search a wide angular region, find small long-range targets, and return both bearing support and UAV-specific evidence before a restricted perimeter is breached. Existing UAV detection paths often rely on spatially organized evidence, such as body extent, silhouette, or track continuity. At long range, however, these cues become difficult to preserve and verify as the target footprint weakens and its image-plane support shrinks. EventRadar follows a complementary cue: propeller-induced temporal periodicity, which recent event-camera sensing studies have shown can reveal UAV-specific motion after appearance becomes weak. We extend this cue to kilometer-scale active sensing with an event-camera prototype. Scene-Anchored Geometry Evidence (SAGE) fuses scanning events with IMU pose to maintain a bearing-indexed scene memory, separating transient candidate support from persistent background clutter. Comb-guided Harmonic-Group Learned Iterative Shrinkage and Thresholding Algorithm (CHG) then treats each candidate as a weak high-rate timing signal and recovers phase-insensitive harmonic evidence with fixed compute. Compared with related event-camera baselines on 700-1500 m UAV event recordings, EventRadar achieves 0.990 mAP$_{.3}$ and 0.949 F1$_{.3}$, reduces FN$_{.3}$ to 0.009, and shows real-time feasibility in prototype profiling.

2606.11284 2026-06-11 cs.MA cs.GT cs.LG 新提交

Phi-Actor-Critic: Steering General-Sum Games to Pareto-Efficient Correlated Equilibria

Phi-Actor-Critic: 引导一般和博弈走向帕累托高效关联均衡

Wongyu Lee, Francesco Lelli, Omran Ayoub, Massimo Tornatore

AI总结 提出Φ-Actor-Critic框架,通过交换遗憾最小化引导多智能体学习向高社会福利的关联均衡收敛,并采用集中式注意力批评家高效估计反事实遗憾,结合拉格朗日均衡选择机制优化社会福利。

详情
Comments
Accepted to IJCAI 2026
AI中文摘要

现实世界的多智能体系统,从交通协调到资源分配,通常被建模为一般和博弈,其中个体激励与集体福利相冲突。在这些设定中,核心挑战不仅是找到均衡,而是在许多次优纳什均衡中选择社会期望的结果。标准的深度多智能体强化学习(MARL)方法难以解决这个问题,因为价值分解方法受单调性假设约束,而策略梯度方法往往收敛到稳定但社会效率低下的均衡。为了解决这一限制,我们提出了Φ-Actor-Critic(Φ-AC),一个利用交换遗憾最小化引导学习向高福利关联均衡(CE)收敛的框架。为了使反事实遗憾估计在深度MARL中易于处理,Φ-AC采用了一个集中式注意力批评家,在单次前向传播中预测向量值遗憾,避免了计算昂贵的反事实模拟。我们进一步引入了一个基于拉格朗日的均衡选择机制,通过遗憾约束优化社会福利同时确保稳定性。在矩阵博弈、多智能体粒子环境(MPE)和Melting Pot Harvest场景上的实验表明,Φ-AC在多样的混合动机设定中学习到高效且稳定的协调策略,同时保持高集体回报和竞争公平性。

英文摘要

Real-world multi-agent systems, from traffic coordination to resource allocation, are often modeled as general-sum games where individual incentives conflict with collective welfare. In these settings, the central challenge is not merely finding an equilibrium, but selecting socially desirable outcomes among many suboptimal Nash equilibria. Standard deep multi-agent reinforcement learning (MARL) methods struggle with this problem, as value-decomposition approaches are constrained by monotonicity assumptions and policy-gradient methods often converge to stable but socially inefficient equilibria. To address this limitation, we propose $\Phi$-Actor-Critic ($\Phi$-AC), a framework that leverages swap regret minimization to steer learning toward high-welfare correlated equilibria (CE). To make counterfactual regret estimation tractable in deep MARL, $\Phi$-AC employs a centralized attention critic that predicts vector-valued regrets in a single forward pass, avoiding computationally expensive counterfactual simulations. We further introduce a Lagrangian-based equilibrium selection mechanism that optimizes social welfare while enforcing stability through regret constraints. Experiments on matrix games, Multi-Agent Particle Environments (MPE), and the Melting Pot Harvest scenario demonstrate that $\Phi$-AC learns efficient and stable coordination strategies across diverse mixed-motive settings while maintaining high collective return and competitive fairness.

2606.11283 2026-06-11 cs.DS cs.LG stat.ML 新提交

Fixed-Parameter Tractability of Private Synthetic Data Generation

私有合成数据生成的固定参数可处理性

Badih Ghazi, Cristóbal Guzmán, Pritish Kamath, Alexander Knop, Ravi Kumar, Pasin Manurangsi

AI总结 研究差分隐私下合成数据生成问题,通过查询族关联图的树宽参数建立固定参数可处理性,提出两种最优算法。

详情
AI中文摘要

我们研究在差分隐私下生成合成数据的问题。我们建立了该问题的固定参数可处理性(FPT),其中参数是查询族关联图的树宽。我们的算法在所有情况下都达到最优错误率,并通过两种不同方法实现:第一种基于线性规划(LP)和LP对偶分离问题的FPT;第二种基于子采样私有乘法权重方法,其中我们获得了从吉布斯分布采样的FPT。两种方法都通过树分解上的动态规划框架统一。

英文摘要

We study the problem of generating synthetic data under differential privacy. We establish fixed-parameter tractability (FPT) for this problem where the parameter is the treewidth of the query family's incidence graph. Our algorithms attain optimal error rates across all regimes and are realized by two different approaches: the first is based on linear programming (LP) and the FPT of the separation problem for the LP dual; the second is based on a subsampled private multiplicative weights method, where we obtain FPT for sampling from Gibbs distributions. Both approaches are unified by a dynamic programming framework over a tree decomposition.

2606.11279 2026-06-11 eess.AS cs.CL cs.LG cs.SD 新提交

Massive Open-Vocabulary Keyword Spotting

大规模开放词汇关键词识别

Leonor Barreiros, Raul Monteiro, Afonso Mendes, Gonçalo M. Correia

AI总结 提出一种内存占用更小的开放词汇关键词识别系统,无需微调即可处理大规模数据库,在未见语言中达到与未压缩方案相当的实体召回率。

详情
Comments
Accepted to Interspeech 2026
AI中文摘要

自动语音识别系统在转录训练数据中罕见词汇(即专业术语)时表现不佳。开放词汇关键词识别结合上下文偏置已被证明可以缓解这一问题。然而,现有系统只能处理几百个术语的词汇表,否则会成为不可行的瓶颈。我们提出了一种系统,其存储特征的内存占用比可比基线小128倍,允许用户处理大规模数据库,同时保持开放词汇。无需微调语音识别模型,我们的系统在未见过的语言中也达到了与未压缩解决方案相当的实体召回率。

英文摘要

Automatic speech recognition systems have been shown to under-perform when it comes to transcribing words rarely seen in the training data, namely specialized terminology. Open-vocabulary keyword spotting, combined with contextual biasing, has been shown to mitigate this issue. However, existing systems can only handle glossaries of a few hundred terms without becoming an infeasible bottleneck. We propose a system that stores features with a memory footprint up to 128 times smaller than a comparable baseline and allows users to process massive databases while remaining open-vocabulary. Without fine-tuning the speech recognition model, our system achieves a comparable entity recall as uncompressed solutions, even in languages not seen during training.

2606.11274 2026-06-11 cs.MA cs.LG physics.flu-dyn 新提交

Multi-agent rendezvous in fluid flows via reinforcement learning

基于强化学习的多智能体在流体中的会合

Bocheng Li, Jingran Qiu, Lihao Zhao

AI总结 采用多智能体强化学习(MARL)在涡旋流中开发物理信息会合策略,显著提高会合率,并具有跨涡旋强度、尺度和群体规模的迁移性,通过打破状态-动作图对称性防止智能体被困在分离涡旋中。

详情
AI中文摘要

会合是多智能体系统的一项关键任务,要求智能体协调以在未指定位置相遇。然而,在流体环境中实现这一目标具有挑战性,因为尚不清楚智能体如何利用底层流体运动学来促进收敛。在本研究中,我们采用多智能体强化学习(MARL)方法在涡旋流中开发物理信息会合策略。与智能体向其对应方导航的朴素策略相比,MARL策略显著提高了会合率。MARL策略还表现出跨不同涡旋强度、涡旋尺度和群体规模的可迁移性。通过打破状态-动作图的对称性,MARL策略利用一种非直观的机制,防止智能体被困在分离的涡旋中,从而提高会合成功率。此外,从学习到的策略中提取了一种启发式策略,其性能也优于朴素策略。进一步的理论分析表明,流体变形阻碍了会合过程。大的有限时间李雅普诺夫指数识别出流体效应分离相邻智能体的区域,表明应在弱变形区域规划目标。我们的发现揭示了智能体-流体相互作用在多智能体任务中的重要作用,并突出了MARL在复杂流动环境中探索群体智能的能力。

英文摘要

Rendezvous is a critical task for multi-agent systems, requiring agents to coordinate to meet at an unspecified location. However, achieving this in fluid environments presents a challenge, as it remains unclear how agents can exploit underlying fluid kinematics to facilitate convergence. In this study, we adopt a multi-agent reinforcement learning (MARL) approach to develop physics-informed rendezvous strategies in vortical flows. Compared to a naive strategy, where agents navigate toward their counterparts, MARL strategies significantly improve the rendezvous rate. MARL strategies also show transferability across varying vortex intensities, vortex scales, and swarm sizes. By breaking the symmetry of the state-action map, MARL strategy leverages a non-intuitive mechanism that prevents agents from becoming trapped in separate vortices, thereby enhancing rendezvous success. Additionally, a heuristic strategy is extracted from the learned strategy and also outperforms the naive strategy. Furthermore, a theoretical analysis demonstrates that fluid deformation impedes the rendezvous process. Large finite-time Lyapunov exponents identify where fluid effects separate adjacent agents, suggesting that targets should be planned in weak-deformation regions. Our findings reveal the important role that agent-fluid interactions play in multi-agent tasks and highlight the MARL capability to explore swarm intelligence in complex flow environments.

2606.11270 2026-06-11 cs.LG cs.AI cs.CL 新提交

Quantifying Subliminal Behavioral Transfer Ratios in Language Model Distillation

量化语言模型蒸馏中的潜意识行为迁移比率

Uwe Konig, Hamza Kazmi, Ruizhe Li, Maheep Chaudhary

AI总结 通过控制教师模型行为强度并蒸馏学生模型,量化了潜意识行为迁移比率,发现迁移具有鲁棒性且呈现不同缩放行为。

详情
AI中文摘要

旨在将良性行为迁移到学生模型的语言模型蒸馏,也可能迁移教师模型中存在的不良特征,这种现象称为潜意识学习。虽然定性证据支持该效应的存在,但其程度尚未被系统表征。本研究通过控制两个教师模型(Llama-2-7B-Chat 和 Qwen2.5-7B-Instruct)在不同引导强度下,并仅使用良性数据蒸馏学生模型,量化了潜意识行为迁移比率。使用 GPT-4.1 作为评估器对 100 个 JailbreakBench 提示进行评估,结果表明迁移是鲁棒的,但表现出不同的缩放行为。Llama-2 表现出一个尖锐的阈值($\tau = {0.25,0.32} \ \text{beyond} \ \alpha = -0.15$),而 Qwen2.5 表现出连续且更高水平的迁移($\tau$ 高达 $0.61$)。

英文摘要

Distillation of a language model intended to transfer benign behavior to a student model may also transfer undesirable characteristics, if they are present in the teacher model, a phenomenon known as subliminal learning. While qualitative evidence supports the existence of this effect, its magnitude has not been systematically characterized. This study quantifies subliminal behavioral transfer ratios by steering two teacher models (Llama-2-7B-Chat and Qwen2.5-7B-Instruct) at varying steering strengths and distilling student models using only benign data. Evaluation on 100 JailbreakBench prompts with GPT-4.1, serving as the evaluator, indicates that transfer is robust but exhibits distinct scaling behaviors. Llama-2 demonstrates a sharp threshold ($\tau = {0.25,0.32} \ \text{beyond} \ \alpha = -0.15$), whereas Qwen2.5 displays continuous and higher levels of transfer ($\tau$ up to $0.61$).

2606.11265 2026-06-11 cs.CR cs.AI 新提交

When Poison Fails After Retrieval: Revisiting Corpus Poisoning under Chunking and Reranking Pipelines

当投毒在检索后失败:重新审视分块与重排序管道下的语料库投毒

Xi Nie, Hongwei Li, Shenghao Wu, Mingxuan Li, Jiachen Li, Wenbo Jiang

AI总结 针对RAG系统,提出CRCP框架,通过联合优化检索相关性、重排序一致性和分块边界鲁棒性,解决现有投毒方法在真实多阶段检索管道中因分块和重排序导致效果下降的问题。

详情
AI中文摘要

检索增强生成(RAG)系统容易受到语料库投毒攻击,这些攻击通过恶意知识注入操纵下游模型输出。现有研究主要在简化的检索设置下评估投毒,忽视了涉及文档分块、密集检索、重排序和基于生成的生成等实际RAG管道。在本文中,我们重新审视了在现实多阶段检索管道下的语料库投毒,并表明许多现有攻击在重排序后效果显著下降,尽管在检索阶段实现了高相关性。我们识别出检索粒度不匹配是这种失败的关键原因:文档级别的对抗信号在分块过程中经常被碎片化,而重排序器偏好局部连贯且包含答案的段落,而非全局优化的语义相似性。基于这一观察,我们提出了分块感知和重排序一致的投毒(CRCP),这是一个联合优化检索相关性、重排序一致性和分块边界鲁棒性的投毒框架。CRCP在优化过程中显式建模分块变换,以生成在变化的分块配置下仍然有效的局部自包含对抗段落。在多个检索器和重排序器的标准RAG基准上的实验表明,现有投毒方法对分块大小和重排序策略高度敏感,而CRCP在现实检索管道中实现了显著更高的攻击成功率和更强的鲁棒性。我们的发现凸显了当前RAG安全评估中的一个重要现实差距,并表明现代RAG系统中的投毒应被视为一个多阶段检索一致性问题,而不仅仅是检索问题。

英文摘要

Retrieval-Augmented Generation (RAG) systems are vulnerable to corpus poisoning attacks that manipulate downstream model outputs through malicious knowledge injection. Existing studies mainly evaluate poisoning under simplified retrieval settings, overlooking practical RAG pipelines involving document chunking, dense retrieval, reranking, and grounded generation. In this paper, we revisit corpus poisoning under realistic multi-stage retrieval pipelines and show that many existing attacks substantially degrade after reranking despite achieving high retrieval-stage relevance. We identify retrieval granularity mismatch as a key reason for this failure: document-level adversarial signals are often fragmented during chunking, while rerankers favor locally coherent and answer-bearing passages rather than globally optimized semantic similarity. Based on this observation, we propose Chunk-aware and Rerank-Consistent Poisoning (CRCP), a poisoning framework that jointly optimizes retrieval relevance, reranker consistency, and chunk-boundary robustness. CRCP explicitly models chunking transformations during optimization to generate locally self-contained adversarial passages that remain effective under varying chunking configurations. Experiments on standard RAG benchmarks with multiple retrievers and rerankers show that existing poisoning methods are highly sensitive to chunk size and reranking strategies, whereas CRCP achieves substantially higher attack success rates and stronger robustness across realistic retrieval pipelines. Our findings highlight an important realism gap in current RAG security evaluation and suggest that poisoning in modern RAG systems should be studied as a multi-stage retrieval consistency problem rather than a retrieval-only problem.

2606.11264 2026-06-11 q-bio.QM cs.AI 新提交

OmniBioTwin: A System-of-Twinned-Systems Framework for Health Digital Twins

OmniBioTwin:用于健康数字孪生的孪生系统之系统框架

Zhaohui Wang, Yu Huang, Jiang Bian

AI总结 提出OmniBioTwin框架,通过多层级网络架构中的模块化孪生体和交互算子,实现跨尺度健康数字孪生的系统级集成,并在阿尔茨海默病GLP-1信号通路中验证。

详情
AI中文摘要

健康数字孪生(HDT)有望实现患者特异性建模和决策支持,但目前的方法在结构上仍然碎片化:针对单个器官或任务的单一模型缺乏跨尺度保真度,而系统级孪生缺乏通用的架构框架。我们提出OmniBioTwin,一种孪生系统之系统(SoTS)框架,将HDT组织为模块化计算实体,通过多层网络架构中的显式交互算子进行耦合。该框架包括七个协调层——涵盖数据集成、自主孪生建模、跨尺度耦合、时间同步和人机交互决策支持。我们通过实例化阿尔茨海默病中胰高血糖素样肽-1(GLP-1)信号通路的多尺度孪生来演示OmniBioTwin,说明如何在统一系统中组合和耦合分子、细胞和器官级别的孪生。

英文摘要

Health digital twins (HDTs) promise patient-specific modeling and decision support but current approaches remain structurally fragmented: monolithic models that address a single organ or task lack cross-scale fidelity, while system-level twins lack generalizable architectural frameworks. We propose OmniBioTwin, a System-of-Twinned-Systems (SoTS) framework that organizes HDTs as modular computational entities coupled through explicit interaction operators within a multi-layer network architecture. The framework comprises seven coordinated layers - spanning data integration, autonomous twin modeling, cross-scale coupling, temporal synchronization, and human-in-the-loop decision support. We demonstrate OmniBioTwin by instantiating a multiscale twin for glucagon-like peptide-1 (GLP-1) signaling pathways in Alzheimer's disease, illustrating how molecular, cellular, and organ-level twins can be composed and coupled within a unified system.

2606.11263 2026-06-11 math.ST cs.LG math.NA math.PR 新提交

Geometric bias in eigenspace perturbation under random heterogeneous noise

随机异质噪声下特征空间扰动的几何偏差

Fengkai Liu, Ke Wang, Wanjie Wang

AI总结 针对稀疏、异质方差噪声下的信号加噪声矩阵,研究发现经验特征向量存在经典扰动界无法捕捉的系统性几何偏差,并通过二次向量方程和精细各向同性局部律推导了最优非渐近扰动界。

详情
Comments
104 pages, 1 figure
AI中文摘要

谱方法从根本上依赖于主特征空间在随机扰动下的稳定性。经典上,这种稳定性由 Davis-Kahan 和 Wedin 定理量化,这些定理利用噪声的算子范数和相关谱间隙来界定特征空间误差。虽然这些最坏情况界对于任意确定性扰动是紧的,但在低秩信号加随机噪声的设置中可能造成浪费,因为它们未能捕捉信号几何与噪声分布之间的细粒度相互作用。在本文中,我们研究了被具有任意非齐次方差剖面的稀疏随机噪声破坏的信号加噪声矩阵的谱扰动。我们证明,在异质噪声方差下,经验特征向量遭受系统性的、确定性的几何偏差,这种偏差完全不为经典扰动界所见。通过利用二次向量方程并建立精细的各向同性局部律,我们推导了在算子范数和 $2\to\infty$ 范数下前导特征空间的近最优、非渐近扰动界。这些界将通常的信噪比贡献、随机波动和由信号特征空间与行方差剖面对齐决定的结构化几何偏差项分离开来。

英文摘要

Spectral methods rely fundamentally on the stability of principal eigenspaces under random perturbations. Classically, this stability is quantified by the Davis-Kahan and Wedin theorems, which bound the eigenspace error using the operator norm of the noise and the relevant spectral gaps. While these worst-case bounds are sharp for arbitrary deterministic perturbations, they can be wasteful in the low-rank signal-plus-random-noise setting, as they fail to capture the fine-grained interaction between the signal geometry and the noise distribution. In this paper, we study the spectral perturbation of signal-plus-noise matrices corrupted by sparse, random noise with an arbitrary, inhomogeneous variance profile. We demonstrate that under heterogeneous noise variances, the empirical eigenvectors suffer a systematic, deterministic geometric bias that is entirely invisible to classical perturbation bounds. By leveraging the Quadratic Vector Equation (QVE) and establishing fine-grained isotropic local laws, we derive near-optimal, non-asymptotic perturbation bounds for the leading eigenspaces in the operator and $2\to\infty$ norms. The bounds separate the usual signal-to-noise contribution, stochastic fluctuations, and structured geometric bias terms determined by the alignment between the signal eigenspaces and the row-wise variance profile.

2606.11258 2026-06-11 cs.LG nlin.PS physics.comp-ph 新提交

Loss Landscape Diagnosis for Gradient-Based Gray-Scott System Inversion: Disentangling the Roles of PINN Components

基于梯度的Gray-Scott系统反演的损失景观诊断:解构PINN各组件的角色

Yan Yang

AI总结 通过直接反向传播稳态损失至未折叠的Gray-Scott模拟,发现优化因损失景观中的平坦高原和陡峭悬崖而失败,而PINN中的残差损失通过隐式编码完整PDE动力学避免了该病理现象。

详情
Comments
Accepted at the AI4Physics Workshop, ICML 2026 (non-archival). 14 pages, 10 figures
AI中文摘要

反应扩散系统的梯度基反演通常通过代理模型或物理信息神经网络(PINN)进行,而最直接的路径——通过PDE结构本身进行反向传播——在很大程度上被避免。我们将这条直接路径作为诊断探针,通过未折叠的Gray-Scott模拟反向传播稳态损失以恢复其参数,无需代理或神经网络增强。优化未能收敛,直接绘制损失景观将其失败定位于其几何结构——平坦高原无梯度信号,被与分岔边界对齐的陡峭悬崖所包围——这种结构在损失函数中重复出现,并且无论梯度如何路由到参数都会继承。将这一最小设置视为PINN的消融实验,我们解构了每个组件的作用:在神经网络固定的情况下,残差损失是PDE参数的二次函数,产生平滑的损失景观,因此仅凭它就能避免病理现象,通过隐式编码所有初始条件下的完整PDE动力学。而神经网络无法修复不适定的参数子空间,因此仅用于完成观测数据——这种分工此前未被明确。这些发现对PINN类方法具有具体的设计意义,并提供了关于何时添加维度实际上有帮助的更广泛启发。

英文摘要

Gradient-based inversion of reaction-diffusion systems is typically approached via surrogate models or physics-informed neural networks (PINNs), while the most direct route, backpropagation through the PDE's structure itself, has largely been avoided. We pursue this direct route as a diagnostic probe, backpropagating a steady-state loss through unrolled Gray-Scott simulation to recover its parameters, with no surrogate or neural-network augmentation. Optimization fails to converge, and plotting the landscape directly locates the failure in its geometry -- flat plateaus with no gradient signal, bounded by sharp cliffs that align with bifurcation boundaries -- a structure that recurs across loss functions and is inherited however the gradients are routed to parameters. Reading this minimal setup as an ablation of PINN, we disentangle each component's role: with the neural network fixed, the residual loss is quadratic in the PDE parameters and yields a smooth landscape, so it alone already avoids the pathology, by implicitly encoding the full PDE dynamics across all initial conditions. The neural network, for its part, cannot repair an ill-posed parameter subspace, and so serves only to complete the observed data -- a division of labor not previously made explicit. These findings carry concrete design implications for PINN-type methods and a broader heuristic on when added dimensions actually help.

2606.11256 2026-06-11 physics.chem-ph cs.LG cs.NE 新提交

My Chemical Harness: Evolutionary Molecular Design over Synthetic Pathways with Large Language Model Agents

我的化学缰绳:基于合成路径的大语言模型智能体进化分子设计

César Ojeda, Darius A. Faroughy, Maryam Karimi, Payam Zarrintaj, Mir Mehdi Seyedebrahimi, Martín Carballo-Pacheco

AI总结 提出一种以可执行合成路径为种群、大语言模型仅作策略控制器的进化框架,在可溶性环氧化物水解酶代理任务上达到最优性能。

详情
Comments
27 pages | 10 figures
AI中文摘要

当候选结构伴随可行的合成路线时,设计具有目标性质的分子最为有用。我们介绍了My Chemical Harness,一种面向目标分子设计的路线原生进化框架,其中搜索种群由可执行的合成路径而非孤立的分子图组成。每条路径由可购买的构建块和反应模板构建,通过确定性化学工具执行,并通过任务特定的分子预言机评分。大语言模型仅用作策略控制器,选择关于路径长度、移动类型、反应家族、基序和探索压力的高级偏好,而本地代码执行路径构建、验证、去重、评分、选择和记忆更新。这种分离使得大语言模型能够引导探索,同时防止其引入幻觉产物或不受支持的反应步骤。在一个可溶性环氧化物水解酶代理任务上,我们的LLM智能体优于单次LLM和确定性控制器,在sEH分数、合成可及性分数和AiZynthFinder成功率指标上达到最先进性能。这些结果表明,受约束的大语言模型智能体可以在无需训练、微调或专用生成模型的情况下,在分子发现中发挥重要作用。

英文摘要

Designing molecules with target properties is most useful when candidate structures are accompanied by feasible synthetic routes. We introduce My Chemical Harness, a route-native evolutionary framework for goal-directed molecular design in which the search population consists of executable synthetic pathways rather than isolated molecular graphs. Each route is built from purchasable building blocks and reaction templates, executed by deterministic chemistry tools, and scored through task-specific molecular oracles. Large language models (LLMs) are used only as strategy controllers that select high-level preferences over route length, move type, reaction families, motifs, and exploration pressure, while local code performs route construction, validation, deduplication, scoring, selection, and memory updates. This separation lets the LLM guide exploration without allowing it to introduce hallucinated products or unsupported reaction steps. On a soluble epoxide hydrolase proxy task, our LLM agent improves over single pass LLM and deterministic controllers, reaching state-of-the-art performance across the sEH score, synthetic accessibility score, and AiZynthFinder success rate metrics. These results suggest that constrained LLM agents can play a significant role in molecular discovery without requiring training, fine-tuning, or dedicated generative models.

2606.11247 2026-06-11 cs.LG cs.AI cs.AR 新提交

Physics-informed generative AI for semiconductor manufacturing: Enforcing hard physical constraints in generative models by construction

物理信息驱动的生成式AI在半导体制造中的应用:通过构造强制生成模型中的硬物理约束

Yaser Mike Banad, Sarah Sharif

AI总结 针对半导体制造中生成模型必须满足硬物理约束的问题,本文提出通过构造集成物理信息(如物理信息扩散、PDE约束变分模型等)来强制约束,而非事后过滤,并给出四种集成模式和未来研究方向。

详情
AI中文摘要

生成模型越来越多地被用于为物理系统提出设计、数据和控制动作,然而许多此类系统受硬物理约束而非感知合理性支配。半导体制造提供了一个严苛的测试案例:生成的掩模、布局、合成缺陷数据和工艺配方必须遵守光刻、传输、反应和器件物理约束,因为物理无效的样本不仅质量低劣,而且无法使用。本文认为,半导体制造揭示了一个更广泛的计算科学挑战,即用于受约束物理领域的生成式AI必须通过构造实现物理信息驱动,而非仅通过事后过滤来纠正。我们调查了新兴的架构工具包,包括物理信息扩散、PDE约束变分模型、神经算子先验和守恒律尊重生成网络,并展示了它如何与可微分光刻、TCAD、工艺仿真和自主实验相联系。我们识别了生成模型与基于物理的模拟器之间的四种集成模式,并提出了一个以物理保真度基准、可微分模拟器基础设施以及面向物理设计和制造的多模态基础模型为中心的研究议程。核心主张是分析性的而非修辞性的:在物理有效性是成功的关键标准的情况下,通过构造强制约束的架构应被期望优于事后过滤的架构,而晶圆厂正是这种区别最鲜明的环境。

英文摘要

Generative models are increasingly used to propose designs, data, and control actions for physical systems, yet many such systems are governed by hard physical constraints rather than by perceptual plausibility. Semiconductor manufacturing provides a demanding test case: generated masks, layouts, synthetic defect data, and process recipes must obey lithography, transport, reaction, and device-physics constraints, because physically invalid samples are not merely low quality but unusable. This Perspective argues that semiconductor manufacturing exposes a broader computational-science challenge, namely that generative AI for constrained physical domains must be physics-informed by construction, not corrected only through post-hoc filtering. We survey the emerging architectural toolkit, including physics-informed diffusion, PDE-constrained variational models, neural-operator priors, and conservation-law-respecting generative networks, and show how it connects to differentiable lithography, TCAD, process simulation, and autonomous experimentation. We identify four integration patterns between generative models and physics-based simulators, and we propose a research agenda centered on physics-fidelity benchmarks, differentiable simulator infrastructure, and multimodal foundation models for physical design and manufacturing. The central claim is analytical rather than rhetorical: where physical validity is the binding criterion of success, architectures that enforce it by construction should be expected to outperform those that filter for it after the fact, and the fab is the setting where this distinction is sharpest.

2606.11245 2026-06-11 cs.AI cs.NE q-bio.NC 新提交

Position: Hippocampal Explicit Memory Is the Cornerstone for AGI

立场:海马体显式记忆是通用人工智能的基石

Sangjun Park

AI总结 本文主张,将显式记忆整合到大语言模型中是迈向通用人工智能的关键,因为LLM的学习机制类似人类内隐记忆,而高阶认知功能依赖海马体显式记忆。

详情
Comments
Accepted to ICML 2026 (Position Paper Track)
AI中文摘要

大语言模型(LLM)在各种任务中展现了卓越的能力,提升了人们对通用人工智能(AGI)的期望。这篇立场论文认为,整合显式记忆是推动LLM迈向AGI的基石。关键原因在于,LLM的底层学习机制与人类内隐记忆高度相似。然而,AGI所需的高阶认知功能,如长期战略规划、元认知和符号推理,严重依赖海马体显式记忆,无法仅从内隐统计学习中产生。借鉴神经科学的发现,我提出这一观点,并辅以人工显式记忆系统的计算要求,希望促进进一步研究,为显式记忆整合奠定基础。

英文摘要

Large Language Models (LLMs) have demonstrated remarkable capabilities across various tasks, raising expectations for Artificial General Intelligence (AGI). This position paper argues that integrating explicit memory is the cornerstone for advancing LLMs toward AGI. The key reason is that the underlying learning mechanism of LLMs is highly analogous to human implicit memory. However, higher-order cognitive functions necessary for AGI, such as long-term strategic planning, metacognition, and symbolic reasoning, heavily rely on hippocampal explicit memory and cannot arise solely from implicit statistical learning. Drawing on findings from neuroscience, I advance this perspective and complement it with computational requirements for artificial explicit memory systems, hoping to foster further research and lay the groundwork for explicit memory integration.

2606.11244 2026-06-11 cs.AR cs.AI 新提交

SPEAR: A System for Post-Quantization Error-Adaptive Recovery Enabling Efficient Low-Bit LLM Serving

SPEAR: 一种后量化误差自适应恢复系统,实现高效低比特LLM服务

Hongyuan Liu, Yawei Li, Zhiqiang Que, Qinli Yang, Junming Shao, Guosheng Hu

AI总结 针对低比特量化导致LLM质量下降的问题,提出SPEAR系统,通过输入感知的门控误差补偿器(EC)选择性修正高误差层,结合自适应内核融合调度和SLO感知调度器,在<1%内存开销下恢复W4与FP16之间56-75%的困惑度差距。

详情
AI中文摘要

高效的大语言模型(LLM)服务日益受到部署成本的制约。量化是降低服务成本的关键技术,但即使是最先进的4比特量化器,其与FP16之间仍存在显著的质量差距,尤其是在低比特服务最有利的小型模型中。我们发现这一差距的根本原因:量化误差高度依赖于输入,且在不同token之间差异显著,而现有的后量化补偿方法是静态的,对所有输入应用相同的修正。结果,简单token被过度修正,而困难token则修正不足。我们提出SPEAR,一种后量化误差自适应恢复系统,用于改进低比特LLM服务。SPEAR引入了由逐token门控调制的轻量级误差补偿器(EC),并将其仅放置在通过CKA引导的熵感知诊断识别出的最误差敏感层。这将少量参数预算集中在最有效的位置。EC的高效部署带来了若干系统挑战,包括额外计算、由输入相关门控引起的张量并行同步,以及跨配置的延迟不稳定。SPEAR通过自适应内核融合调度解决了这些问题,结合了后同步集成规约内核与P2P双写,将EC后计算融合到低比特GEMM中,并采用SLO约束的EC感知调度器以实现可预测的服务性能。在具有挑战性的逐通道量化设置中,SPEAR恢复了W4与FP16之间56-75%的困惑度差距,同时增加了不到1%的模型内存开销,并保持了与广泛使用的4比特服务部署相当的延迟。

英文摘要

Efficient large language model (LLM) serving is increasingly constrained by deployment cost. Quantization is a key technique for reducing serving cost, yet even state-of-the-art 4-bit quantizers exhibit a noticeable quality gap from FP16, particularly for smaller models where low-bit serving is most beneficial. We identify a fundamental cause of this gap: quantization error is highly input-dependent and varies substantially across tokens, while existing post-quantization compensation methods are static and apply identical corrections to all inputs. As a result, easy tokens are over-corrected while hard tokens remain under-corrected. We present SPEAR, a system for post-quantization error-adaptive recovery that improves low-bit LLM serving. SPEAR introduces lightweight Error Compensators (ECs) modulated by per-token gates and places them only at the most error-sensitive layers identified through a CKA-guided entropy-aware diagnostic. This focuses a small parameter budget where it is most effective. Efficient deployment of ECs presents several systems challenges, including additional computation, tensor-parallel synchronization caused by input-dependent gating, and latency instability across configurations. SPEAR addresses these issues through adaptive kernel-fusion dispatch, combining an epilogue-integrated peer-reduction kernel with P2P dual-write to fuse the post-EC computation into low-bit GEMMs, and an SLO-constrained EC-aware scheduler for predictable serving performance. Across challenging per-channel quantization settings, SPEAR recovers 56-75% of the perplexity gap between W4 and FP16 while adding less than 1% model memory overhead and maintaining latency comparable to a widely used 4-bit serving deployment.

2606.11236 2026-06-11 cs.NE cs.CV cs.LG 新提交

A2SG:Adaptive and Asymmetric Surrogate Gradients for Training Deep Spiking Neural Networks

A2SG:用于训练深度脉冲神经网络的适应性和非对称替代梯度

Yechan Kang, Yongjin Kweon, Mingyeong Seo, Sohee Park, Yeonguk Jeon, Jongkil Park, Hyun Jae Jang, Jaewook Kim, YeonJoo Jeong, Suyoun Lee, Seongsik Park

AI总结 提出适应性和非对称替代梯度(A2SG)框架,通过自适应窗口调整梯度方向一致性、非对称梯度反映神经元动态,降低梯度变化并促进收敛到平坦最小值,在多种SNN模型和任务上提升精度与能效。

详情
Comments
Accepted at ICML 2026
AI中文摘要

由于替代梯度导致的尖锐损失景观和时间不一致性,训练深度脉冲神经网络(SNN)仍然具有挑战性。为了解决这些问题,我们提出了一个统一框架:适应性和非对称替代梯度A2SG。适应性梯度调整一个有效窗口以实现时空适应,减少空间梯度变化并保持梯度随时间的方向一致性。非对称梯度通过为具有更高膜电位的神经元分配更大的梯度来反映神经元动态,并且我们证明它们比对称替代梯度产生更低的方差。我们的分析进一步建立了局部梯度变化与损失景观曲率之间的直接联系,为A2SG如何促进收敛到更平坦的最小值并改善泛化提供了原理性解释。我们在多种模型上进行了广泛实验,包括基于CNN和基于Transformer的SNN,涉及各种任务,如使用静态和神经形态数据集的图像分类以及分割。结果表明,A2SG持续提高了准确性和能效,使其成为训练深度SNN的通用且可靠的解决方案。我们的代码可在以下网址获取:此 https URL。

英文摘要

Training deep spiking neural networks (SNNs) remains challenging due to sharp loss landscapes and temporal inconsistency caused by surrogate gradients. To address these challenges, we propose a unified framework: adaptive and asymmetric surrogate gradients A2SG. The adaptive gradients adjust an effective window for spatio-temporal adaptation, reducing spatial gradient variation and maintaining directional consistency of gradients over time. The asymmetric gradients reflect neuronal dynamics by assigning larger gradients to neurons with higher membrane potentials, and we prove that they yield lower variation than symmetric surrogates. Our analysis further establishes a direct connection between local gradient variation and the curvature of the loss landscape, providing a principled explanation for how A2SG promotes convergence to flatter minima and improves generalization. We conduct extensive experiments on diverse models, including CNN-based and Transformer-based SNNs, across various tasks such as image classification using both static and neuromorphic datasets, as well as segmentation. The results demonstrate that A2SG consistently improves accuracy and energy efficiency, establishing it as a general and reliable solution for training deep SNNs. Our code is available at this https URL.

2606.11221 2026-06-11 cs.CV 新提交

LAST: Bridging Vision-Language and Action Manifolds via Gromov-Wasserstein Alignment

LAST: 通过Gromov-Wasserstein对齐连接视觉-语言与动作流形

Huaihai Lyu, Chaofan Chen, Yuheng Ji, Xiansheng Chen, Pengwei Wang, Shanghang Zhang, Changsheng Xu

AI总结 提出LAST方法,通过李代数线性化和局部度量离散化,对齐视觉-语言语义几何与动作流形,解决异构空间不兼容问题,提升VLA模型收敛性和泛化性。

详情
AI中文摘要

我们从Gromov-Wasserstein视角研究视觉-语言-动作(VLA)学习,目标是使动作表征的关系几何与VL嵌入的语义几何兼容。然而,由于领域间的数学异质性,这种对齐并非易事:视觉-语言的语义空间在拓扑上是线性和各向同性的,而机器人动作的物理流形是非欧几里得和各向异性的。它们不兼容的度量结构使得直接回归不适定。为了解决这种不兼容性,我们引入了LAST(李代数动作空间分词器),它通过两阶段变换重建动作空间以建立与VL模态的局部度量兼容性:(1)全局拓扑线性化:通过李代数映射线性化动作流形,将轨迹转换为固定长度、物理可加的表示。(2)局部度量离散化:将表示分层离散化为模式和白化残差,生成近似各向同性的局部图表,这些图表在统计上与语义度量对齐。通过在全局和局部层面解决结构不匹配问题,LAST使VLA模型具有更优的收敛性和泛化性。

英文摘要

We take a Gromov-Wasserstein perspective on Vision-Language-Action (VLA) learning, where the goal is to make the relational geometry of action representations compatible with the semantic geometry of VL embeddings. However, this alignment is non-trivial due to the mathematical heterogeneity between the domains: the semantic space of vision-language is topologically linear and isotropic, whereas the physical manifold of robotic action is non-Euclidean and anisotropic. Their disjoint metric structures render direct regression ill-posed. To resolve this incompatibility, we introduce LAST (Lie-algebraic Action Space Tokenizer), which reconstructs the action space to establish local metric compatibility with the VL modality via a two-stage transformation: (1) Global Topological Linearization: linearizing the action manifold via Lie-algebraic mapping, converting trajectories into a fixed-length, physically additive representation. (2) Local Metric Discretization: hierarchically discretizing the representation into schemas and whitened residuals, yielding approximately isotropic local charts that are statistically aligned with the semantic metric. By resolving the structural mismatch at both global and local levels, LAST enables VLA models with superior convergence and generalizability.

2606.11220 2026-06-11 cs.CL 新提交

LifeSentence: Language models can encode human life course trajectories from longitudinal panel data

LifeSentence: 语言模型可以从纵向面板数据编码人类生命历程轨迹

Samuel Liu, Muchen Xi, William Yeoh, Joshua J. Jackson

AI总结 提出LifeSentence模型,将大型语言模型与纵向面板数据结合,通过结构化自然语言记录生命事件并微调预训练模型,在少样本条件下超越传统方法,实现生命事件预测与时间顺序重建。

详情
AI中文摘要

预测人类生命结果对于理解个体如何获得长寿健康的生活至关重要。传统的统计方法准确度有限,可能是因为忽略了生命历程的序列结构。现代方法如Transformer架构需要大规模训练数据,而大多数纵向面板研究缺乏此类数据。本文介绍LifeSentence,一种将大型语言模型与纵向面板数据相结合的生命历程推理模型。通过将每个生命事件表示为结构化的自然语言记录,并在一个包含预测、鲁棒性和推理的18任务评估分类体系上对预训练的240亿参数语言模型进行指令微调,LifeSentence利用预训练期间已编码的分布知识补充面板数据。该模型在来自德国社会经济面板的约65,000名个体上训练——比之前基于Transformer的方法少约45倍——在所有任务族上均优于经典和深度学习基线,在联合事件与时间预测上相比最佳基线实现三倍改进,并在从去除时间戳的事件集重建时间顺序时达到91.2%的Kendall tau系数。在没有显式监督的情况下,该模型仅从离散事件序列中恢复出记录的社会分层模式,包括教育溢价、性别工资差距和母亲惩罚。自然语言接口进一步支持定性新研究查询,例如将早期生活史连接到指定的晚年终点,使LifeSentence成为预测工具和对人类传记进行反事实探索的探针。

英文摘要

Forecasting human life outcomes is important to gain insights into how individuals attain long and healthy lives. Conventional statistical approaches yield limited accuracy, potentially due to discarding the sequential structure of the life course. Modern methods such as transformer architectures require large scale training data that most longitudinal panel studies lack. Here we introduce LifeSentence, a model for life-course reasoning that bridges large language models with longitudinal panel data. By representing each life event as a structured natural-language record and instruction-tuning a pretrained 24-billion-parameter language model across an 18-task evaluation taxonomy spanning prediction, robustness and reasoning, LifeSentence supplements panel data with distributional knowledge already encoded during pretraining. Trained on approximately 65,000 individuals from the German Socio-Economic Panel - roughly 45 times fewer than prior transformer-based approaches - LifeSentence outperforms classical and deep learning baselines across all task families, achieving a threefold improvement in joint event-and-timing prediction from best baselines and 91.2% Kendall's tau when reconstructing chronological order from timestamp-stripped event sets. Without explicit supervision, the model recovers documented patterns of social stratification, including the education premium, the gender wage gap and the motherhood penalty, from discrete event sequences alone. A natural-language interface further enables qualitatively new research queries, such as connecting an early-life history to a specified late-life endpoint, establishing LifeSentence as both a predictive tool and a probe for counterfactual exploration of human biographies.

2606.11218 2026-06-11 cs.CY cs.AI 新提交

An Ethical eValuation Agent (EeVA): Results of a Proof-of-Concept Test on a Prototype Agentic-like Workflow to Assist Ethical Deliberations

伦理评估代理(EeVA):在原型类代理工作流中辅助伦理审议的概念验证测试结果

Stephen Milford, B. Zara Malgir, Miguel Vazquez

AI总结 提出基于LLM的类代理工作流EeVA,通过10种伦理框架评估用例,生成结构化评估与综合,促进伦理反思而非给出绝对答案,在三个案例中验证了可行性。

详情
AI中文摘要

伦理审议常被误解为寻找单一对错答案,这给必须应对伦理挑战的非伦理专业人员带来困难。我们开发了EeVA,一种基于LLM的类代理工作流,旨在支持比较性伦理反思而非提供确定性伦理答案。EeVA使用n8n编程,包含三个互连工作流:启动器、工作器和发射器。它通过评估器和综合提示,根据10种伦理框架评估上传的用例。概念验证测试使用了来自城市交通、点对点能源交易和社会服务资源分配的三个已发表案例。在所有案例中,EeVA生成了结构一致的框架特定评估和综合报告。输出区分了不同框架,识别了收敛和分歧,提出了增加一致性的修改建议,并突出了持续的伦理张力。综合报告对非专业人士可读,并将注意力从简单答案转向设计条件、保障措施以及跨框架完全一致不太可能的领域。研究结果表明,LLM可以被组织成可用的工作流,在保留伦理多元性的同时,帮助弥合伦理学家与非伦理专业人员之间的沟通差距。EeVA的价值不在于取代伦理学家或解决道德分歧,而在于构建结构化的伦理审议。EeVA为在伦理专业知识有限的情况下支持伦理反思提供了一个有前景的概念验证。在成为成熟工具之前,还需要在可重复性、人工评估、用户测试和效率方面进行进一步工作。

英文摘要

Ethical deliberation is often misunderstood as a search for single right or wrong answers, creating difficulties for non-ethically trained personnel who must address ethically laden challenges. We developed EeVA, an agentic-like LLM-based workflow designed to support comparative ethical reflection rather than deliver definitive ethical answers. EeVA was programmed in n8n using three interconnected workflows: starter, worker, and emitter. It evaluated uploaded use cases against 10 ethical frameworks through evaluator and synthesis prompts. Proof-of-concept testing used three published cases from urban mobility, peer-to-peer energy trading, and social-service resource allocation. Across all cases, EeVA produced consistently structured framework-specific evaluations and integrated syntheses. Outputs differentiated between frameworks, identified convergences and divergences, recommended modifications to increase alignment, and highlighted persistent ethical tensions. Syntheses were readable for non-specialists and shifted attention away from simplistic answers toward design conditions, safeguards, and areas where full cross-framework agreement was unlikely. The findings suggest that LLMs can be organised into usable workflows that preserve ethical plurality while helping bridge the communicative gap between ethicists and non-ethically trained personnel. EeVA's value lies not in replacing ethicists or resolving moral disagreement, but in scaffolding structured ethical deliberation. EeVA offers a promising proof of concept for supporting ethical reflection where access to ethics expertise is limited. Further work is needed on reproducibility, human evaluation, user testing, and efficiency before it can be considered a mature tool.

2606.11217 2026-06-11 cs.CY cs.AI cs.HC 新提交

Preregistration for Experiments with AI Agents

AI智能体实验的预注册

Michelle Vaccaro

AI总结 针对AI智能体实验中的方法论漏洞,提出将预注册实践扩展至该领域,并设计专用模板以提升研究可信度。

详情
Comments
Accepted at ICML 2026 as a Spotlight (Top 5%) Position Paper
AI中文摘要

大型语言模型(LLM)和自主AI智能体的普及催生了一种快速发展的方法论范式:“计算机内”行为实验。最初,这种方法被设想为在认知、决策和社会动态研究中,使用AI智能体作为人类参与者的替代品,但现在它已具有新的意义——随着AI智能体越来越多地代表个人和组织进行谈判、交易和做出重大决策,理解它们的行为本身已成为研究重点。虽然这些AI智能体实验在可扩展性、成本效益和实验控制方面提供了前所未有的优势,但它们也继承并有时放大了长期困扰人类受试者研究的方法论漏洞。为解决这些问题,本文主张,预注册实践——对于提高人类受试者实验的可信度至关重要——现在应扩展到AI智能体实验。我们系统地列举了AI智能体实验引入的研究者自由度——例如模型选择、提示措辞、设置和基于结果的重新设计——并展示了低迭代成本和缺乏报告规范如何使这些选择既容易被利用又难以被检测。我们提出了一个针对AI智能体实验的预注册模板,并呼吁会议、期刊和资助机构将预注册作为这一新兴研究范式的标准实践。

英文摘要

The proliferation of large language models (LLMs) and autonomous AI agents has given rise to a rapidly growing methodological paradigm: "in silico" behavioral experiments. Originally conceived as a way to use AI agents as proxies for human participants in studies of cognition, decision-making, and social dynamics, this approach has taken on new significance -- as AI agents increasingly negotiate, transact, and make consequential decisions on behalf of people and organizations, understanding their behavior has become a research priority in its own right. While these experiments with AI agents offer unprecedented advantages in terms of scalability, cost efficiency, and experimental control, they also inherit, and in some cases amplify, methodological vulnerabilities that have long plagued human subjects research. To address these issues, this paper argues that preregistration practices -- central to improving the credibility of human subjects experiments -- should now be extended to experiments with AI agents. We systematically catalog the researcher degrees of freedom that experiments with AI agents introduce -- model selection, prompt wording, settings, and outcome-contingent redesign, for example -- and show how the low cost of iteration and lack of reporting norms make these choices both easy to exploit and difficult to detect. We propose a preregistration template tailored to experiments with AI agents and call on conferences, journals, and funding agencies to make preregistration standard practice for this emerging research paradigm.

2606.11215 2026-06-11 cs.CY cs.AI 新提交

The Environmental Cost of LLMs in AIED: Reporting and Practices

AIED中LLMs的环境成本:报告与实践

Sabrina C. Eimler, Lukas Erle, Daniel Flood, Aditi Haiman, Luca Häckert, André Helgert, Lachlan McGinness, Büsra Yapici

AI总结 针对AIED社区缺乏LLM计算与环境成本标准化报告的问题,提出开源方法测量并报告碳排放,包括本地和云端硬件,以及未知参数的前沿LLM计算开销公式。

详情
AI中文摘要

近年来,大型语言模型(LLM)在人工智能教育(AIED)社区中的使用越来越广泛。虽然LLM为学习者和教育者提供了独特的途径,但使用LLM会带来计算和环境成本。由于缺乏标准化程序来测量和报告这些影响,这些成本大多被隐藏。为了解决这一差距,我们首先对AIED 2025会议论文集的所有论文进行了文献综述,确定是否以及如何报告LLM的计算或环境成本。大多数项目使用LLM,但很少报告使用的计算资源,几乎没有将LLM的环境影响作为伦理问题讨论。为了解决缺乏标准化报告实践的问题,我们提出了一种开源方法,用于系统测量和报告LLM的计算开销以及运行机器学习(ML)AIED系统的环境影响。我们提供了测量本地和云端硬件碳足迹的软件解决方案。我们还提供了一个易于使用的公式,用于计算前沿LLM的计算开销,即使确切的参数数量未知。总体而言,我们希望激励同事们使用我们的方法,在AIED社区中争取更透明地报告使用LLM的隐藏成本。

英文摘要

Large Language Model (LLM) usage in recent years has become increasingly widespread in the Artificial Intelligence in Education (AIED) community. While LLMs offer unique avenues for learners and educators, using LLMs comes with computational and environmental costs. These costs are mostly hidden due to a lack of standardised procedures to measure and report these impacts. To address this gap, we first conducted a literature review of all papers published as part of the AIED 2025 conference proceedings, determining if and how computational or environmental costs of LLMs are reported. Most projects use LLMs, but few report computational resources used and almost none discuss environmental impacts of LLMs as an ethical concern. To address this lack of standardised reporting practices, we propose an open-source method for systematically measuring and reporting the computational expense of LLMs and environmental impact of running Machine Learning (ML) AIED systems. We provide software solutions to measure the carbon footprint for both local and cloud based hardware. We also provide an easy-to-use formula to calculate the computational expense of frontier LLMs even when the exact number of parameters is not known. Overall, we hope to motivate colleagues to use our method to strive for more transparent reporting of hidden costs of using LLMs in the AIED community.

2606.11214 2026-06-11 cs.CY cs.AI cs.HC 新提交

From Awareness to Action: Understanding and Overcoming the Research-Practice Gap in Algorithmic Fairness for Public Health

从意识到行动:理解并克服公共卫生算法公平性中的研究-实践差距

Sara Altamirano, Tijs Portegies, Sennay Ghebreab

AI总结 通过混合方法研究,揭示算法公平性在公共卫生ML应用中从意识到行动的差距,提出Fairness-to-Action框架,整合方法、组织和系统维度,指出公平性制度化薄弱、翻译机制外部驱动及系统优先性偏重准确性的问题。

详情
Comments
Extended version of an accepted IASEAI'26 paper; includes technical appendices. 22 pages, 2 figures
AI中文摘要

算法公平性对于负责任的机器学习驱动的公共卫生研究至关重要,但其实际实施仍然有限。为了调查这种意识-行动差距,我们进行了一项顺序混合方法研究,包括专家访谈、在线调查和系统映射。专家访谈为调查设计提供了信息,调查揭示了公平性的碎片化定义、有限的培训和指导、对外部来源的依赖以及正式评估、缓解或监测的罕见使用。这些发现随后被映射到三个既定的研究-实践差距视角:知识-实践差距、知识到行动循环和知道-做差距,每个视角提供了互补的观点。基于这一综合,我们引入了公平到行动框架,该框架整合了方法、组织和系统维度,以识别算法公平性知识转化停滞的位置。我们的分析表明,公平性仍然制度化薄弱,转化机制由外部驱动,系统级优先事项继续强调准确性而非公平性。这些见解为推进安全、公平和道德的机器学习驱动的公共卫生研究实践提供了关键杠杆点。

英文摘要

Algorithmic fairness is essential for responsible ML-driven public health research, yet its practical implementation remains limited. To investigate this awareness-action gap, we conducted a sequential mixed-methods study comprising expert interviews, an online survey, and systematic mapping. The expert interviews informed the design of the survey, which in turn revealed fragmented definitions of fairness, limited training and guidance, reliance on external sources, and rare use of formal assessment, mitigation, or monitoring. These findings were subsequently mapped onto three established research-practice gap lenses: the Knowledge-Practice Gap, the Knowledge-to-Action Cycle, and the Knowing-Doing Gap, each offering complementary perspectives. Building on this synthesis, we introduce the Fairness-to-Action framework, which integrates methodological, organizational, and systemic dimensions to identify where translation of algorithmic fairness knowledge stalls. Our analysis shows that fairness remains weakly institutionalized, translation mechanisms are externally driven, and system-level priorities continue to emphasize accuracy over fairness. These insights suggest critical leverage points for advancing safe, fair, and ethical ML-driven public health research practice.

2606.11205 2026-06-11 cs.LG cs.AI cs.CL 新提交

Dual-Stance Evaluation of Sycophancy: The Structure of Agreement and the Limits of Intervention

谄媚的双立场评估:同意的结构与干预的局限

Matthew James Buchan

AI总结 提出双立场评估方法,发现激活引导在减少谄媚时也会抑制对事实正确陈述的同意,揭示了表示可读但不可写的普遍差距。

详情
Comments
18 pages, 9 figures, accepted to TAIS 2026
AI中文摘要

激活引导可以改变LLM的行为,但标准评估通常不测试减少谄媚的方向是否也抑制对事实正确陈述的同意。我们引入了双立场评估,测试每个话题的两个立场,并将其应用于Llama-3-8B-Instruct上的质心差引导。我们发现一种分离:模型在几何上不同的子空间中表示谄媚和事实同意,但引导方向在两者上的投影相等,无法差异化地针对任一。因此,该方向同样减少对事实正确陈述(例如地球是圆的)和谄媚陈述的同意。两个激活组的所有其他静态属性都匹配,表明行为分离源于生成动态或残差流分析无法解析的更细粒度结构。该模式说明了一个普遍差距:从激活中可读的表示可能无法通过它们写入。

英文摘要

Activation steering can shift LLM behaviour, but standard evaluations do not typically test whether a sycophancy-reduction direction also suppresses agreement with factually correct statements. We introduce dual-stance evaluation, which tests both stances of each topic, and apply it to centroid-difference steering on Llama-3-8B-Instruct. We find a dissociation: the model represents sycophantic and factual agreement in geometrically distinct subspaces, yet the steering direction projects equally onto both and cannot differentially target either. The direction accordingly reduces agreement with factually correct statements (e.g. that the Earth is round) as well as sycophantic ones. All other static properties of the two activation groups are matched, suggesting the behavioural dissociation arises from generation dynamics or from finer-grained structure that residual-stream analysis cannot resolve. The pattern illustrates a general gap: representations that are readable from activations may not be writable through them.

2606.11199 2026-06-11 cs.CL cs.AI cs.IR cs.LG 新提交

NightFeats @ MMU-RAGent NeurIPS 2025: A Context-Optimized Multi-Agent RAG System for the Text-to-Text Track

NightFeats @ MMU-RAGent NeurIPS 2025: 面向文本到文本轨道的上下文优化多智能体RAG系统

Quentin Fever, Naziha Aslam

AI总结 提出一种结构化多智能体RAG系统NightFeats,通过检索、策展和组合三阶段分解知识合成,引入时序语义重排序、矛盾协调和引用保留架构,在MMU-RAGent竞赛中超越商业基线。

详情
Comments
5 pages, 1 figure, 1 table. NeurIPS 2025 Competition Track (MMU-RAGent). System developed October 2025
AI中文摘要

我们提出NightFeats,一个结构化的多智能体检索增强生成(RAG)系统,提交至NeurIPS 2025的MMU-RAGent竞赛,并在文本到文本轨道中获得最佳动态评估奖。本文并非以基准最大化目标,而是提出一个原则性流水线,将知识合成为三个协调阶段:检索、策展和组合,每个阶段由显式的中间表示和交接契约控制。受智能体上下文工程(ACE)启发,该系统引入时序语义重排序、有界矛盾协调和保留引用的组合作为核心架构原语。竞赛结果表明,NightFeats在LLM-as-a-Judge和人类Likert评估中超越了包括Claude-SonnetV2和Nova-Pro在内的商业基线,证实了架构透明性和可验证证据基础比单纯优化自动相似度指标的系统更符合人类偏好。

英文摘要

We present NightFeats, a structured multi-agent retrieval-augmented generation (RAG) system submitted to the MMU-RAGent competition at NeurIPS 2025, where it was awarded Best Dynamic Evaluation in the text-to-text track. Rather than targeting benchmark maximization, this work proposes a principled pipeline that decomposes knowledge synthesis into three coordinated phases: retrieval, curation, and composition, each governed by explicit intermediate representations and handoff contracts. Inspired by Agentic Context Engineering (ACE), the system introduces temporal-semantic reranking, bounded contradiction reconciliation, and citation-preserving composition as core architectural primitives. Competition results show that NightFeats surpasses proprietary baselines including Claude-SonnetV2 and Nova-Pro on LLM-as-a-Judge and Human Likert evaluations, confirming that architectural transparency and verifiable evidence grounding are better aligned with human preferences than systems optimizing narrowly for automatic similarity metrics.

2606.11197 2026-06-11 eess.AS cs.AI cs.CL cs.SD 新提交

MA-DLE: Speech-based Automatic Depression Level Estimation via Memory Augmentation

MA-DLE: 基于记忆增强的语音自动抑郁程度估计

Xuzhi Wang, Xinran Wu, Ziping Zhao, Jianhua Tao, Björn W. Schuller

AI总结 提出记忆增强特征方法,通过选择性整合历史时序特征和动态记忆特征,结合层次注意力融合模块,在DAIC-WOZ和E-DAIC数据集上实现最优性能。

详情
Comments
Accepted at IEEE TAC
AI中文摘要

基于语音的抑郁程度自动估计对于实现早期检测和及时干预至关重要,尤其是在资源受限的心理健康环境中。近年来,深度学习在包括情感计算和心理健康评估在内的多个领域取得了显著成功。现有方法大多依赖基于RNN的架构(如LSTM和GRU)来建模时间信息以进行抑郁估计。然而,提取的特征往往只强调少数相邻语音片段,限制了其捕捉长程依赖的能力。为克服这一局限,我们引入了一种基于记忆的特征增强方法,以增强GRU提取特征的表示能力。我们的记忆库并非不加区分地整合历史数据,而是设计为选择性整合两类组件以减少冗余和不相关性:(1) 与当前GRU输出高度相似的历史时序特征,提供互补的上下文信息;(2) 基于特征变异性识别的动态记忆特征,捕捉指示抑郁症状的行为和情绪波动。为有效融合记忆增强特征与GRU输出,我们进一步设计了层次注意力融合(HAF)模块。我们的方法在广泛使用的DAIC-WOZ和E-DAIC数据集上进行了评估,取得了最先进的性能。

英文摘要

Speech-based automatic estimation of depression levels is essential for enabling early detection and timely intervention, particularly in resource-constrained mental health settings. In recent years, deep learning has demonstrated impressive success across various domains, including affective computing and mental health assessment. Most existing approaches rely on RNN-based architectures (such as LSTM and GRU) to model temporal information for depression estimation. However, the extracted features often emphasize only a few adjacent speech segments, limiting their ability to capture long-range dependencies. To overcome this limitation, we introduce a memory-based feature augmentation method that enhances the representational capacity of GRU-extracted features. Rather than indiscriminately incorporating historical data, our memory bank is designed to selectively integrate two types of components in order to reduce redundancy and irrelevance: (1) historical temporal features that closely resemble the current GRU output, offering complementary contextual information; and (2) dynamic memory features identified based on feature variability, which capture behavioral and emotional fluctuations indicative of depressive symptoms. To effectively fuse the memory-augmented features with GRU outputs, we further design a Hierarchical Attention Fusion (HAF) module. Our method is evaluated on the widely used DAIC-WOZ and E-DAIC datasets, achieving state-of-the-art performance.

2606.11195 2026-06-11 cs.CY cs.AI cs.HC 新提交

From Consumption to Reflection: Designing Human-AI Relations for Stable Reasoning

从消费到反思:为稳定推理设计人-人工智能关系

Rikard Rosenbacke, Carl Rosenbacke, Victor Rosenbacke, Martin McKee

AI总结 提出关系反思智能(RRI),一种推理时治理层,通过可审计的推理循环实现反思,将人机交互转变为联合推理系统,以补偿双方局限并实现稳定推理。

详情
AI中文摘要

大型语言模型(LLM)改变了人类获取信息的方式,但并未改变我们推理信息的方式。它们的流畅性加速了消费,同时绕过了支撑健全判断的缓慢反思过程。本文介绍了关系反思智能(RRI),一种推理时治理层,通过可审计的推理循环将反思操作化。RRI 不在模型内部运行,而是在模型周围运行,为人类与 LLM 之间的稳定、可审计推理提供了实用结构。核心前提是,LLM 继承了与塑造人类思维相似的认知脆弱性:依赖直觉捷径、混淆表征与现实、偏好连贯性而非证伪。当人类和模型共享这些倾向时,它们的错误会叠加。我们称之为关系漂移,一种源于交互而非仅来自模型的失败。解决这一问题需要从建模词间关系转向建模模型输出与人类推理之间的关系。RRI 通过三个组件提供了这一缺失层:Rose-Frame(识别推理中可能的故障点)、Architect's Pen(在关键时刻引入针对性反思步骤)以及一个推理时工作流(无需重新训练模型即可嵌入这些步骤)。这些元素共同将人机交互转变为一个具有显式检查点、冲突揭示和可审计假设轨迹的联合推理系统。RRI 不是让机器像人类一样思考,也不是强迫人类像机器一样推理,而是创造一种结构化交互,使双方补偿彼此的局限。它将 AI 安全重新定义为认知架构问题,其中可靠决策取决于将反思直接嵌入交互过程。

英文摘要

Large language models (LLMs) have transformed how humans access information, but not how we reason with it. Their fluency accelerates consumption while bypassing the slow, reflective processes that underpin sound judgment. This paper introduces Relational Reflective Intelligence (RRI), an inference-time governance layer that operationalizes reflection through auditable reasoning loops. RRI operates not inside the model but around it, providing a practical structure for stable, auditable reasoning between humans and LLMs. The core premise is that LLMs inherit cognitive vulnerabilities similar to those that shape human thought: reliance on intuitive shortcuts, confusion between representation and reality, and a preference for coherence over falsification. When humans and models share these tendencies, their errors compound. We refer to this as relational drift, a failure that arises from interaction rather than from the model alone. Addressing this requires a shift from modeling relations between words to structuring relations between model outputs and human reasoning. RRI provides this missing layer through three components: the Rose-Frame, which identifies likely breakdowns in reasoning; the Architect's Pen, which introduces targeted reflection steps at critical moments; and an inference-time workflow that embeds these steps without retraining the model. Together, these elements transform human-AI interaction into a joint reasoning system with explicit checkpoints, conflict surfacing, and an auditable trail of assumptions. Rather than making machines think like humans or forcing humans to reason like machines, RRI creates a structured interaction in which both compensate for each other's limitations. It reframes AI safety as a cognitive architecture problem, where reliable decisions depend on embedding reflection directly into the interaction process.

2606.07537 2026-06-11 cs.CL cs.AI cs.LG 交叉投稿

From Architecture to Output: Structural Origins of Hallucination in Large Language Models and the Amplifying Role of Data

从架构到输出:大语言模型中幻觉的结构性起源及数据的放大作用

Md. Rejaul Korim Sadi, Toufiqur Rahman Tasin, Golam Mostofa Naeem

AI总结 本文分析大语言模型幻觉的结构性根源,指出自注意力、最大似然估计训练目标和自回归解码三个架构决策构成复合失效系统,并揭示数据病理如何放大这些脆弱性。

详情
Comments
11 pages, 7 figures, 15 references
AI中文摘要

大语言模型会产生幻觉——生成流畅、自信但事实错误的输出——这种一致性跨越代际和规模。现有分类法按输出类型对幻觉进行分类,区分内在与外在失败以及忠实性与事实性偏差。这些框架在描述上严谨,但未能识别产生特定实例的内部机制。本文将幻觉分析为三个架构决策的结构性后果,这些决策共同构成一个复合失效系统。自注意力的共现学习用统计邻近性替代语义含义,导致实体混淆、事实错误归因和语义漂移。最大似然估计训练目标在无事实约束下优化下一个词元概率,奖励统计上合理的输出,无论其真值如何。自回归解码在暴露偏差下的永久从左到右承诺确保单个错误词元级联向前传递整个输出序列而无法修正。数据集病理——长尾缺陷、训练偏差和合成污染——放大了这些脆弱性,但并非独立导致它们。我们做出三项贡献。首先,我们将每个机制映射到Alansari和Luqman分类法中的特定输出类别,将内在幻觉定位于自注意力,外在幻觉定位于MLE,逻辑不一致定位于自回归解码。其次,我们表明每个常被引用的数据集病理利用这些机制之一,而非独立产生幻觉。第三,我们识别出仅基于输出类型分类的诊断局限性,并将其与推理层缓解方法进行对比。

英文摘要

Large language models hallucinate--producing fluent, confident, factually wrong outputs--with a consistency that persists across generations and scales. Existing taxonomies classify hallucination by output type, distinguishing intrinsic from extrinsic failures and faithfulness from factuality divergence. These frameworks are descriptively rigorous but do not identify which internal mechanism produced a given instance. This paper analyses hallucination as a structural consequence of three architectural decisions that together form a compound failure system. Self-attention's co-occurrence learning substitutes statistical proximity for semantic meaning and produces entity confusion, fact misattribution, and semantic drift. The maximum likelihood estimation training objective optimises next-token probability without factual constraint, rewarding statistically plausible outputs regardless of their truth value. Autoregressive decoding's permanent left-to-right commitment under exposure bias ensures that a single wrong token cascades forward through the entire output sequence without revision. Dataset pathologies--long-tail deficiencies, training bias, and synthetic pollution--amplify these vulnerabilities but do not independently cause them. We make three contributions. First, we map each mechanism to a specific output category in the Alansari and Luqman taxonomy, locating intrinsic hallucination in self-attention, extrinsic hallucination in MLE, and logical inconsistency in autoregressive decoding. Second, we show that each commonly cited dataset pathology exploits one of these mechanisms rather than originating hallucination independently. Third, we identify the diagnostic limitation of output-type-only classification and contrast it with inference-layer mitigation approaches.

2605.04893 2026-06-11 cs.LG cs.CL stat.ML 版本更新

Self-Attention as Transport: Limits of Symmetric Spectral Diagnostics

自注意力作为传输:对称谱诊断的极限

Dominik Dahlem, Diego Maniloff, Mac Misiura

AI总结 研究语言模型注意力路由的两种失效形状(过度集中或过度分散),证明对称谱诊断对方向不敏感,并揭示因果注意力中传输容量的理论下限,提出基于容量和方向的双轴诊断方法。

详情
Comments
48 pages, 6 figures, 7 tables; 81-page online supplement (proofs, additional experiments, dataset statistics) as an ancillary file
AI中文摘要

当语言模型处理幻觉响应时,其注意力路由往往以两种形状之一失效:过度集中在狭窄的位置集合上,或者分散得如此广泛以至于相关性被稀释,而失效的形状携带诊断信号。我们研究这些形状作为诊断特征,从在基准标记响应的\emph{强制评分}下计算的注意力矩阵中得出,而不是在实时生成期间。一类广泛使用的谱方法分析度归一化注意力算子的对称分量,该算子控制传输\emph{容量};我们证明该算子的每个转置不变谱诊断在结构上是\emph{方向盲的}(它无法区分算子与其转置,因此无法检测信息流方向),并且盲定理的逆定理将任何Lipschitz诊断的转置敏感性限制为不对称系数$G$。将其与规范因果架构的闭式二分-Cheeger景观配对,我们证明均匀因果注意力满足一个与$n$无关的下界$\phi \ge 1/5$,而窗口注意力以$O(w/n)$穿透下界;失效模式在形状上不同,而不仅仅在数值上不同。这个下界是一个理想化架构的基准,而不是经验吸引子:穿透它的真实注意力头的比例本身就是一个架构特征。由此产生的双轴诊断($\phi$表示容量,$G$表示方向)产生一个可证伪的极性预测:瓶颈主导和分散主导的基准应表现出相反的极性。在长度控制评估下,传输特征在测试的仅解码器、仅编码器和编码器-解码器模型中保持可解释的信号(0.62-0.84 LC-AUROC),极性在HaluEval和MedHallu之间如预测般反转。

英文摘要

When a language model processes a hallucinated response, its attention routing tends to fail in one of two shapes: over-concentrating on a narrow set of positions, or spreading so diffusely that relevance is diluted, and the shape of the failure carries diagnostic signal. We study these shapes as a diagnostic characterization, computed from attention matrices under \emph{forced scoring} of benchmark-labeled responses rather than during live generation. A widely used family of spectral methods analyzes the symmetric component of the degree-normalized attention operator, which governs transport \emph{capacity}; we prove that every transpose-invariant spectral diagnostic of this operator is structurally \emph{orientation-blind} (it cannot distinguish an operator from its transpose, and therefore cannot detect information-flow direction), with a converse to the blindness theorem bounding any Lipschitz diagnostic's transpose sensitivity by the asymmetry coefficient $G$. Pairing this with a closed-form bipartite-Cheeger landscape for canonical causal architectures, we show that uniform causal attention satisfies an $n$-independent floor $\phi \ge 1/5$, while window attention pierces the floor as $O(w/n)$; failure modes are shape-different, not just value-different. This floor is an idealized-architecture benchmark, not an empirical attractor: the fraction of real attention heads that pierce it is itself an architectural signature. The resulting two-axis diagnostic ($\phi$ for capacity, $G$ for direction) yields a falsifiable polarity prediction: bottleneck- and diffuse-dominated benchmarks should exhibit opposite polarity. Under length-controlled evaluation, transport features retain interpretable signal (0.62-0.84 LC-AUROC) across the tested decoder-only, encoder-only, and encoder-decoder models, with polarity reversing as predicted between HaluEval and MedHallu.