arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.13362 2026-05-15 cs.MA cs.AI cs.DC cs.GT econ.TH

Constitutional Governance in Metric Spaces

Ehud Shapiro, Nimrod Talmon

AI总结本文研究了在度量空间中实现平等自主治理的计算机制，提出了宪法治理框架，将提案、审议、修改和共识等过程整合为一个多项式时间协议。该框架通过为每个可修改的组件分配度量空间、聚合规则和超级多数阈值，支持成员通过理想元素投票并提交获得超级多数支持的公开提案，从而实现宪法共识。研究还展示了该框架在七个典型场景中的应用，并证明了广义中位数在多数阈值下具有良好的激励相容性，为数字社区和组织的宪法治理提供了全面解决方案。

详情

英文摘要

Computational social choice and algorithmic decision theory offer rich aggregation theory but no comprehensive process for egalitarian self-governance: aggregation, deliberation, amendment, and consensus are each considered in isolation, with key metric-space aggregators being NP-hard. Here, we propose constitutional governance in metric spaces, integrating these stages into a coherent polynomial-time protocol for constitutional governance. The constitution assigns, per amendable component including itself, a metric space, aggregation rule, and supermajority threshold. Amendments proceed by members voting with their ideal elements, followed by members submitting public proposals carrying supermajority public support under the revealed votes. Public proposals can be sourced from deliberation among members, vote aggregation, or AI mediation. The constitutional rule adopts a supported proposal with positive maximal score, if there is one, else retains the status quo. With Constitutional Consensus, a community can run the constitutional governance protocol on members' personal computing devices (e.g., smartphones), achieving digital sovereignty. We focus on the utility of the generalised median, prove that at majority threshold no misreport weakly dominates sincere voting, and study the compromise gap between best peak and unconstrained optimum. We instantiate the framework to seven canonical settings -- electing officers, setting rates, allocating budgets, ranking priorities, selecting boards, drafting bylaws, and amending the constitution. By unifying metric-space aggregation, reality-aware social choice, supermajority amendment, constitutional consensus, deliberative coalition formation, and AI mediation, this work delivers a comprehensive solution to the constitutional governance of digital communities and organisations.

URL PDF HTML ☆

赞 0 踩 0

2605.13343 2026-05-15 cs.GR cs.DC cs.LG cs.NA math.NA

Hierarchical Transformer Preconditioning for Interactive Physics Simulation

Carl Osborne, Minghao Guo, Crystal Owens, Wojciech Matusik

AI总结该研究提出了一种基于分层Transformer的预条件器，用于加速实时物理模拟中的求解过程。通过结合弱可接受H-矩阵划分，该方法在保持计算效率的同时，能够有效捕捉长程耦合关系。核心贡献包括一种新的训练目标函数，提升了预条件器对不规则谱的适应性，并实现了在大规模多相泊松系统中的高效求解，显著优于传统方法。

Comments 10 pages, 7 figures. Includes supplementary video and material

2605.13137 2026-05-15 cs.IR cs.AI

LeanSearch v2: Global Premise Retrieval for Lean 4 Theorem Proving

Guoxiong Gao, Zeming Sun, Jiedong Jiang, Yutong Wang, Jingda Xu, Peihao Wu, Bryan Dai, Bin Dong

AI总结 LeanSearch v2 是一种用于 Lean 4 定理证明的全局前提检索系统，旨在从数学库中找到能够支持定理证明的多个相关引理。该系统包含两种模式：标准模式通过嵌入-重排序流程实现高精度的单次查询检索，而推理模式则通过迭代的草稿-检索-反思循环实现全局前提的恢复。实验表明，LeanSearch v2 在多个基准测试中显著优于现有系统，有效提升了定理证明的成功率。

2605.13095 2026-05-15 cs.CR cs.AI cs.CY cs.LG

Watermarking Should Be Treated as a Monitoring Primitive

Toluwani Aremu, Nils Lukas, Jie Zhang

AI总结该论文探讨了生成模型中水印技术在溯源、归因和安全监控中的应用，并指出当前水印评估通常仅针对单个样本的对抗攻击，忽视了观察者通过聚合多个输出信号进行实体级信息推断的能力。研究引入了基于观察者的威胁模型，表明即使零比特水印也能在多密钥环境下实现归因，并揭示了水印设计在外部监控方面的潜在风险与应对策略。论文揭示了归因与监控之间的根本性双重用途矛盾，强调水印评估应超越单样本鲁棒性，考虑聚合分析和观察者能力的影响。

Comments 12 pages, 5 figures

2605.09664 2026-05-15 cs.CR cs.LG

FreeMOCA: Memory-Free Continual Learning for Malicious Code Analysis

Zahra Asadi, Haeseung Jeon, Sohyun Han, Md Mahmuduzzaman Kamol, Se Eun Oh, Mohammad Saidur Rahman

AI总结随着每年新发现的恶意软件样本超过2亿个，反病毒系统需要不断适应不断变化的威胁环境。然而，仅使用新样本进行再训练会导致灾难性遗忘和可被利用的检测盲区，而使用整个数据集再训练则计算成本高昂。为此，本文提出FreeMOCA，一种无需存储记忆且计算高效的持续学习框架，通过在任务更新之间进行自适应的逐层插值，保留先前知识，从而有效提升恶意代码分析的持续学习能力。实验表明，FreeMOCA在多个大规模基准数据集上显著优于现有方法，大幅减少了遗忘并提升了检测准确率。

Comments 17 pages, 5 figures, 12 tables

2605.09530 2026-05-15 cs.CR cs.CL

MemPrivacy: Privacy-Preserving Personalized Memory Management for Edge-Cloud Agents

Yining Chen, Jihao Zhao, Bo Tang, Haofen Wang, Yue Zhang, Fei Huang, Feiyu Xiong, Zhiyu Li

AI总结随着基于大语言模型的智能体越来越多地部署在边缘-云环境中，个性化记忆成为实现长期适应和以用户为中心交互的关键。然而，现有的云端辅助记忆管理方式容易暴露敏感用户信息，而现有的隐私保护方法通常依赖于激进的语义抹除，导致记忆效用和个性化质量下降。为此，本文提出 MemPrivacy，通过在边缘设备上识别隐私敏感内容，并用语义结构化的类型感知占位符替代，既保护了隐私，又保留了记忆生成与检索所需的信息。实验表明，MemPrivacy 在隐私信息提取方面表现优异，同时显著降低了推理延迟，有效平衡了隐私保护与个性化记忆效用。

2605.09018 2026-05-15 cs.NE cs.AI cs.LG

Evolutionary Ensemble of Agents

Zongmin Yu, Liu Yang

AI总结本文提出了一种名为EvE的进化集成框架，用于组织现有的高能力编码代理，使其形成一个协同进化的系统，以实现算法发现。该方法固定基础代理结构，专注于进化代理行为的指导与技能，通过两个协同进化的种群（功能代码求解器和代理指导状态）进行同步竞争，并根据其对当前求解状态的边际贡献更新代理的Elo评分。实验表明，EvE在In-Context Operator Networks（ICON）的研究瓶颈中自主发现了可靠的缩放-插值机制，展示了其在复杂代码库中通过自适应代理集成突破性能瓶颈的有效性。

2605.07060 2026-05-15 physics.geo-ph cs.LG physics.comp-ph stat.ML

Functional-prior-based approaches to Bayesian PDE-constrained inversion using physics-informed neural networks

Ryoichiro Agata, Tomohisa Okazaki

AI总结本文提出了一种基于函数先验的贝叶斯偏微分方程约束反演方法（fpBPINN），旨在将物理意义明确的函数空间先验有效引入基于物理信息神经网络（PINN）的贝叶斯反演中。研究引入了两种互补方法：一种通过学习神经网络权重先验以符合给定函数先验，另一种则在函数空间中直接进行变分推理。实验表明，这两种方法在地震层析成像和达西流渗透率反演中均能准确估计后验分布，突显了引入物理可解释函数先验在提升反演精度中的重要性。

2604.17954 2026-05-15 math.DG cs.LG

Complex normalizing flows can almost be information Kähler-Ricci flows

Andrew Gracyk

AI总结本文探讨了复正规化流与近似凯勒-里奇流之间的联系，将复正规化流中用于密度变换的对数行列式与凯勒流形的里奇曲率联系起来。通过引入增广雅可比矩阵和贝叶斯参数视角，研究揭示了复正规化流的对数密度在连续极限下与费舍尔信息度量相吻合，从而在时间导数和期望的意义下恢复了凯勒-里奇流的变体。该工作建立了复正规化流的统计行为与几何特征之间的桥梁，为理解深度生成模型提供了新的几何视角。

2604.09603 2026-05-15 cs.DC cs.AI cs.LG

ECHO: Elastic Speculative Decoding with Sparse Gating for High-Concurrency Scenarios

Xinyi Hu, Yuhao Shen, Baolin Zhang, Hengxin Zhang, Jun Dai, Shuang Ge, Lei Chen, Yue Li, Mingcheng Wan

AI总结 ECHO 是一种面向高并发场景的弹性推测解码框架，旨在提升大语言模型推理效率。该方法通过稀疏置信度门控机制，将推测执行重新建模为预算调度问题，灵活平衡解码深度与宽度，从而减少全局验证步骤并提高每步效率。实验表明，ECHO 在多种模型规模下均优于现有方法，尤其在工业级模型 Qwen3-235B 上实现了最高达 5.35 倍的加速效果。

2603.29097 2026-05-15 eess.AS cs.SD

Asymmetric Encoder-Decoder Based on Time-Frequency Correlation for Speech Separation

Ui-Hyeop Shin, Hyung-Min Park

AI总结本文研究了在真实声学环境下如何有效分离混叠语音信号的问题，提出了一种基于时频相关性的不对称编码-解码框架SR-CorrNet。该方法通过引入分离-重建策略，结合时频双路径结构，实现了对说话人特征的逐步细化提取，并利用结构化的相关性到滤波估计方法提升分离效果。实验表明，该方法在多种数据集和不同环境条件下均取得了显著的性能提升。

Comments Submitted to IEEE Transactions on Audio, Speech, and Language Processing (TASLPRO) Code: https://github.com/dmlguq456/SR_CorrNet

2603.24586 2026-05-15 cs.SE cs.CL

Comparing Developer and LLM Biases in Code Evaluation

Aditya Mittal, Ryan Shar, Zichu Wu, Shyam Agarwal, Tongshuang Wu, Chris Donahue, Ameet Talwalkar, Wayne Chi, Valerie Chen

AI总结随着大语言模型（LLM）在代码评估中被广泛用作评判者，研究其在真实交互场景中的表现变得尤为重要。本文提出TRACE框架，用于评估LLM评判者预测人类偏好和揭示人类与模型在代码质量评价上的系统性偏差的能力。研究发现，在多种代码交互场景中，最佳LLM评判者的表现仍比人类注释者低12%-23%，并识别出35个导致人类与模型评判不一致的关键因素，其中大部分与现有软件工程代码质量标准相关。

2603.24422 2026-05-15 cs.IR cs.AI cs.CL

OneSearch-V2: The Latent Reasoning Enhanced Self-distillation Generative Search Framework

Ben Chen, Siyuan Wang, Yufei Ma, Zihan Liang, Xuxin Zhang, Yue Lv, Ying Yang, Huangyu Dai, Lingtao Mao, Tong Zhao, Zhipeng Qian, Xinyu Sun, Zhixin Zhai, Yang Zhao, Bochao Liu, Jingshan Lv, Xiao Liang, Hui Kong, Jing Chen, Han Li, Chenyi Lei, Wenwu Ou, Kun Gai

AI总结本文提出了一种名为 OneSearch-V2 的生成式检索框架，旨在解决现有系统在复杂查询理解、用户意图挖掘和偏好过拟合等方面的问题。该方法通过引入潜在推理增强的自蒸馏训练机制，提升了对用户深层需求的理解与匹配能力，并结合行为偏好对齐优化系统，有效缓解了单一转化指标带来的奖励黑客问题。实验表明，OneSearch-V2 在多项指标上均有显著提升，包括点击率、买家数量和订单量，并改善了搜索体验质量。

Comments Codes are available at https://github.com/benchen4395/onesearch-family. Feel free to contact benchen4395@gmail.com

详情

英文摘要

Generative Retrieval (GR) has emerged as a promising paradigm for modern search systems. Compared to multi-stage cascaded architecture, it offers advantages such as end-to-end joint optimization and high computational efficiency. OneSearch, as a representative industrial-scale deployed generative search framework, has brought significant commercial and operational benefits. However, its inadequate understanding of complex queries, inefficient exploitation of latent user intents, and overfitting to narrow historical preferences have limited its further performance improvement. To address these challenges, we propose OneSearch-V2, a latent reasoning enhanced self-distillation generative search framework. It contains three key innovations: (1) a thought-augmented complex query understanding module, which enables deep query understanding and overcomes the shallow semantic matching limitations of direct inference; (2) a reasoning-internalized self-distillation training pipeline, which uncovers users' potential yet precise e-commerce intentions beyond log-fitting through implicit in-context learning; (3) a behavior preference alignment optimization system, which mitigates reward hacking arising from the single conversion metric, and addresses personal preference via direct user feedback. Extensive offline evaluations demonstrate OneSearch-V2's strong query recognition and user profiling capabilities. Online A/B tests further validate its business effectiveness, yielding +3.98\% item CTR, +2.07\% buyer volume, and +2.11\% order volume. Manual evaluation further confirms gains in search experience quality, with +1.37\% in page good rate and +1.65\% in query-item relevance. More importantly, OneSearch-V2 effectively mitigates common search system issues such as information bubbles and long-tail sparsity, without incurring additional inference costs or serving latency.

URL PDF HTML ☆

赞 0 踩 0

2603.00772 2026-05-15 stat.ML cs.LG

Generalizing Score-based generative models for Heavy-tailed Distributions

Tiziano Fassina, Gabriel Cardoso, Sylvan Le Corff, Thomas Romary

AI总结本文研究了如何将基于分数的生成模型（SGMs）推广到具有重尾分布的数据。针对现有方法在生成保真度和理论基础方面的不足，作者提出了两个理论贡献：一是证明通过早期停止和适当初始化可以将扩散框架扩展到任意目标分布；二是为归一化流的生成过程推导出新的理论保证。基于这些结果，文章提出了一种统一的生成框架，结合归一化流捕捉重尾特性与SGM细化结构细节，有效提升了生成质量并克服了现有方法的局限。

2602.17407 2026-05-15 eess.SY cs.RO cs.SY

Bluetooth Phased-array Aided Inertial Navigation Using Factor Graphs: Experimental Verification

Glen Hjelmerud Mørkbak Sørensen, Torleiv H. Bryne, Kristoffer Gryte, Tor Arne Johansen

AI总结本文研究了利用相控阵蓝牙系统辅助惯性导航的问题，提出基于因子图优化的估计方法，并通过多旋翼无人机飞行实验验证其性能。研究对比了不同鲁棒估计策略在GNSS信号丢失场景下的表现，展示了蓝牙角度、距离或气压测量辅助导航的可行性与效果。该工作为低成本、高鲁棒性的室内导航系统提供了实验依据与方法支持。

Comments 6 pages, 5 figures, 2 tables. \c{opyright} 2026 the authors. This work has been accepted to IFAC for publication under a Creative Commons Licence CC-BY-NC-ND

2602.15249 2026-05-15 cs.DL cs.AI

Artificial Intelligence Specialization in the European Union: Underexplored Role of the Periphery at NUTS-3 Level

Victor Herrero-Solana, Carmen Gálvez

AI总结本研究分析了2015年至2024年间欧洲NUTS-3地区在人工智能领域的研究分布情况，利用引文数据和分类系统，计算了相对专业化指数和相对引用影响力指标。研究发现，尽管巴黎、华沙和马德里等大都市在论文数量上占优，但人工智能领域的相对专业化程度最高的是东欧和西班牙的一些外围地区，如格拉纳达和维尔纽斯地区。研究还揭示了专业化与引用影响力之间关系较弱，不同地区呈现出多样化的发展模式。

Comments 15 pages, 3 figures

2602.14881 2026-05-15 math.OC cs.AI

Numerical exploration of the range of shape functionals using neural networks

Eloi Martinet, Ilias Ftouhi

AI总结本文提出了一种基于神经网络的新数值框架，用于探索Blaschke–Santaló图，该图用于描述形状泛函之间的可能不等式关系。通过引入基于规范函数的可逆神经网络结构，实现了对任意维凸集的参数化，并在形状优化过程中保持凸性。为实现图内的均匀采样，作者设计了一种通过自动微分最小化Riesz能量泛函的粒子系统，并在二维和三维凸体的多个几何和偏微分方程型泛函上验证了方法的有效性。

Comments 20 pages, 8 figures

2602.06718 2026-05-15 cs.CR cs.AI

GhostCite: A Large-Scale Analysis of Citation Validity in the Age of Large Language Models

Zuyao Xu, Yuqi Qiu, Lu Sun, Fasheng Miao, Fubin Wu, Xiang Li, Xinyi Wang, Haozhe Lu, Zhengze Zhang, Yuxin Hu, Jialu Li, Luo Jin, Feng Zhang, Rui Luo, Xinran Liu, Yingxian Li, Jiaji Liu

AI总结《GhostCite：大语言模型时代引文有效性的大规模分析》研究了大型语言模型（LLMs）在学术写作中广泛使用所引发的引文有效性问题。研究开发了一个开源框架\citeb，用于大规模验证引文，并通过三个实验分析了LLMs生成虚假引文（“幽灵引文”）的现象。研究发现，所有测试的LLMs在不同领域生成引文时都有较高比例的虚构引文，且近年来学术会议论文中的无效引文比例显著上升，同时多数研究者依赖AI工具，但审稿人对引文的审查并不严格，反映出当前学术出版体系在应对这一问题上的不足。

2602.03680 2026-05-15 physics.soc-ph cs.SD

Instantaneous Spectra Analysis of Pulse Series -- Application to Lung Sounds with Abnormalities

Fumihiko Ishiyama

AI总结本文研究了脉冲序列的瞬时频谱分析方法，并将其应用于异常肺音（如爆裂音和哮鸣音）及正常肺音的分析。传统傅里叶分析的时间频率分辨率受限于周期边界条件假设，作者提出采用线性外推条件替代该假设，从而实现更精确的瞬时频谱分析。该方法能够有效提取脉冲序列中每个脉冲的频谱信息，并生成脉冲序列的时频图，清晰展示其时间频率结构，为异常肺音的识别提供了新的分析工具。

Comments 10 pages, 7 figures. To appear Proc. IEEE CSPA 2026

2512.12772 2026-05-15 cs.MM cs.CV

JointAVBench: A Benchmark for Joint Audio-Visual Reasoning Evaluation

Jianghan Chao, Jianzhang Gao, Wenhui Tan, Yuchong Sun, Ruihua Song, Liyun Ru

AI总结为了全面评估能够处理多模态信息的全大语言模型（Omni-LLMs），本文提出JointAVBench基准，涵盖多模态依赖、多样化的音频信息类型和不同场景跨度三个关键方面。该基准通过自动化流程生成严格依赖音视频联合理解的问题与答案，弥补了现有数据集在多模态评估方面的不足。实验表明，即使表现最好的Omni-LLM在该基准上的平均准确率也仅为65.3%，显示出在跨场景推理等方面仍有较大提升空间。

2511.21247 2026-05-15 eess.AS cs.LG cs.SD

The Spheres Dataset: Multitrack Orchestral Recordings for Music Source Separation and Information Retrieval

Jaime Garcia-Martinez, David Diaz-Guerra, John Anderson, Ricardo Falcon-Perez, Pablo Cabañas-Molero, Tuomas Virtanen, Julio J. Carabias-Orti, Pedro Vera-Candeas

AI总结本文介绍了《Spheres数据集》，这是一个包含多轨管弦乐录音的数据集，旨在推动经典音乐领域中音乐源分离及相关音乐信息检索任务的机器学习研究。数据集由Colibrì乐团在The Spheres录音棚演奏的超过一小时的音乐作品组成，包括柴可夫斯基《罗密欧与朱丽叶》和莫扎特第四十号交响曲，并附有各乐器的音阶和独奏片段。通过23个麦克风的多角度录制，该数据集提供了真实立体声混音、可控的音轨混入以及独立音轨，适用于源分离模型的训练与评估，并附有各乐器位置的房间脉冲响应，为研究提供了丰富的声学特性信息。

2511.18820 2026-05-15 physics.flu-dyn cs.LG

Unsupervised simulation of incompressible flows with physics- and equality- constrained artificial neural networks

Qifeng Hu, Inanc Senocak

AI总结该研究提出了一种基于物理约束和等式约束的人工神经网络（PECANN）框架，用于无监督模拟不可压缩流体在高雷诺数下的流动。通过引入压力泊松方程目标函数和条件自适应增广拉格朗日乘子法（CA-ALM），严格满足连续性方程和边界条件，有效解决了传统物理信息神经网络在高雷诺数流动中难以保证无散性约束的问题。实验表明，该方法在多个典型流动场景中无需监督预训练或标签数据，即可准确捕捉流动结构，包括高雷诺数下圆柱绕流中涡旋脱落的自发产生。

Comments 33 pages, 19 figures

详情

英文摘要

Physics-informed neural networks (PINNs) have shown promise for solving partial differential equations, yet their success in simulating incompressible flows at high Reynolds numbers remains limited. Existing approaches rely on auxiliary labeled data, supervised pretraining, or reference solutions, and no purely unsupervised method comparable to conventional finite-difference or finite-volume solvers has been demonstrated. We attribute this gap to the absence of a mechanism for enforcing the divergence-free constraint and boundary conditions to strict tolerances. To address this, we adopt the physics- and equality-constrained artificial neural network (PECANN) framework with a conditionally adaptive augmented Lagrangian method (CA-ALM), and introduce a pressure-Poisson-based objective. The residual of the pressure Poisson equation is minimized subject to the momentum and continuity equations and boundary conditions on the primitive variables as equality constraints, with CA-ALM enforcing all constraints tightly. For advection-dominated, high-Reynolds-number flows, we further propose an adaptive vanishing entropy viscosity that stabilizes early training without influencing the converged solution. A baseline that instead uses the momentum residual as the objective proves ineffective under the same machinery, underscoring the critical role of the pressure-Poisson objective. The method is assessed on lid-driven cavity flow up to $Re=7{,}500$, three-dimensional unsteady Beltrami flow, and steady and unsteady flow past a circular cylinder with general inflow-outflow boundary conditions, including an ablation study identifying admissible outlet conditions -- all without labeled data or supervised pretraining. Notably, it captures the spontaneous onset of periodic vortex shedding in unsteady cylinder flow without external perturbations, starting from a randomly initialized network.

URL PDF HTML ☆

赞 0 踩 0

2511.16964 2026-05-15 cs.MA cs.AI cs.DC

Optimizing PyTorch Inference with LLM-Based Multi-Agent Systems

Kirill Nagaitsev, Luka Grbcic, Samuel Williams, Costin Iancu

AI总结本文研究了如何利用基于大语言模型的多智能体系统优化PyTorch推理性能。通过构建逻辑框架对比不同多智能体优化系统，发现采用以利用为主策略并结合错误修复智能体能取得最佳效果，且优化粒度对性能有显著影响。实验表明，该方法在H100 GPU上实现了比PyTorch Eager平均2.88倍的加速，优于torch.compile的1.85倍。

2511.05820 2026-05-15 cs.SE cs.AI

From Ranking to Reasoning: Explainable Web API Recommendation via Semantic Reasoning

Zishuo Xu, Dezhong Yao, Yao Wan

AI总结随着Web API数量的快速增长，自动化的API推荐对于高效构建混合应用变得至关重要。现有方法在推荐策略固定、无法适应复杂需求以及缺乏解释性方面存在不足。为此，本文提出WAR-R1框架，结合语义推理与可变规模推荐，通过轻量大语言模型生成推荐API及其自然语言解释，并引入特殊起始和终止标记以支持推荐数量的自适应调整。实验表明，WAR-R1在推荐准确率和解释质量上均优于现有方法，验证了其有效性。

2511.05159 2026-05-15 stat.ML cs.LG

A New Framework for Convex Clustering in Kernel Spaces: Finite Sample Bounds, Consistency and Performance Insights

Shubhayan Pan, Kushal Bose, Debolina Paul, Saptarshi Chakraborty, Swagatam Das

AI总结本文提出了一种在核空间中的凸聚类新框架，用于处理线性不可分或非凸结构的数据。该方法通过将数据映射到再生核希尔伯特空间（RKHS），在变换后的空间中进行凸聚类，从而提升对复杂数据分布的处理能力，并能在有限维空间中生成嵌入表示。研究提供了该方法的理论保证，包括算法收敛性和有限样本误差界，并通过实验验证了其在合成和真实数据集上的优越性能，为非线性与非凸数据的聚类提供了有效解决方案。

2510.25240 2026-05-15 stat.ML cs.LG

Generative Bayesian Optimization: Generative Models as Acquisition Functions

Rafael Oliveira, Daniel M. Steinberg, Edwin V. Bonilla

AI总结本文提出了一种将生成模型用于批量贝叶斯优化（BO）的通用策略，使生成模型能够作为候选解采样器，从而实现大规模批量优化、非连续设计空间优化以及高维和组合设计优化。受直接偏好优化（DPO）成功启发，研究通过使用观测数据计算出的简单效用值训练生成模型，使其生成的分布密度与预期效用（即BO的获取函数值）成正比，避免了传统方法中构建代理模型的需求。理论分析表明，生成模型在BO过程中形成的分布序列在一定条件下可逼近最优目标，并通过高维大规模优化实验验证了方法的有效性。

Comments Published at ICLR 2026. Compared with the proceedings version on OpenReview, this version includes a minor revision to Section 3

2510.19973 2026-05-15 cs.NI cs.AI

A Tutorial on Cognitive Biases in Agentic AI-Driven 6G Autonomous Networks

Hatim Chergui, Farhad Rezazadeh, Merouane Debbah, Christos Verikoukis

AI总结本文综述了智能体驱动的6G自组织网络中常见的认知偏差问题，分析了这些偏差的分类、数学表达及其在通信系统中的表现，并提出了针对性的缓解策略。通过两个6G网络管理场景的案例验证，研究展示了如何利用本地化大语言模型和改进的记忆机制，有效减少锚定偏差和时间确认偏差，从而提升资源分配效率，实现显著的能耗降低和延迟优化。

Comments 26 pages, 18 figures, 4 tables, link to source code available. Accepted at IEEE OJCOMS

详情

英文摘要

The path to higher network autonomy in 6G lies beyond the mere optimization of key performance indicators (KPIs), requiring systems that perceive and reason over the network environment as it is. This can be achieved through agentic AI, where large language model (LLM)-powered agents utilize multimodal telemetry, memory, and cross-domain negotiation to achieve multi-objective goals. However, deploying such agents introduces cognitive biases inherited from human design, which can severely distort reasoning and actuation. This paper provides a comprehensive tutorial on well-known cognitive biases, detailing their taxonomy, mathematical formulation, emergence in telecom systems, and tailored mitigation strategies. We validate these concepts through two distinct use-cases in 6G management. First, we tackle anchoring bias in inter-slice resource negotiation. To overcome the prohibitive execution delays of cloud-based LLMs, this use-case deploys a locally hosted 1B-parameter model on an RTX A4000 GPU, successfully achieving sub-second inference latencies compatible with near-real-time operations. By replacing fixed heuristic anchors with a Truncated Weibull randomized anchor strategy, the agents dismantle rigid biases, intelligently consume SLA slack, and dynamically double the system-wide energy savings (peaking at 25\%) without violating strict latency limits. Second, we mitigate temporal and confirmation biases in RAN-Edge cross-domain negotiation by designing an unbiased collective memory. By integrating semantic/temporal decay and an inflection bonus that actively highlights past negotiation failures, agents are prevented from over-relying on recent data or repeating past mistakes. Grounding decisions in this richer, debiased historical context yields highly robust agreements, achieving a $\times 5$ latency reduction and roughly 40\% higher energy savings compared to memoryless baselines.

URL PDF HTML ☆

赞 0 踩 0

2510.15141 2026-05-15 stat.ML cs.LG stat.AP

Manifold Dimension Estimation via Local Graph Structure

Zelong Bi, Pierre Lafaye de Micheaux

AI总结本文提出了一种基于局部图结构的流形维度估计方法，通过在局部主成分分析坐标上进行回归来捕捉流形的局部结构。该方法引入了两个代表性估计器：二次嵌入（QE）和总最小二乘（TLS），实验表明它们在合成数据和现实数据上均具有竞争力，且在许多情况下优于现有先进方法。

2510.13583 2026-05-15 stat.ML cs.LG

On the Identifiability of Causal Graphs with the Invariance Principle

Francesco Montagna

AI总结本文研究了在独立同分布观测数据下因果图的可识别性问题，提出在结构因果模型生成的数据分布以及少量（最多两个）具有不同噪声统计特性的环境数据下，可以唯一确定因果图。该成果首次保证了在固定数量环境中恢复完整因果图的可能性，且适用于任意非线性机制，仅需噪声满足高斯性假设，并探讨了放松该假设的可能方法。研究还进一步拓展了独立成分分析与因果发现之间的对偶关系，表明在较少辅助信息条件下，因果发现可达到与非线性ICA相当的性能。

Comments Published as ICLR 2026 conference paper

2509.12341 2026-05-15 quant-ph cs.CL cs.CR

Exact Coset Sampling for Quantum Lattice Algorithms

Yifan Zhang

AI总结本文研究了Chen提出的量子晶格算法在学习错误（LWE）参数设置下的后处理阶段，提出了一种精确的余类采样方法。通过测量确定性余数并应用依赖于运行参数的二次相位调整，该方法能够消除波形啁啾，从而在无需完整偏移量的情况下实现精确的量子傅里叶变换。在满足一定前提条件的情况下，该方法能够以高概率得到满足共振条件的傅里叶结果，并在共振条件下保证结果在对偶超平面上的均匀分布。

Comments Preprint - Work in Progress