arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2511.16964 2026-05-15 cs.MA cs.AI cs.DC

Optimizing PyTorch Inference with LLM-Based Multi-Agent Systems

Kirill Nagaitsev, Luka Grbcic, Samuel Williams, Costin Iancu

发表机构 * NVIDIA Corporation（NVIDIA公司）； Microsoft Corporation（微软公司）； Ansel et al. ( 2024 )（Ansel等人（2024））； Sabne ( 2020 )（Sabne（2020））； Kerr et al. ( 2017 )（Kerr等人（2017））； Tillet et al. ( 2019 )（Tillet等人（2019））； Spector et al. ( 2024 )（Spector等人（2024））； Ouyang et al. ( 2025 )（Ouyang等人（2025））； Lange et al. ( 2025a（Lange等人（2025a；b））； b )（Li等人（2025））； Li et al. ( 2025 )（METR（2025））； METR ( 2025 )（Andrews和Witteveen（2025））； Andrews and Witteveen ( 2025 )（Baronio等人（2025））； Baronio et al. ( 2025 )（Novikov等人（2025））； Novikov et al. ( 2025 )（Wei等人（2025））； Wei et al. ( 2025 )（Sharma（2025））； Sharma ( 2025 )

AI总结本文研究了如何利用基于大语言模型的多智能体系统优化PyTorch推理性能。通过构建逻辑框架对比不同多智能体优化系统，发现采用以利用为主策略并结合错误修复智能体能取得最佳效果，且优化粒度对性能有显著影响。实验表明，该方法在H100 GPU上实现了比PyTorch Eager平均2.88倍的加速，优于torch.compile的1.85倍。

2511.05820 2026-05-15 cs.SE cs.AI

From Ranking to Reasoning: Explainable Web API Recommendation via Semantic Reasoning

Zishuo Xu, Dezhong Yao, Yao Wan

发表机构 * School of Software Engineering（软件工程学院）； School of Computer Science and Technology（计算机科学与技术学院）； Huazhong University of Science and Technology（华中科技大学）

AI总结随着Web API数量的快速增长，自动化的API推荐对于高效构建混合应用变得至关重要。现有方法在推荐策略固定、无法适应复杂需求以及缺乏解释性方面存在不足。为此，本文提出WAR-R1框架，结合语义推理与可变规模推荐，通过轻量大语言模型生成推荐API及其自然语言解释，并引入特殊起始和终止标记以支持推荐数量的自适应调整。实验表明，WAR-R1在推荐准确率和解释质量上均优于现有方法，验证了其有效性。

2511.05159 2026-05-15 stat.ML cs.LG

A New Framework for Convex Clustering in Kernel Spaces: Finite Sample Bounds, Consistency and Performance Insights

Shubhayan Pan, Kushal Bose, Debolina Paul, Saptarshi Chakraborty, Swagatam Das

发表机构 * Indian Statistical Institute, Kolkata（印度统计研究院，加尔各答）； Electronics and Communication Sciences Unit, Indian Statistical Institute（印度统计研究院电子与通信科学单位）； Department of Statistics, University of Oxford（牛津大学统计系）； Department of Statistics, University of Michigan（密歇根大学统计系）

AI总结本文提出了一种在核空间中的凸聚类新框架，用于处理线性不可分或非凸结构的数据。该方法通过将数据映射到再生核希尔伯特空间（RKHS），在变换后的空间中进行凸聚类，从而提升对复杂数据分布的处理能力，并能在有限维空间中生成嵌入表示。研究提供了该方法的理论保证，包括算法收敛性和有限样本误差界，并通过实验验证了其在合成和真实数据集上的优越性能，为非线性与非凸数据的聚类提供了有效解决方案。

2510.25240 2026-05-15 stat.ML cs.LG

Generative Bayesian Optimization: Generative Models as Acquisition Functions

Rafael Oliveira, Daniel M. Steinberg, Edwin V. Bonilla

发表机构 * CSIRO’s Data61（CSIRO的数据61）

AI总结本文提出了一种将生成模型用于批量贝叶斯优化（BO）的通用策略，使生成模型能够作为候选解采样器，从而实现大规模批量优化、非连续设计空间优化以及高维和组合设计优化。受直接偏好优化（DPO）成功启发，研究通过使用观测数据计算出的简单效用值训练生成模型，使其生成的分布密度与预期效用（即BO的获取函数值）成正比，避免了传统方法中构建代理模型的需求。理论分析表明，生成模型在BO过程中形成的分布序列在一定条件下可逼近最优目标，并通过高维大规模优化实验验证了方法的有效性。

Comments Published at ICLR 2026. Compared with the proceedings version on OpenReview, this version includes a minor revision to Section 3

Journal ref The Fourteenth International Conference on Learning Representations (ICLR 2026)

2510.19973 2026-05-15 cs.NI cs.AI

A Tutorial on Cognitive Biases in Agentic AI-Driven 6G Autonomous Networks

Hatim Chergui, Farhad Rezazadeh, Merouane Debbah, Christos Verikoukis

发表机构 * i2CAT Foundation（i2CAT基金会）； Hostelworld Group（Hostelworld集团）； Technical University of Catalonia (UPC)（技术大学（加泰罗尼亚））； Khalifa University of Science and Technology（卡里玛大学）； ISI/ATH ； University of Patras（帕特拉大学）

AI总结本文综述了智能体驱动的6G自组织网络中常见的认知偏差问题，分析了这些偏差的分类、数学表达及其在通信系统中的表现，并提出了针对性的缓解策略。通过两个6G网络管理场景的案例验证，研究展示了如何利用本地化大语言模型和改进的记忆机制，有效减少锚定偏差和时间确认偏差，从而提升资源分配效率，实现显著的能耗降低和延迟优化。

Comments 26 pages, 18 figures, 4 tables, link to source code available. Accepted at IEEE OJCOMS

详情

英文摘要

The path to higher network autonomy in 6G lies beyond the mere optimization of key performance indicators (KPIs), requiring systems that perceive and reason over the network environment as it is. This can be achieved through agentic AI, where large language model (LLM)-powered agents utilize multimodal telemetry, memory, and cross-domain negotiation to achieve multi-objective goals. However, deploying such agents introduces cognitive biases inherited from human design, which can severely distort reasoning and actuation. This paper provides a comprehensive tutorial on well-known cognitive biases, detailing their taxonomy, mathematical formulation, emergence in telecom systems, and tailored mitigation strategies. We validate these concepts through two distinct use-cases in 6G management. First, we tackle anchoring bias in inter-slice resource negotiation. To overcome the prohibitive execution delays of cloud-based LLMs, this use-case deploys a locally hosted 1B-parameter model on an RTX A4000 GPU, successfully achieving sub-second inference latencies compatible with near-real-time operations. By replacing fixed heuristic anchors with a Truncated Weibull randomized anchor strategy, the agents dismantle rigid biases, intelligently consume SLA slack, and dynamically double the system-wide energy savings (peaking at 25\%) without violating strict latency limits. Second, we mitigate temporal and confirmation biases in RAN-Edge cross-domain negotiation by designing an unbiased collective memory. By integrating semantic/temporal decay and an inflection bonus that actively highlights past negotiation failures, agents are prevented from over-relying on recent data or repeating past mistakes. Grounding decisions in this richer, debiased historical context yields highly robust agreements, achieving a $\times 5$ latency reduction and roughly 40\% higher energy savings compared to memoryless baselines.

URL PDF HTML ☆

赞 0 踩 0

2510.15141 2026-05-15 stat.ML cs.LG stat.AP

Manifold Dimension Estimation via Local Graph Structure

Zelong Bi, Pierre Lafaye de Micheaux

发表机构 * School of Mathematics and Statistics, University of New South Wales（新南威尔士大学数学与统计学学院）

AI总结本文提出了一种基于局部图结构的流形维度估计方法，通过在局部主成分分析坐标上进行回归来捕捉流形的局部结构。该方法引入了两个代表性估计器：二次嵌入（QE）和总最小二乘（TLS），实验表明它们在合成数据和现实数据上均具有竞争力，且在许多情况下优于现有先进方法。

2510.13583 2026-05-15 stat.ML cs.LG

On the Identifiability of Causal Graphs with the Invariance Principle

Francesco Montagna

发表机构 * Institute of Science and Technology Austria（奥地利科学与技术研究所）； Chan Zuckerberg Initiative（查兰·泽克伯格倡议）

AI总结本文研究了在独立同分布观测数据下因果图的可识别性问题，提出在结构因果模型生成的数据分布以及少量（最多两个）具有不同噪声统计特性的环境数据下，可以唯一确定因果图。该成果首次保证了在固定数量环境中恢复完整因果图的可能性，且适用于任意非线性机制，仅需噪声满足高斯性假设，并探讨了放松该假设的可能方法。研究还进一步拓展了独立成分分析与因果发现之间的对偶关系，表明在较少辅助信息条件下，因果发现可达到与非线性ICA相当的性能。

Comments Published as ICLR 2026 conference paper

2508.14950 2026-05-15 eess.IV cs.LG

Potential and challenges of generative adversarial networks for super-resolution in 4D Flow MRI

Oliver Welin Odeback, Arivazhagan Geetha Balasubramanian, Jonas Schollenberger, Edward Ferdiand, Alistair A. Young, C. Alberto Figueroa, Susanne Schnell, Outi Tammisola, Ricardo Vinuesa, Tobias Granberg, Alexander Fyrdahl, David Marlevi

发表机构 * Surgery, Karolinska Institutet , addressline= Karolinska Universitetssjukhuset Solna (L1:00) , city= Stockholm , postcode= 171 76 , country= Sweden ； organization= FLOW, Engineering Mechanics, KTH Royal Institute of Technology , addressline= Osquars Backe 18 , city= Stockholm , postcode= 100 44 , country= Sweden ； organization= Department of Radiology ； Biomedical Imaging, University of California San Francisco , addressline= 505 Parnassus Avenue , city= San Francisco , postcode= 94143 , state= CA , country= USA ； organization= Faculty of Informatics, Telkom University , addressline= Jl.Telekomunikasi No. 1, Terusan Buahbatu , city= Bandung , postcode= 40257 , state= West Java , country= Indonesia ； organization= Auckland Bioengineering Institute, University of Auckland , addressline= Bioengineering House, 70 Symonds St , city= Grafton , postcode= 1010 , country= New Zealand ； organization= School of Biomedical Engineering \& Imaging Sciences, King's College London , addressline= 1 Lambeth Palace Rd, South Bank , city= London , postcode= SE1 7EU , country= UK ； organization= Department of Biomedical Engineering, University of Michigan , addressline= 1107 Carl A. Gerstacker Bldg 2200 Bonisteel Blvd. , city= Ann Arbor , postcode= 48109-2099 , state= MI , country= USA ； organization= Department of Physics, University of Greifswald , addressline= Felix-Hausdorff-Str. 6 , city= Greifswald , postcode= 174 89 , country= Germany ； organization= Department of Aerospace Engineering, University of Michigan , addressline= 1320 Beal Avenue , city= Ann Arbor , postcode= 48109-2140 , state= MI , country= USA ； organization= Department of Neuroradiology, Karolinska University Hospital , addressline= Hälsovägen 13, O42 , city= Stockholm , postcode= 141 86 , country= Sweden ； organization= Department of Clinical Physiology, Karolinska University Hospital , addressline= Eugeniavägen 3, A8:01 , city= Solna , postcode= 171 64 , country= Sweden ； organization= Institute for Medical Engineering ； Science, Massachusetts Institute of Technology , addressline= 45 Carleton St , city= Cambridge , postcode= 02142 , state= MA , country= USA

AI总结本文研究了生成对抗网络（GAN）在4D血流磁共振成像（4D Flow MRI）超分辨率重建中的潜力与挑战。针对该技术在近壁速度测量中分辨率低、噪声大的问题，作者提出了一种专门设计的GAN架构，并在三种对抗损失函数下进行了评估。实验表明，Wasserstein GAN在提升近壁速度恢复精度和训练稳定性方面表现最优，展示了GAN在改善4D Flow MRI图像质量中的应用前景。

Comments 26 pages, 10 figures

Journal ref Computers in Biology and Medicine 211 (2026) 111745

详情

DOI: 10.1016/j.compbiomed.2026.111745

英文摘要

4D Flow Magnetic Resonance Imaging (4D Flow MRI) enables non-invasive quantification of blood flow and hemodynamic parameters. However, its clinical application is limited by low spatial resolution and noise, particularly affecting near-wall velocity measurements. Machine learning-based super-resolution has shown promise in addressing these limitations, but challenges remain, not least in recovering near-wall velocities. Generative adversarial networks (GANs) offer a compelling solution, having demonstrated strong capabilities in restoring sharp boundaries in non-medical super-resolution tasks. Yet, their application in 4D Flow MRI remains unexplored, with implementation challenged by known issues such as training instability and non-convergence. In this study, we investigate GAN-based super-resolution in 4D Flow MRI. Training and validation were conducted using patient-specific cerebrovascular in-silico models, converted into synthetic images via an MR-true reconstruction pipeline. A dedicated GAN architecture was implemented and evaluated across three adversarial loss functions: Vanilla, Relativistic, and Wasserstein. Our results demonstrate that the proposed GAN improved near-wall velocity recovery compared to a non-adversarial reference (vNRMSE: 6.9% vs. 9.6%); however, that implementation specifics are critical for stable network training. While Vanilla and Relativistic GANs proved unstable compared to generator-only training (vNRMSE: 8.1% and 7.8% vs. 7.2%), a Wasserstein GAN demonstrated optimal stability and incremental improvement (vNRMSE: 6.9% vs. 7.2%). The Wasserstein GAN further outperformed the generator-only baseline at low SNR (vNRMSE: 8.7% vs. 10.7%). These findings highlight the potential of GAN-based super-resolution in enhancing 4D Flow MRI, particularly in challenging cerebrovascular regions, while emphasizing the need for careful selection of adversarial strategies.

URL PDF HTML ☆

赞 0 踩 0

2508.07876 2026-05-15 stat.ML cs.LG math.DS math.ST stat.TH

Stochastic dynamics learning with state-space systems

Juan-Pablo Ortega, Florian Rossmannek

发表机构 * Division of Mathematical Sciences, School of Physical and Mathematical Sciences, Nanyang Technological University（数学科学系，物理与数学科学学院，南洋理工大学）

AI总结本文研究了状态空间系统在随机动态学习中的特性，旨在深化对脉冲神经网络计算（RC）理论基础的理解。通过统一处理确定性和随机性场景下的记忆衰减和回声状态属性（ESP），作者证明了即使在缺乏ESP的情况下，记忆衰减和解的稳定性也具有普遍性，从而为RC模型的广泛应用提供了理论支持。在随机情形下，文章引入了基于概率分布吸引子动力学的新视角，拓展了非自主动力系统的相关研究，为RC模型在因果性、稳定性与记忆特性方面提供了更深入的见解。

Journal ref Mathematical Models and Methods in Applied Sciences, 2026

2508.03941 2026-05-15 cs.IR cs.LG

Measuring the stability and plasticity of recommender systems

Maria João Lavoura, Robert Jungnickel, João Vinagre

发表机构 * Independent researcher（独立研究者）； Joint Research Centre - European Commission（欧洲委员会联合研究中心）

AI总结本文研究了推荐系统在长期运行中的稳定性与可塑性问题，提出了一个离线评估方法，用于分析推荐模型在重新训练时的行为表现。该方法从模型保留历史模式（稳定性）和适应新变化（可塑性）两个方面对算法进行评估，提供了一种与数据集、算法和指标无关的长期性能分析框架。实验结果表明，不同类型的推荐算法在稳定性和可塑性上存在差异，并可能存在两者之间的权衡关系。

Comments Final version published in the proceedings of ACM UMAP 2026: https://doi.org/10.1145/3774935.3812707

2507.13941 2026-05-15 q-bio.NC cs.AI cs.CV eess.IV

Shared representations in brains and models reveal a two-route cortical organization during scene perception

Pablo Marcos-Manchón, Lluís Fuentemilla

发表机构 * Department of Cognition, Development and Education Psychology, Faculty of Psychology, University of Barcelona（认知、发展与教育心理学系，心理学学院，巴塞罗那大学）； Institute of Neurosciences, University of Barcelona（神经科学研究所，巴塞罗那大学）； Bellvitge Institute for Biomedical Research（Bellvitge生物医学研究 institute）

AI总结该研究通过分析7T fMRI数据，探讨了人类大脑在场景感知过程中信息的组织与传递路径。研究利用表征相似性分析，比较了个体间共享的脑区表征结构与视觉和语言神经网络的层次特征，发现大脑存在两条分离的处理通路：一条负责场景布局与环境背景，另一条专门处理生物内容。这一发现深化了对视觉信息处理的经典模型，揭示了场景感知是一个由多个可区分表征路径组成的分布式脑网络。

Comments for associate code, see https://github.com/memory-formation/convergent-transformations

2507.05193 2026-05-15 eess.IV cs.CV

RAM-W600: A Multi-Task Wrist Dataset and Benchmark for Rheumatoid Arthritis

Songxiao Yang, Haolin Wang, Yao Fu, Ye Tian, Tamotsu Kamishima, Masayuki Ikebe, Yafei Ou, Masatoshi Okutomi

发表机构 * Institute of Science Tokyo（东京科学研究所）； Hokkaido University（北海道大学）； The University of Tokyo（东京大学）

AI总结该研究提出了一种名为RAM-W600的多任务腕关节X光图像数据集，用于类风湿性关节炎（RA）的辅助诊断与疾病监测。该数据集包含来自六个医疗中心的388名患者的1048张腕部常规X光图像，提供了像素级的腕骨实例分割标注和SvdH骨侵蚀评分，是首个公开的腕骨实例分割资源。该数据集有助于推动RA相关研究，如关节间隙狭窄量化、骨侵蚀检测、骨变形评估等，并可能应用于腕部骨折定位等任务，有望降低腕部RA研究的门槛，促进计算机辅助诊断技术的发展。

Comments Published in NeurIPS 2025

2506.20425 2026-05-15 stat.ML cs.LG stat.CO stat.ME

Scalable Subset Selection in Linear Mixed Models

Ryan Thompson, Matt P. Wand, Joanna J. J. Wang

发表机构 * School of Mathematical and Physical Sciences, University of Technology Sydney（技术与物理科学学院，悉尼技术大学）

AI总结本文研究了在包含固定效应和随机效应的线性混合模型中如何高效地进行可扩展的子集选择问题。为了解决现有方法在处理大量预测变量时计算效率低下的问题，作者提出了一种基于 $\ell_0$ 正则化的新型子集选择方法，并结合坐标下降算法和局部搜索算法以实现快速收敛和非凸优化的高效求解。该方法在统计上提供了有限样本下的KL散度界，并在合成和真实数据实验中表现出优越的性能。

2505.16714 2026-05-15 quant-ph cs.LG

Experimental robustness benchmarking of quantum neural networks on a superconducting quantum processor

Hai-Feng Zhang, Zhao-Yun Chen, Peng Wang, Liang-Liang Guo, Tian-Le Wang, Xiao-Yan Yang, Ren-Ze Zhao, Ze-An Zhao, Sheng Zhang, Lei Du, Hao-Ran Tao, Zhi-Long Jia, Wei-Cheng Kong, Huan-Yu Liu, Athanasios V. Vasilakos, Yang Yang, Yu-Chun Wu, Ji Guan, Peng Duan, Guo-Ping Guo

发表机构 * Laboratory of Quantum Information, School of Physics, University of Science（量子信息实验室，物理学院，科学大学）； CAS Center For Excellence in Quantum Information（中国科学院量子信息卓越中心）； Quantum Physics, University of Science（量子物理，科学大学）； Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, Anhui, 230088, China（人工智能研究所，合肥国家综合性科学中心，合肥，安徽，230088，中国）； Department of ICT（信息与通信技术系）； Center for AI Research, University of Agder (UiA), Jon Lilletuns vei 9, 4879 Grimstad, Norway（人工智能研究中心，阿格德大学（UiA），Jon Lilletuns vei 9，4879 Grimstad，挪威）； Anhui University, Hefei, Anhui, 230039, China（安徽大学，合肥，安徽，230039，中国）； Key Laboratory of System Software (Chinese Academy of Sciences)（系统软件重点实验室（中国科学院））； State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China（计算机科学国家重点实验室，软件研究所，中国科学院，北京100190，中国）

AI总结本研究首次在超导量子处理器上对20量子比特的量子神经网络分类器进行了系统的实验鲁棒性评估，揭示了量子机器学习模型在对抗攻击下的安全性问题。研究提出了一种高效的对抗攻击算法，用于量化评估量子神经网络的鲁棒性，并验证了对抗训练能够通过正则化输入梯度显著提升其鲁棒性。实验还表明，与经典神经网络相比，量子神经网络具有更强的对抗鲁棒性，这归因于其固有的量子噪声，并且实验结果与理论下界高度吻合，验证了攻击方法的有效性与鲁棒性界限的紧致性。

Comments There are 8 pages with 5 figures in the main text

Journal ref SCIENCE CHINA Physics, Mechanics & Astronomy Volume 69, Issue 6: 260315 (2026)

2505.09552 2026-05-15 stat.ME cs.LG stat.ML

Scalable Krylov Subspace Methods for Generalized Mixed-Effects Models with Crossed Random Effects

Pascal Kündig, Fabio Sigrist

发表机构 * Lucerne University of Applied Sciences and Arts（卢塞恩应用科学与艺术大学）； Seminar for Statistics, ETH Zurich（苏黎世联邦理工学院统计研究所）； University of Basel（巴塞尔大学）

AI总结该论文针对具有交叉随机效应的广义混合效应模型中的计算瓶颈问题，提出了一种基于Krylov子空间的方法，有效提升了高维数据下的计算效率。研究通过理论分析和实验验证，展示了预条件随机Lanczos拟合和共轭梯度方法在收敛性和数值稳定性方面的优势，并开发了可扩展的预测方差计算方法。实验表明，新方法相比传统的Cholesky分解方法，在速度和稳定性上均有显著提升。

2505.09246 2026-05-15 cs.IR cs.AI cs.CL

Autofocus Retrieval: An Effective Pipeline for Multi-Hop Question Answering With Semi-Structured Knowledge

Derian Boer, Stephen Roth, Stefan Kramer

发表机构 * Institute of Computer Science（计算机科学研究所）； Johannes Gutenberg University Mainz（美因茨约翰内斯·古腾堡大学）

AI总结本文提出了一种基于半结构化知识库的多跳问答框架Autofocus-Retriever（AF-Retriever），旨在有效结合结构化和非结构化信息进行问答。该方法通过引入可交换的大语言模型提取实体属性和关系约束，并结合向量相似度搜索与增量范围扩展策略，实现了在多个基准测试中优于现有方法的零样本和少样本性能。其核心贡献在于通过四步约束驱动的检索与四步补充排序流程，显著提升了答案检索的准确性和鲁棒性。

Journal ref Transactions on Machine Learning Research 2026

2504.11703 2026-05-15 cs.CR cs.AI

Progent: Securing AI Agents with Privilege Control

Tianneng Shi, Jingxuan He, Zhun Wang, Hongwei Li, Linyu Wu, Wenbo Guo, Dawn Song

发表机构 * UC Berkeley（加州大学伯克利分校）； UC Santa Barbara（加州大学圣巴巴拉分校）； National University of Singapore（新加坡国立大学）

AI总结 AI代理通过调用工具与外部环境交互，容易受到如间接提示注入等攻击，导致未经授权的操作。为此，本文提出Progent框架，通过特权控制机制增强AI代理的安全性。Progent将特权表示为基于工具名称和参数的符号化安全策略，通过确定性过程检查每个工具调用，确保最小特权原则。该框架利用大型语言模型自动生成并动态更新策略，并结合SMT求解器保证策略更新的单调性，从而在保障实用性的前提下有效防止权限升级，实验表明其在多个基准测试中显著降低了攻击成功率。

2504.01571 2026-05-15 cs.GR cs.AI cs.CV cs.LG

Pro-DG: Procedural Diffusion Guidance for Architectural Facade Generation

Aleksander Plocharski, Jan Swidzinski, Przemyslaw Musialski

发表机构 * Warsaw University of Technology（华沙技术大学）； Akces NCBR ； Imperial College London（伦敦帝国理工学院）； New Jersey Institute of Technology（新泽西理工学院）

AI总结本文提出了一种基于过程化扩散引导（Pro-DG）的建筑立面生成方法，通过在稳定扩散框架中引入分层过程化规则生成控制图，从而生成逼真的建筑立面图像。该方法从单张输入图像及其分割结果出发，利用逆过程模块识别立面的分层布局，并结合结构特征设计了一种新的ControlNet流程，实现由过程化变换引导的立面图像生成。该方法能够精确控制局部外观并进行大规模结构编辑，实验表明其在保持建筑风格和实现可控编辑方面优于现有方法。

Comments 17 pages, 15 figures, Computer Graphics Forum 2026 Journal Paper

2501.18756 2026-05-15 stat.ML cs.LG math.OC

A Unified Framework for Entropy Search and Expected Improvement in Bayesian Optimization

Nuojin Cheng, Leonard Papenmeier, Stephen Becker, Luigi Nardi

发表机构 * Department of Applied Mathematics, University of Colorado Boulder（科罗拉多大学波尔得分校应用数学系）； Department of Computer Science, Lund University（吕勒欧大学计算机科学系）

AI总结本文提出了一种统一的理论框架——变分熵搜索（Variational Entropy Search），揭示了预期改进（EI）与基于信息论的获取函数之间的深层联系，挑战了它们本质不同的传统观点。研究通过将EI解释为最大值熵搜索（MES）的变分近似，提出了一个新的获取函数VES-Gamma，该方法在合成和现实世界的低维与高维基准测试中表现出色，优于现有的EI和MES方法。

Journal ref Proceedings of the 42nd International Conference on Machine Learning, PMLR 267:10106-10120, 2025

2410.03280 2026-05-15 eess.AS cs.AI cs.LG eess.SP

Manikin-Recorded Cardiopulmonary Sounds Dataset Using Digital Stethoscope

Yasaman Torabi, Shahram Shirani, James P. Reilly

发表机构 * Electrical and Computer Engineering Department, McMaster University（麦斯特大学电气与计算机工程系）

AI总结该研究提出了一种使用数字听诊器录制的心肺声音数据集，包含正常及多种异常心肺音，如杂音、心律失常和呼吸音等。数据集通过临床模拟人采集，涵盖了不同身体部位的单独和混合声音，并经过频率滤波处理以增强特定声音类型。该数据集为人工智能在心肺疾病自动检测、声音分类及深度学习等领域的研究提供了重要的资源。

Journal ref IEEE Data Descriptions, vol. 2, pp. 133-140, 2025

2410.02091 2026-05-15 cs.SE cs.AI cs.HC econ.GN q-fin.EC

The Impact of Generative AI on Collaborative Open-Source Software Development: Evidence from GitHub Copilot

Fangchen Song, Ashish Agarwal, Wen Wen

发表机构 * University of Texas at Austin（德克萨斯大学奥斯汀分校）

AI总结本研究探讨了生成式人工智能（AI）对协作式开源软件（OSS）开发的影响，重点分析了GitHub Copilot这一AI编程助手在GitHub开源项目中的实际作用。研究发现，使用Copilot可使项目层面的代码贡献量提升5.9%，主要源于开发者参与度和个体生产力的提高，但同时也带来了8%的协调时间增加。研究还指出，AI对核心开发者和外围开发者的影响存在差异，为理解AI在开源社区中的长期影响提供了重要参考。

2404.13649 2026-05-15 stat.ML cs.LG stat.ME

Distributional Principal Autoencoders

Xinwei Shen, Nicolai Meinshausen

发表机构 * Department of Statistics, University of Washington（华盛顿大学统计系）

AI总结本文提出了一种名为分布主成分自编码器（DPA）的降维方法，旨在在重建数据时保留原始数据的分布特性。该方法通过学习数据在低维潜在变量条件下的条件分布，使得重建数据与原始数据在分布上一致。实验表明，DPA在气候数据、单细胞数据和图像数据上均能有效保留数据的原始分布和重要结构特征。

2303.14511 2026-05-15 hep-ex cs.AI cs.LG hep-ph physics.data-an

Improving robustness of jet tagging algorithms with adversarial training: exploring the loss surface

Annika Stein

发表机构 * Center for Theoretical Physics, Sloane Physics Laboratory, Yale University（理论物理中心，斯洛恩物理实验室，耶鲁大学）； III. Physics Institute A, RWTH Aachen University（物理研究所A，亚琛工业大学）

AI总结本文研究了如何通过对抗训练提高高能物理中喷注分类算法的鲁棒性，重点分析了输入特征微小扰动对模型性能的影响。作者通过探索损失函数的几何结构，揭示了模型在面对系统性不确定性时的稳健性机制，并提出了一种在保持高性能的同时增强模型鲁棒性的对抗训练方法。

Comments 5 pages, 2 figures; submitted to ACAT 2022 proceedings

Journal ref 2026 J. Phys.: Conf. Ser. 3206 012085

2211.16113 2026-05-15 cs.NE cs.LG

Timing-Based Backpropagation in Spiking Neural Networks Without Single-Spike Restrictions

Kakei Yamamoto, Yusuke Sakemi, Kazuyuki Aihara

发表机构 * University of Tokyo（东京大学）； Research Center for Mathematical Engineering（数学工程研究中心）； University of Tokyo Institutes for Advanced Study（东京大学先进研究机构）

AI总结本文提出了一种无需单次放电限制的新型反向传播算法，用于训练脉冲神经网络（SNNs），该算法通过单个神经元的多个脉冲时间相对关系来编码信息。与传统方法不同，该方法允许每个神经元多次放电，从而提升了网络的计算能力，并在多个任务中达到了与非卷积人工神经网络相当的准确率。研究还发现，网络的脉冲数量特性依赖于突触后电流和膜电位的时间常数，并存在一个最优时间常数以实现最高测试准确率，这一现象在传统基于单次放电的时间编码方法中未被观察到。

Comments 10 pages, 5 figures

Journal ref 2024 International Joint Conference on Neural Networks (IJCNN), Yokohama, Japan, 2024, pp. 1-9

2202.05568 2026-05-15 stat.ML cs.IT cs.LG math.IT math.PR math.ST stat.TH

Change of measure through the Legendre transform

Antoine Picard-Weibel, Benjamin Guedj

发表机构 * Suez, CIRSEE, France（苏伊士，CIRSEE，法国）； University College London, United Kingdom（伦敦大学学院，英国）； Inria, France（法国国家信息与自动化技术研究所）

AI总结本文研究了通过Legendre变换实现测度变化的方法，用于推导PAC-Bayes泛化界。作者结合Legendre变换与Fenchel-Young不等式，基于$f$-散度构建了测度变化不等式，拓展了传统Donsker-Varadhan定理的条件。该方法为学习理论提供了更灵活的分析工具，能够在更广泛的假设条件下建立PAC-Bayes保证。

Comments 27 pages

2605.14188 2026-05-15 quant-ph cs.CL cs.DL physics.atom-ph

QOuLiPo: What a quantum computer sees when it reads a book

Christophe Jurczak

发表机构 * Quantonation

AI总结本文研究了量子计算机如何“阅读”书籍，通过将八部文艺复兴时期的经典著作输入中性原子量子处理器，将文本结构转化为图结构，从而探索量子硬件对文本的处理方式。研究引入了“刚性 rho”指标，用于衡量书籍结构的独特性，并反向设计文本结构以匹配量子硬件的图结构，生成名为 QOuLiPo 的新文本集合，为量子处理器的性能评估提供基准。该工作为数字人文领域提供了与量子计算结合的新方法，并展示了量子处理器在处理复杂文本结构上的潜力。

详情

英文摘要

What does a book look like to a quantum computer? This paper takes eight classical works of the Renaissance and its late-antique inheritance -- from Augustine to Galileo -- and runs each through a neutral-atom quantum processor. The bridge is graphs: each textual unit becomes an atom, and graph edges are physical blockade constraints for engineered exact unit-disk designs, or a 2D approximation to the semantic graph for natural texts. Three contributions follow. First, we introduce rigidity rho, a metric for how unique a book's structural backbone is -- distinguishing Marguerite de Navarre's Heptameron (rigid, twelve-nouvelle hard core) from Boethius (fully fungible, every chapter substitutable). Second, we invert the pipeline: rather than extracting a graph from existing prose, we pick a target graph the hardware encodes natively, and write a book whose structure matches it. The twenty-nine texts written this way, collected under the name QOuLiPo, extend the OuLiPo tradition to graph-topological constraints and, together with the eight natural texts, form a benchmark distribution against which neutral-atom hardware can be tracked as it scales. Third, we run both natural and engineered texts on Pasqal's FRESNEL processor up to one hundred atoms; engineered texts reach high approximation ratios, the cleanest instances returning the exact backbone. A cloud-accessible quantum machine plus an agentic coding environment now lets a single investigator run this pipeline end-to-end. What is reported is an application layer, not a speedup -- humanistic instances ready to load onto neutral-atom processors as they scale, already complementing classical text analysis. The Digital Humanities community has a stake in building familiarity with this hardware now: the engineered-corpus design choices made today fix the benchmark distribution future hardware will be measured against.

URL PDF HTML ☆

赞 0 踩 0

2605.14177 2026-05-15 cs.IR cs.AI cs.CL

Thinking Ahead: Prospection-Guided Retrieval of Memory with Language Models

Harshita Chopra, Krishna Kant Chintalapudi, Suman Nath, Ryen W. White, Chirag Shah

发表机构 * University of Washington（华盛顿大学）； Microsoft Research（微软研究院）

AI总结本文研究了如何通过前瞻思维引导语言模型从长期对话历史中检索用户特定的事实，以提升个性化对话系统的性能。为了解决传统检索方法依赖语义相似度而难以发现远距离相关事实的问题，作者提出了基于前瞻引导的检索方法（PGR），通过构建可能的未来步骤作为检索探针，从而更有效地挖掘用户历史中相关但不易被传统方法发现的记忆。实验表明，该方法在多个基准测试中显著提升了检索效果和响应质量。

Comments Preprint

详情

英文摘要

Long-horizon personalization requires dialogue assistants to retrieve user-specific facts from extended interaction histories. In practice, many relevant facts often have low semanticsimilarity to the query under dense retrieval. Standard Retrieval-Augmented Generation (RAG) and GraphRAG systems are still largely retrospective: they rely on embedding similarity to the query or on fixed graph traversals, so they often miss facts that matter for the user's needs but lie far from the query in embedding space. Inspired by prospection, the human ability to use imagined futures as cues for recall, we introduce Prospection-Guided Retrieval (PGR), which decouples retrieval from how memories are stored. Given a user query, PGR first expands the goal into a short Tree-of-Thought (ToT) or linear chain of plausible next steps, and uses these steps as retrieval probes rather than relying on the original query alone. The facts retrieved by these probes are then used to personalize the next round of prospection, enabling PGR to uncover additional memories that become relevant only after the simulation is grounded in the user's history. We also introduce MemoryQuest, a challenging multi-session benchmark in which each query is annotated with 3--5 dated reference facts subject to a low query-reference similarity constraint. Across 1,625 queries spanning 185 user profiles from 3 publicly available datasets, PGR-TOT substantially improves retrieval, including nearly 3x recall on MemoryQuest over the strongest baseline. In pairwise LLM-as-judge comparisons against baselines, PGR-generated responses are preferred on 89--98% of queries, with blinded human annotations on held-out subsets showing the same trend. Overall, the results demonstrate that explicit prospection yields large gains in long-horizon retrieval and response quality relative to similarity-only baselines.

URL PDF HTML ☆

赞 0 踩 0

2605.14153 2026-05-15 cs.CR cs.AI

ExploitBench: A Capability Ladder Benchmark for LLM Cybersecurity Agents

Seunghyun Lee, David Brumley

发表机构 * Carnegie Mellon University（卡内基梅隆大学）； Bugcrowd

AI总结本文提出ExploitBench，一个用于评估大语言模型（LLM）在网络安全领域能力的分级基准，将漏洞利用过程分解为16个可衡量的阶段，从代码崩溃到完全控制目标系统。该基准通过确定性验证机制，准确评估模型在不同阶段的表现。实验基于41个V8漏洞进行，结果显示当前公开部署的前沿模型在触发漏洞和崩溃方面表现良好，但在实现任意代码执行等高级能力上仍有明显不足，而私有模型则表现出更强的利用能力。

2605.14142 2026-05-15 stat.ML cs.LG stat.CO

To discretize continually: Mean shift interacting particle systems for Bayesian inference

Ayoub Belhadji, Daniel Sharp, Youssef M. Marzouk

发表机构 * Center for Computational Science and Engineering（计算科学中心）； Laboratory for Information and Decision Systems（信息与决策系统实验室）； Massachusetts Institute of Technology（麻省理工学院）

AI总结本文提出了一种基于最大均值差异（MMD）最小化的交互粒子系统，用于在已知非归一化密度的情况下近似概率分布的积分。该方法扩展了经典均值漂移算法和经验分布最优量化算法，适用于连续分布，并且不受未知归一化常数的影响，支持无梯度和有梯度的实现方式。实验表明，该方法在多模态混合、贝叶斯分层模型、受PDE约束的反问题等多种采样任务中表现出良好的收敛性、多模态捕捉能力和高维扩展性。

2605.14123 2026-05-15 eess.IV cs.CV

Keyed Nonlinear Transform: Lightweight Privacy-Enhancing Feature Sharing for Medical Image Analysis

Haebom Lee, Gyeongjung Kim

发表机构 * OOLU Soft Co., Ltd.（OOLU软件有限公司）

AI总结本文提出了一种名为Keyed Nonlinear Transform（KNT）的轻量级特征转换方法，用于在医疗图像分析中增强隐私保护，解决特征共享过程中患者身份信息泄露的问题。该方法通过密钥条件的非线性变换对中间特征进行混淆，有效降低了特征的可重新识别性，同时保持了模型的分类性能和计算效率。实验表明，KNT在不重新训练模型的前提下，显著提升了隐私保护水平，并适用于多种医学图像任务。

AI 大模型

视觉与机器人

科学与医疗

Optimizing PyTorch Inference with LLM-Based Multi-Agent Systems

From Ranking to Reasoning: Explainable Web API Recommendation via Semantic Reasoning

A New Framework for Convex Clustering in Kernel Spaces: Finite Sample Bounds, Consistency and Performance Insights

Generative Bayesian Optimization: Generative Models as Acquisition Functions

A Tutorial on Cognitive Biases in Agentic AI-Driven 6G Autonomous Networks

Manifold Dimension Estimation via Local Graph Structure

On the Identifiability of Causal Graphs with the Invariance Principle

Potential and challenges of generative adversarial networks for super-resolution in 4D Flow MRI

Stochastic dynamics learning with state-space systems

Measuring the stability and plasticity of recommender systems

Shared representations in brains and models reveal a two-route cortical organization during scene perception

RAM-W600: A Multi-Task Wrist Dataset and Benchmark for Rheumatoid Arthritis

Scalable Subset Selection in Linear Mixed Models

Experimental robustness benchmarking of quantum neural networks on a superconducting quantum processor

Scalable Krylov Subspace Methods for Generalized Mixed-Effects Models with Crossed Random Effects

Autofocus Retrieval: An Effective Pipeline for Multi-Hop Question Answering With Semi-Structured Knowledge

Progent: Securing AI Agents with Privilege Control

Pro-DG: Procedural Diffusion Guidance for Architectural Facade Generation

A Unified Framework for Entropy Search and Expected Improvement in Bayesian Optimization

Manikin-Recorded Cardiopulmonary Sounds Dataset Using Digital Stethoscope

The Impact of Generative AI on Collaborative Open-Source Software Development: Evidence from GitHub Copilot

Distributional Principal Autoencoders

Improving robustness of jet tagging algorithms with adversarial training: exploring the loss surface

Timing-Based Backpropagation in Spiking Neural Networks Without Single-Spike Restrictions

Change of measure through the Legendre transform

QOuLiPo: What a quantum computer sees when it reads a book

Thinking Ahead: Prospection-Guided Retrieval of Memory with Language Models

ExploitBench: A Capability Ladder Benchmark for LLM Cybersecurity Agents

To discretize continually: Mean shift interacting particle systems for Bayesian inference

Keyed Nonlinear Transform: Lightweight Privacy-Enhancing Feature Sharing for Medical Image Analysis