arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 3405
2512.18508 2026-05-26 stat.ME cs.AI cs.SY eess.SP eess.SY

Selection-Induced Contraction of Innovation Statistics in Gated Kalman Filters

门控卡尔曼滤波中创新统计量的选择诱导收缩

Barak Or

发表机构 * metaor artificial intelligence(metaor人工智能) Google Reichman Tech School(谷歌Reichman技术学校) Reichman University(Reichman大学)

AI总结 本文证明在门控卡尔曼滤波中,经过门控后的创新统计量收敛于门控条件而非名义量,并推导了椭球门控下创新的一阶和二阶矩的精确表达式,揭示了门控引起的确定性协方差收缩,并扩展至最近邻关联分析。

Comments 9 pages, preprint

详情
AI中文摘要

验证门控是经典卡尔曼跟踪系统的基本组成部分。只有归一化创新平方(NIS)低于规定阈值的测量值才被考虑用于状态更新。虽然这个过程在统计上基于卡方分布,但它隐含地将无条件创新过程替换为条件观测过程,仅限于验证事件。本文表明,门控后计算的创新统计量收敛于门控条件量而非名义量。在线性高斯假设下,我们推导了椭球门控条件下创新的一阶和二阶矩的精确表达式,并证明门控会引起创新协方差的确定性、维度依赖的收缩。分析扩展至最近邻(NN)关联,后者被证明是一个额外的统计选择算子。我们证明,在多个门内测量中选择最小范数创新会引入不可避免的能量收缩,这意味着在非平凡门控和关联下,名义创新统计量无法保持。二维情况下的闭式结果量化了组合效应并说明了其实际意义。

英文摘要

Validation gating is a fundamental component of classical Kalman-based tracking systems. Only measurements whose normalized innovation squared (NIS) falls below a prescribed threshold are considered for state update. While this procedure is statistically motivated by the chi-square distribution, it implicitly replaces the unconditional innovation process with a conditionally observed one, restricted to the validation event. This paper shows that innovation statistics computed after gating converge to gate-conditioned rather than nominal quantities. Under classical linear--Gaussian assumptions, we derive exact expressions for the first- and second-order moments of the innovation conditioned on ellipsoidal gating, and show that gating induces a deterministic, dimension-dependent contraction of the innovation covariance. The analysis is extended to NN association, which is shown to act as an additional statistical selection operator. We prove that selecting the minimum-norm innovation among multiple in-gate measurements introduces an unavoidable energy contraction, implying that nominal innovation statistics cannot be preserved under nontrivial gating and association. Closed-form results in the two-dimensional case quantify the combined effects and illustrate their practical significance.

2512.10961 2026-05-26 cs.HC cs.AI

AI as Equalizer or Amplifier? Task Complexity as the Moderating Factor for Human Expertise in Hybrid Intelligence Systems

AI是均衡器还是放大器?任务复杂性作为混合智能系统中人类专业知识的调节因素

Tao An

发表机构 * Hawaii Pacific University(夏威夷太平洋大学)

AI总结 本文提出AI在常规任务中均衡表现,在复杂任务中放大专家与新手差距,并构建了人类贡献层次与参与层次的框架,强调领域知识而非提示工程决定放大效果。

Comments 9 pages, 3 figures, 1 table. v2 matches the camera-ready version accepted at HHAI 2026. Removed v1 aggregated projections (training timeline figure, n=580). Empirical basis is structured field observations of 10 to 20 colleagues at a single organization (Beijing Feimu) since mid-2024. Conceptual framework unchanged. To appear in Frontiers in Artificial Intelligence and Applications (IOS Press)

详情
AI中文摘要

越来越多的实证研究表明,生成式AI缩小了新手与专家在常规任务上的表现差距——即所谓的“均衡器”效应。本文挑战了这一结论的普遍性。基于认知增强理论、专家-新手研究以及对一个小型软件产品团队内部生成式AI使用的结构化观察,我们认为AI主要作为认知放大器:其输出质量根本上取决于指导它的人类专业知识。我们提出了一个包含人类贡献的三个层次(问题定义、质量评估、迭代优化)和三个参与级别(被动接受、迭代协作、认知指导)的框架,证明领域知识——而非提示工程技能——决定了放大效果。我们通过提出AI在结构良好的常规任务上均衡表现,而在需要深度判断的复杂任务上放大已有差异,来调和均衡器与放大器的观点。这种调和直接影响了人机混合系统的设计:我们应构建奖励和发展专业知识的AI,而非取代专业知识的AI。我们为HHAI社区提出了一个研究议程,聚焦于专业知识敏感的AI设计、自适应协作界面以及AI增强工作中人类能力发展的纵向研究。

英文摘要

A growing body of empirical research suggests that generative AI narrows performance gaps between novice and expert workers on routine tasks--the so-called "equalizer" effect. This paper challenges the generality of that conclusion. Drawing on cognitive augmentation theory, expert-novice research, and structured observations of in-house generative-AI use across a small software product team, we argue that AI functions primarily as a cognitive amplifier: a system whose output quality depends fundamentally on the expertise of the human who directs it. We present a framework comprising three layers of human contribution (problem definition, quality evaluation, iterative refinement) and three levels of engagement (passive acceptance, iterative collaboration, cognitive direction), demonstrating that domain expertise--not prompt engineering skill--determines amplification effectiveness. We reconcile the equalizer and amplifier perspectives by proposing that AI equalizes performance on well-structured, routine tasks while amplifying pre-existing differences on complex tasks requiring deep judgment. This reconciliation carries direct implications for hybrid human-AI system design: rather than building AI that replaces expertise, we should build AI that rewards and develops it. We outline a research agenda for the HHAI community centered on expertise-sensitive AI design, adaptive collaboration interfaces, and longitudinal studies of human capability development in AI-augmented work.

2512.05791 2026-05-26 physics.med-ph cs.CV cs.LG math.PR

Fast and Robust Diffusion Posterior Sampling for MR Image Reconstruction Using the Preconditioned Unadjusted Langevin Algorithm

使用预条件未调整朗之万算法实现快速且鲁棒的MR图像重建扩散后验采样

Moritz Blumenthal, Tina Holliber, Jonathan I. Tamir, Martin Uecker

发表机构 * Institute of Biomedical Imaging, Graz University of Technology, Graz, Austria Department of Radiology, Boston Children's Hospital, Harvard Medical School, Boston, USA Chandra Family Department of Electrical Engineering, University of Texas at Austin, USA Department of Diagnostic Medicine, Dell Medical School, University of Texas at Austin, USA

AI总结 针对MR图像重建中扩散后验采样速度慢和参数调优问题,提出基于预条件未调整朗之万算法的精确似然方法,实现快速收敛且无需调参的鲁棒采样。

Comments Submitted to Magnetic Resonance in Medicine

详情
AI中文摘要

目的:结合未调整朗之万算法(ULA)与扩散模型,可以从高度欠采样的k空间数据生成高质量MRI重建结果并附带不确定性估计。然而,扩散后验采样(DPS)或似然退火等采样方法存在重建时间长和需要参数调优的问题。本文旨在开发一种具有快速收敛性的鲁棒采样算法。 理论与方法:在用于后验采样的反向扩散过程中,精确似然与所有噪声尺度下的扩散先验相乘。为克服收敛缓慢的问题,采用了预条件技术。该方法在fastMRI数据上训练,并在健康志愿者的回顾性欠采样脑部数据上测试。 结果:对于笛卡尔和非笛卡尔加速MRI的后验采样,新方法在重建速度和样本质量上均优于退火采样和DPS。 结论:所提出的预条件精确似然方法能够在各种MRI重建任务中实现快速可靠的后验采样,无需参数调优。

英文摘要

Purpose: The Unadjusted Langevin Algorithm (ULA) in combination with diffusion models can generate high quality MRI reconstructions with uncertainty estimation from highly undersampled k-space data. However, sampling methods such as diffusion posterior sampling (DPS) or likelihood annealing suffer from long reconstruction times and the need for parameter tuning. The purpose of this work is to develop a robust sampling algorithm with fast convergence. Theory and Methods: In the reverse diffusion process used for sampling the posterior, the exact likelihood is multiplied with the diffused prior at all noise scales. To overcome the issue of slow convergence, preconditioning is used. The method is trained on fastMRI data and tested on retrospectively undersampled brain data of a healthy volunteer. Results: For posterior sampling in Cartesian and non-Cartesian accelerated MRI the new approach outperforms annealed sampling and DPS in terms of reconstruction speed and sample quality. Conclusion: The proposed exact likelihood with preconditioning enables rapid and reliable posterior sampling across various MRI reconstruction tasks without the need for parameter tuning.

2511.18794 2026-05-26 cs.GR cs.CV

ChronoGS: Disentangling Invariants and Changes in Multi-Period Scenes

ChronoGS:多时期场景中不变性与变化的解耦

Zhongtao Wang, Jiaqi Dai, Qingtian Zhu, Yilong Li, Mai Su, Fei Zhu, Meng Gai, Shaorong Wang, Chengwei Pan, Yisong Chen, Guoping Wang

发表机构 * Peking University(北京大学) Beijing Forestry University(北京林业大学) The University of Tokyo(东京大学) Beihang University(北航)

AI总结 提出ChronoGS,一种时间调制的高斯表示方法,通过统一锚点支架重建多时期场景,并解耦稳定与演化组件,实现时间一致的重建,同时发布ChronoScene基准数据集。

Comments CVPR26 Highlight

详情
AI中文摘要

多时期图像集合在现实应用中很常见。城市为测绘而重新扫描,建筑工地为进度跟踪而再次访问,自然区域为环境变化而监测。这些数据形成多时期场景,其中几何和外观会演变。重建此类场景是一个重要但尚未充分探索的问题。现有管线依赖于不兼容的假设:静态和野外方法强制单一几何,而动态方法假设平滑运动,两者在长期、不连续变化下均失败。为解决此问题,我们引入ChronoGS,一种时间调制的高斯表示,它在统一锚点支架内重建所有时期。它还被设计为解耦稳定和演化组件,实现多时期场景的时间一致重建。为促进相关研究,我们发布ChronoScene数据集,一个真实和合成多时期场景的基准,捕捉几何和外观变化。实验表明,ChronoGS在重建质量和时间一致性上始终优于基线。我们的代码和ChronoScene数据集公开于https://github.com/ZhongtaoWang/ChronoGS。

英文摘要

Multi-period image collections are common in real-world applications. Cities are re-scanned for mapping, construction sites are revisited for progress tracking, and natural regions are monitored for environmental change. Such data form multi-period scenes, where geometry and appearance evolve. Reconstructing such scenes is an important yet underexplored problem. Existing pipelines rely on incompatible assumptions: static and in-the-wild methods enforce a single geometry, while dynamic ones assume smooth motion, both failing under long-term, discontinuous changes. To solve this problem, we introduce ChronoGS, a temporally modulated Gaussian representation that reconstructs all periods within a unified anchor scaffold. It's also designed to disentangle stable and evolving components, achieving temporally consistent reconstruction of multi-period scenes. To catalyze relevant research, we release ChronoScene dataset, a benchmark of real and synthetic multi-period scenes, capturing geometric and appearance variation. Experiments demonstrate that ChronoGS consistently outperforms baselines in reconstruction quality and temporal consistency. Our code and the ChronoScene dataset are publicly available at https://github.com/ZhongtaoWang/ChronoGS.

2511.03963 2026-05-26 stat.ML cs.LG

Robust inference using density-powered Stein operators

使用密度驱动的Stein算子进行稳健推断

Shinto Eguchi

发表机构 * The Institute of Statistical Mathematics(统计数学研究所)

AI总结 提出基于γ-散度的γ-Stein算子,通过密度加权实现未归一化概率模型的稳健推断,并应用于稳健拟合优度检验和贝叶斯后验近似。

Comments Revised version

详情
AI中文摘要

我们引入了Stein算子的密度幂加权变体,称为γ-Stein算子。这是一类从γ-散度导出的新型算子,旨在为未归一化概率模型构建稳健的推断方法。该算子的构造(通过模型密度的正幂γ进行加权)固有地降低了异常值的影响,提供了一种稳健性的原则性机制。应用该算子产生了得分匹配的稳健推广,保留了不依赖于模型归一化常数的关键性质。我们将此框架扩展到两个关键应用:用于稳健拟合优度检验的γ-核化Stein散度,以及用于稳健贝叶斯后验近似的γ-Stein变分梯度下降。在受污染的高斯和四次势模型上的实验结果表明,我们的方法在稳健性和统计效率上显著优于标准基线。

英文摘要

We introduce a density-power weighted variant for the Stein operator, called the $γ$-Stein operator. This is a novel class of operators derived from the $γ$-divergence, designed to build robust inference methods for unnormalized probability models. The operator's construction (weighting by the model density raised to a positive power $γ$ inherently down-weights the influence of outliers, providing a principled mechanism for robustness. Applying this operator yields a robust generalization of score matching that retains the crucial property of being independent of the model's normalizing constant. We extend this framework to develop two key applications: the $γ$-kernelized Stein discrepancy for robust goodness-of-fit testing, and $γ$-Stein variational gradient descent for robust Bayesian posterior approximation. Empirical results on contaminated Gaussian and quartic potential models show our methods significantly outperform standard baselines in both robustness and statistical efficiency.

2510.20954 2026-05-26 stat.ML cs.LG eess.SP

A Spectral Framework for Graph Neural Operators: Convergence Guarantees and Tradeoffs

图神经算子的谱框架:收敛保证与权衡

Roxanne Holden, Luana Ruiz

发表机构 * Applied Mathematics and Statistics(应用数学与统计学) Johns Hopkins University(约翰霍普金斯大学)

AI总结 本文提出统一谱框架,分析图神经算子在无正则性、全局Lipschitz连续和分段Lipschitz连续假设下的收敛率与权衡。

详情
AI中文摘要

图极限(Graphons)作为图序列的极限,为分析图神经算子的渐近行为提供了算子理论框架。采样图到图极限的谱收敛诱导了相应神经算子的收敛,从而实现了图神经网络(GNN)的可迁移性分析。本文开发了一个统一的谱框架,将不同假设下(包括无正则性、全局Lipschitz连续和分段Lipschitz连续)的收敛结果整合在一起。该框架将这些结果置于公共算子环境中,便于直接比较其假设、收敛率和权衡。我们进一步在合成图和真实世界图上展示了这些率的经验紧致性。

英文摘要

Graphons, as limits of graph sequences, provide an operator-theoretic framework for analyzing the asymptotic behavior of graph neural operators. Spectral convergence of sampled graphs to graphons induces convergence of the corresponding neural operators, enabling transferability analyses of graph neural networks (GNNs). This paper develops a unified spectral framework that brings together convergence results under different assumptions on the underlying graphon, including no regularity, global Lipschitz continuity, and piecewise-Lipschitz continuity. The framework places these results in a common operator setting, enabling direct comparison of their assumptions, convergence rates, and tradeoffs. We further illustrate the empirical tightness of these rates on synthetic and real-world graphs.

2510.08609 2026-05-26 cs.SE cs.CR cs.LG cs.PL

Which Is Better For Reducing Outdated and Vulnerable Dependencies: Pinning or Floating?

哪种方法更能减少过时和易受攻击的依赖:固定版本还是浮动版本?

Imranur Rahman, Jill Marley, William Enck, Laurie Williams

发表机构 * North Carolina State University(北卡罗来纳州立大学)

AI总结 本研究通过实证分析npm、PyPI和Cargo生态系统中依赖版本约束的使用趋势,利用生存分析比较固定版本与浮动版本对依赖过时和易受攻击风险的影响。

Comments Accepted to ASE 2025

详情
Journal ref
2025 40th IEEE/ACM International Conference on Automated Software Engineering (ASE)
AI中文摘要

开发者通常使用版本约束来指定其项目依赖的可接受版本。固定依赖可以减少破坏性变更的可能性,但需要手动管理过时和易受攻击依赖的替换。另一方面,浮动依赖可以自动获取错误修复和安全修复,但存在破坏性变更的风险。安全从业者主张固定依赖以防止软件供应链攻击,例如恶意包更新。然而,由于固定是最严格的版本约束,它最可能导致依赖过时。尽管如此,不同版本约束类型下依赖变得过时或易受攻击的可能性如何变化尚不清楚。本研究旨在通过大规模实证评估不同版本约束类型下依赖变得过时或易受攻击的可能性,帮助开发者做出明智的依赖版本约束选择。在本研究中,我们首先识别了npm、PyPI和Cargo生态系统中依赖版本约束使用的趋势以及开发者对版本约束类型更改的模式。然后,我们使用生存分析对依赖状态转换进行建模,并估计使用固定版本与其他版本约束类型相比,依赖变得过时或易受攻击的可能性如何变化。我们观察到,在过时和易受攻击的依赖中,最常用的版本约束类型是浮动-次要,固定版本次之。我们还发现,浮动-主要导致过时的可能性最小,而浮动-次要导致易受攻击的可能性最小。

英文摘要

Developers consistently use version constraints to specify acceptable versions of the dependencies for their project. Pinning dependencies can reduce the likelihood of breaking changes, but comes with a cost of manually managing the replacement of outdated and vulnerable dependencies. On the other hand, floating can be used to automatically get bug fixes and security fixes, but comes with the risk of breaking changes. Security practitioners advocate pinning dependencies to prevent against software supply chain attacks, e.g., malicious package updates. However, since pinning is the tightest version constraint, pinning is the most likely to result in outdated dependencies. Nevertheless, how the likelihood of becoming outdated or vulnerable dependencies changes across version constraint types is unknown. The goal of this study is to aid developers in making an informed dependency version constraint choice by empirically evaluating the likelihood of dependencies becoming outdated or vulnerable across version constraint types at scale. In this study, we first identify the trends in dependency version constraint usage and the patterns of version constraint type changes made by developers in the npm, PyPI, and Cargo ecosystems. We then modeled the dependency state transitions using survival analysis and estimated how the likelihood of becoming outdated or vulnerable changes when using pinning as opposed to the rest of the version constraint types. We observe that among outdated and vulnerable dependencies, the most commonly used version constraint type is floating-minor, with pinning being the next most common. We also find that floating-major is the least likely to result in outdated and floating-minor is the least likely to result in vulnerable dependencies.

2510.07343 2026-05-26 cs.GR cs.AI eess.IV

Local MAP Sampling for Diffusion Models

扩散模型的局部MAP采样

Shaorong Zhang, Rob Brekelmans, Greg Ver Steeg

发表机构 * University of California, Riverside, CA, US(加州大学河滨分校)

AI总结 提出局部MAP采样(LMAPS)框架,通过沿扩散轨迹迭代求解局部MAP子问题,统一了优化方法与概率采样,在图像恢复和科学任务中达到最优性能。

详情
AI中文摘要

扩散后验采样(DPS)通过从$p(x_0 \mid y)$采样,为逆问题提供了一种基于贝叶斯原理的方法。虽然后验采样对于捕捉不确定性和多模态性很有价值,但许多经典和实际的逆问题设置最终优先考虑精确的点估计——最显著的是MAP估计器,它长期以来一直是成像和科学应用中的标准重建目标。我们引入了局部MAP采样(LMAPS),这是一种新的推理框架,沿扩散轨迹迭代求解局部MAP子问题。这一视角阐明了它们与全局MAP和DPS的联系,为基于优化的方法提供了统一的概率解释。在此基础之上,我们开发了实用算法,其中协方差近似基于高斯先验假设,并重新制定了目标函数以提高稳定性和可解释性。在广泛的图像恢复和科学任务中,LMAPS实现了最先进的性能。

英文摘要

Diffusion Posterior Sampling (DPS) provides a principled Bayesian approach to inverse problems by sampling from $p(x_0 \mid y)$. While posterior sampling is valuable for capturing uncertainty and multi-modality, many classical and practical inverse problem settings ultimately prioritize accurate point estimation -- most notably the MAP estimator, which has long served as a standard reconstruction objective in imaging and scientific applications. We introduce Local MAP Sampling (LMAPS), a new inference framework that iteratively solves local MAP subproblems along the diffusion trajectory. This perspective clarifies their connection to global MAP and DPS, offering a unified probabilistic interpretation for optimization-based methods. Building on this foundation, we develop practical algorithms with a covariance approximation motivated by a Gaussian prior assumption, and a reformulated objective for stability and interpretability. Across a broad set of image restoration and scientific tasks, LMAPS achieves state-of-the-art performance.

2509.25507 2026-05-26 stat.ML cs.LG math.ST stat.ME stat.TH

One-shot Conditional Sampling: MMD meets Nearest Neighbors

一次性条件采样:MMD 遇见最近邻

Anirban Chatterjee, Sayantan Choudhury, Rohan Hore

发表机构 * University of Chicago(芝加哥大学) MBZUAI(马斯克商学院) Carnegie Mellon University(卡内基梅隆大学)

AI总结 提出 CGMMD 框架,通过最小化最大均值差异(MMD)实现一次性条件采样,理论保证收敛性,并在图像去噪和超分辨率等任务中表现优异。

Comments Accepted at the 43rd International Conference on Machine Learning (ICML 2026)

详情
AI中文摘要

我们如何从从未完全观察到的条件分布中生成样本?这个问题在现代机器学习和经典统计学的广泛应用中都会出现,包括计算机视觉中的图像后处理、基于模拟的推理中的近似后验采样以及复杂数据设置中的条件分布建模。在这种情况下,与无条件采样相比,可以利用额外的特征信息来实现更自适应和高效的采样。基于此,我们引入了使用 MMD 的条件生成器(CGMMD),一种用于条件采样的新颖框架。与许多当代方法不同,我们的方法将训练目标设定为一个简单的、无对抗的直接最小化问题。CGMMD 的一个关键特性是它能够在生成器的单次前向传播中产生条件样本,从而实现实际的一次性采样,测试时复杂度低。我们建立了从 CGMMD 采样器采样时产生的损失的严格理论界限,并证明了估计分布向真实条件分布的收敛性。在此过程中,我们还开发了基于最近邻的泛函的一致集中结果,这可能具有独立的研究价值。最后,我们展示了 CGMMD 在涉及复杂条件密度的合成任务以及实际应用(如图像去噪和图像超分辨率)中具有竞争力的表现。

英文摘要

How can we generate samples from a conditional distribution that we never fully observe? This question arises across a broad range of applications in both modern machine learning and classical statistics, including image post-processing in computer vision, approximate posterior sampling in simulation-based inference, and conditional distribution modeling in complex data settings. In such settings, compared with unconditional sampling, additional feature information can be leveraged to enable more adaptive and efficient sampling. Building on this, we introduce Conditional Generator using MMD (CGMMD), a novel framework for conditional sampling. Unlike many contemporary approaches, our method frames the training objective as a simple, adversary-free direct minimization problem. A key feature of CGMMD is its ability to produce conditional samples in a single forward pass of the generator, enabling practical one-shot sampling with low test-time complexity. We establish rigorous theoretical bounds on the loss incurred when sampling from the CGMMD sampler, and prove convergence of the estimated distribution to the true conditional distribution. In the process, we also develop a uniform concentration result for nearest-neighbor based functionals, which may be of independent interest. Finally, we show that CGMMD performs competitively on synthetic tasks involving complex conditional densities, as well as on practical applications such as image denoising and image super-resolution.

2508.11307 2026-05-26 physics.ao-ph cs.LG physics.data-an

Approximating the universal thermal climate index using sparse regression with orthogonal polynomials

使用正交多项式稀疏回归逼近通用热气候指数

Sabin Roman, Ljupco Todorovski, Saso Dzeroski, Gregor Skok

发表机构 * Department of Knowledge Technologies, Jo z ef Stefan Institute, Jamova 39, 1000 Ljubljana, Slovenia Faculty of Mathematics Physics, University of Ljubljana, Jadranska ulica 19, 1000 Ljubljana, Slovenia

AI总结 针对通用热气候指数(UTCI)标准多项式近似误差大的问题,提出基于正交多项式基的稀疏回归方法,在保持计算效率的同时显著降低平均误差和大误差频率。

Comments Final peer-reviewed version of the manuscript

详情
Journal ref
Geoscientific Model Development 19, 4319-4330 (2026)
AI中文摘要

通用热气候指数(UTCI)是一种衡量热舒适度的指标,用于量化人类对环境条件的感受。由于其作为生物气候指标的稳健性和多功能性,已被广泛应用于生物气候学的众多研究中,并越来越多地作为户外热舒适度的操作度量。从相关环境参数计算UTCI值通常并不直接,因此使用6次多项式近似已成为计算UTCI值的标准方法。尽管计算效率高,但该多项式近似的误差可能很大。本研究的目标是开发一种改进的多项式近似版本——既能保持相当的计算效率,又在数值稳定性方面更稳健,且精度显著提高,特别是在减少较大误差的频率方面。通过使用稀疏正交回归(即基于正交多项式基的稀疏回归)实现了这一目标,这不仅大幅降低了平均误差(即平均误差、平均绝对误差和均方根误差),还显著减少了较大误差的频率。利用勒让德多项式基,可以构建近似模型,有效填充精度与复杂度的帕累托前沿,并在不同模型容量下表现出稳定的层次化系数结构。仅使用20%的数据训练新近似模型,并在剩余80%的数据上进行测试,显示出成功的泛化能力,且结果在自助法下具有稳健性。该分解有效地将UTCI近似为正交基中的傅里叶式展开,在L2(最小二乘)意义上接近理论最优值。

英文摘要

The Universal Thermal Climate Index (UTCI) is a measure of thermal comfort that quantifies how humans experience environmental conditions. Due to its robustness and versatility as a bioclimatic indicator, it has been extensively employed across a wide range of studies in bioclimatology and is increasingly used as an operational measure of outdoor thermal comfort. Calculating the UTCI value from the relevant environmental parameters is nominally not straightforward, which is why using a 6th-degree polynomial approximation has become the standard way to calculate UTCI values. Although it is computationally efficient, the error of this polynomial approximation can be substantial. The goal of this study was to develop an improved version of the polynomial approximation - one that retains comparable computational efficiency but is more robust in terms of numerical stability and substantially more accurate, particularly in reducing the frequency of larger errors. This goal was achieved using sparse orthogonal regression, namely sparse regression with an orthogonal polynomial basis, which not only substantially reduces the average errors (i.e., the mean error, the mean absolute error, and the root mean square error) but also drastically reduces the frequency of large errors. By leveraging Legendre polynomial bases, approximation models could be constructed that efficiently populate a Pareto front of accuracy versus complexity and exhibit stable, hierarchical coefficient structures across varying model capacities. Training the new approximation models over only 20% of the data, with the testing performed over the remaining 80%, highlights successful generalization, with the results being robust under bootstrapping. The decomposition effectively approximates the UTCI as a Fourier-like expansion in an orthogonal basis, yielding results near the theoretical optimum in the L2 (least squares) sense.

2507.06038 2026-05-26 math.NA cs.LG cs.NA

Fredholm Neural Networks for inverse problems in elliptic PDEs

Fredholm神经网络用于椭圆型偏微分方程反问题

Kyriakos C. Georgiou, Constantinos Siettos, Athanasios N. Yannacopoulos

发表机构 * Division of Applied Mathematics, Brown University(布朗大学应用数学系) Department of Statistics and Stochastic Modelling and Applications Laboratory, Athens University of Economics and Business(雅典经济与商业大学统计学与随机建模与应用实验室)

AI总结 基于Fredholm神经网络框架,提出可解释的Potential Fredholm神经网络(PFNN)求解椭圆型偏微分方程正反问题,实现高精度并严格证明误差界。

详情
AI中文摘要

在我们先前关于Fredholm神经网络(Fredholm NN / FNN)求解积分方程的工作基础上,我们将该框架扩展到线性和非线性椭圆型偏微分方程的反问题。所提出的方案包含一个定制设计的深度神经网络(DNN),其中层数、权重、偏置和超参数基于不动点方案以可解释的方式计算,因此我们称之为Potential Fredholm神经网络(PFNN)。我们首先构建PFNN作为求解正问题的方法,表明该方法确保了高精度和可解释性,在区域内部实现小误差,在边界上接近机器精度。然后,我们使用该方法求解椭圆型PDE的反问题,并提供了方案一致性的严格证明以及与PFNN架构直接相关的区域内部和边界的误差界。特别地,我们表明这些误差界依赖于边界函数的近似和积分离散方案,两者都直接对应于Fredholm NN架构的组成部分。通过这种方式,我们构建了一个可解释的方案,该方案为反问题提供精确解,同时由于PFNN的架构而明确尊重边界条件。我们评估了所提出方案在二维和三维线性和半线性椭圆型PDE上的性能。

英文摘要

Building on our previous work on Fredholm Neural Networks (Fredholm NNs/ FNNs) for solving integral equations, we extend the framework to inverse problems for linear and nonlinear elliptic partial differential equations. The proposed scheme consists of a custom-designed deep neural network (DNN) in which the number of layers, weights, biases and hyperparameters are computed in an explainable manner based on a fixed-point scheme, and we therefore refer to this as the Potential Fredholm Neural Network (PFNN). We first build the PFNN as a method for solving the forward problem, showing that this approach ensures both a high accuracy and explainability, achieving small errors in the interior of the domain, and near machine-precision on the boundary. We then use this approach to solve inverse problems for elliptic PDEs, and provide a rigorous proof for the consistency of the scheme and error bounds for both the interior and boundary of the domain, tied directly to the architecture of the PFNN. In particular, we show that these error bounds depend on the approximation of the boundary function and the integral discretization scheme, both of which directly correspond to components of the Fredholm NN architecture. In this way, we construct an explainable scheme that provides accurate solutions to the inverse problems, whilst still explicitly respecting the boundary conditions, due to the architecture of the PFNN. We assess the performance of the proposed scheme for linear and semi-linear elliptic PDEs in two and three dimensions.

2506.01945 2026-05-26 econ.EM cs.LG stat.AP

Stock Market Telepathy: Graph Neural Networks Predicting the Secret Conversations between MINT and G7 Countries

股市读心术:图神经网络预测MINT与G7国家之间的秘密对话

Nurbanu Bursa

发表机构 * Hacettepe University(哈切特佩大学)

AI总结 使用MTGNN图神经网络分析2012-2024年G7与MINT国家股市指数,揭示美国、加拿大、印尼和土耳其的影响力,并证明该方法优于传统预测模型。

详情
Journal ref
Communications in Statistics: Case Studies, Data Analysis and Applications (2026)
AI中文摘要

新兴经济体,特别是MINT国家(墨西哥、印度尼西亚、尼日利亚和土耳其),在全球股市中的影响力日益增强,尽管它们仍易受G7(加拿大、法国、德国、意大利、日本、英国和美国)等发达国家经济状况的影响。金融市场的这种相互关联性和敏感性使得理解这些关系对于投资者和政策制定者准确预测股价走势至关重要。为此,我们研究了2012年至2024年G7和MINT国家的主要股市指数,使用了一种称为多元时间序列图神经网络(MTGNN)的最新图神经网络算法。该方法允许考虑多元时间序列中复杂的时空连接。在实现中,MTGNN揭示出美国和加拿大在预测过程中对股市指数最具影响力的G7国家,而印度尼西亚和土耳其是最具影响力的MINT国家。此外,我们的结果表明,MTGNN在预测MINT和G7国家股市指数价格方面优于传统方法。因此,该研究为经济板块市场提供了宝贵的见解,并提出了一种使用MTGNN分析全球股市动态的令人信服的实证方法。

英文摘要

Emerging economies, particularly the MINT countries (Mexico, Indonesia, Nigeria, and Türkiye), are gaining influence in global stock markets, although they remain susceptible to the economic conditions of developed countries like the G7 (Canada, France, Germany, Italy, Japan, the United Kingdom, and the United States). This interconnectedness and sensitivity of financial markets make understanding these relationships crucial for investors and policymakers to predict stock price movements accurately. To this end, we examined the main stock market indices of G7 and MINT countries from 2012 to 2024, using a recent graph neural network (GNN) algorithm called multivariate time series forecasting with graph neural network (MTGNN). This method allows for considering complex spatio-temporal connections in multivariate time series. In the implementations, MTGNN revealed that the US and Canada are the most influential G7 countries regarding stock indices in the forecasting process, and Indonesia and Türkiye are the most influential MINT countries. Additionally, our results showed that MTGNN outperformed traditional methods in forecasting the prices of stock market indices for MINT and G7 countries. Consequently, the study offers valuable insights into economic blocks' markets and presents a compelling empirical approach to analyzing global stock market dynamics using MTGNN.

2505.07078 2026-05-26 q-fin.TR cs.AI cs.CE

Can LLM-based Financial Investing Strategies Outperform the Market in Long Run?

基于LLM的金融投资策略能否长期跑赢市场?

Weixian Waylon Li, Hyeonjun Kim, Mihai Cucuringu, Tiejun Ma

发表机构 * AIAI, School of Informatics The University of Edinburgh Edinburgh United Kingdom Global Finance Research Center Sungkyunkwan University Seoul Republic of Korea Dept. of Statistics \& OMI University of California, Los Angeles University of Oxford United States The University of Edinburgh Sungkyunkwan University University of California, Los Angeles University of Oxford

AI总结 提出FINSABER回测框架,在更长时间和更大股票池上评估基于LLM的择时策略,发现其优势在长期和广泛截面下显著下降,且在牛熊市中表现不佳。

Comments KDD 2026, Datasets & Benchmarks Track

详情
AI中文摘要

大型语言模型(LLM)最近被用于资产定价任务和股票交易应用,使AI代理能够从非结构化金融数据中生成投资决策。然而,大多数对LLM择时投资策略的评估都是在狭窄的时间范围和有限的股票池中进行的,由于幸存者偏差和数据窥探偏差,其有效性被夸大。我们通过提出FINSABER(一个在更长时间段和更大符号池中评估择时策略的回测框架),批判性地评估其泛化能力和稳健性。跨越二十年和100多个符号的系统回测表明,先前报告的LLM优势在更广泛的截面和更长期的评估下显著恶化。我们的市场制度分析进一步表明,LLM策略在牛市中过于保守,表现不及被动基准,在熊市中过于激进,导致重大损失。这些发现强调了开发能够优先考虑趋势检测和制度感知风险控制,而不仅仅是增加框架复杂性的LLM策略的必要性。

英文摘要

Large Language Models (LLMs) have recently been leveraged for asset pricing tasks and stock trading applications, enabling AI agents to generate investment decisions from unstructured financial data. However, most evaluations of LLM timing-based investing strategies are conducted on narrow timeframes and limited stock universes, overstating effectiveness due to survivorship and data-snooping biases. We critically assess their generalizability and robustness by proposing FINSABER, a backtesting framework evaluating timing-based strategies across longer periods and a larger universe of symbols. Systematic backtests over two decades and 100+ symbols reveal that previously reported LLM advantages deteriorate significantly under broader cross-section and over a longer-term evaluation. Our market regime analysis further demonstrates that LLM strategies are overly conservative in bull markets, underperforming passive benchmarks, and overly aggressive in bear markets, incurring heavy losses. These findings highlight the need to develop LLM strategies that are able to prioritise trend detection and regime-aware risk controls over mere scaling of framework complexity.

2505.05371 2026-05-26 eess.SP cs.LG q-bio.NC

From Sleep Staging to Spindle Detection: A Case Study on End-to-End Automated Sleep Analysis

从睡眠分期到纺锤波检测:端到端自动化睡眠分析的案例研究

Niklas Grieger, Siamak Mehrkanoon, Philipp Ritter, Stephan Bialonski

发表机构 * Department of Medical Engineering and Technomathematics, FH Aachen University of Applied Sciences(医学工程与技术数学系,亚琛应用科学大学) Department of Information and Computing Sciences, Utrecht University(信息与计算科学系,乌得勒支大学) Institute for Data-Driven Technologies, FH Aachen University of Applied Sciences(数据驱动技术研究所,亚琛应用科学大学) Department of Psychiatry and Psychotherapy, University Hospital Carl Gustav Carus, Technische Universität Dresden(精神病学与心理治疗系,卡尔·古斯塔夫·卡尔斯大学医院,德累斯顿技术大学)

AI总结 本研究通过案例评估,使用已验证的机器学习模型(RobustSleepNet和SUMOv2)实现全自动化睡眠分析,成功复现了专家基于双相情感障碍的研究发现,表明全自动化方法可促进大规模睡眠研究。

Comments 12 pages, 4 figures, 2 tables

详情
Journal ref
Scientific Reports 16, 16014 (2026)
AI中文摘要

睡眠分析的自动化,包括宏观结构(睡眠分期)和微观结构(例如睡眠纺锤波)元素,有望实现大规模睡眠研究,并减少由于评分者间不一致导致的差异。虽然睡眠分期和纺锤波检测等单个步骤已被分别研究,但多步骤睡眠分析自动化的可行性仍不清楚。在本案例研究中,我们评估了使用经过验证的机器学习模型进行睡眠分期(RobustSleepNet)和后续纺锤波检测(SUMOv2)的全自动化分析是否能够复现基于专家的双相情感障碍研究结果。自动化分析定性地复现了专家研究的关键发现,包括双相情感障碍患者与健康对照之间快速纺锤波密度的显著差异,在几分钟内完成了以前需要数月手动完成的工作。虽然自动化分析的结果在定量上与专家研究存在差异,可能是由于专家评分者之间或评分者与模型之间的偏差,但各个模型在睡眠分期和纺锤波检测方面的表现达到或超过了评分者间一致性。我们的结果表明,全自动化方法具有促进大规模睡眠研究的潜力。我们通过共享代码并引入SomnoBot(一个保护隐私的睡眠分析平台),公开提供自动化分析中使用的工具。

英文摘要

Automation of sleep analysis, including both macrostructural (sleep stages) and microstructural (e.g., sleep spindles) elements, promises to enable large-scale sleep studies and to reduce variance due to inter-rater incongruencies. While individual steps, such as sleep staging and spindle detection, have been studied separately, the feasibility of automating multi-step sleep analysis remains unclear. In this case study, we evaluate whether a fully automated analysis using validated machine learning models for sleep staging (RobustSleepNet) and subsequent spindle detection (SUMOv2) can replicate findings from an expert-based study of bipolar disorder. The automated analysis qualitatively reproduced key findings from the expert-based study, including significant differences in fast spindle densities between bipolar patients and healthy controls, accomplishing in minutes what previously took months to complete manually. While the results of the automated analysis differed quantitatively from the expert-based study, possibly due to biases between expert raters or between raters and the models, the models individually performed at or above inter-rater agreement for both sleep staging and spindle detection. Our results demonstrate that fully automated approaches have the potential to facilitate large-scale sleep research. We are providing public access to the tools used in our automated analysis by sharing our code and introducing SomnoBot, a privacy-preserving sleep analysis platform.

2504.05181 2026-05-26 cs.IR cs.AI cs.DL cs.LG

Lightweight and Direct Document Relevance Optimization for Generative Information Retrieval

轻量级直接文档相关性优化用于生成式信息检索

Kidist Amde Mekonnen, Yubao Tang, Maarten de Rijke

发表机构 * Institute for Clarity in Documentation(文档清晰度研究所) Inria Paris-Rocquencourt(巴黎- Rocquencourt 国家信息与自动化所) Rajiv Gandhi University(拉朱·甘地大学) Tsinghua University(清华大学) Palmer Research Laboratories(帕勒尔研究实验室) University of Amsterdam(阿姆斯特丹大学)

AI总结 提出直接文档相关性优化(DDRO)方法,通过成对排序直接对齐令牌级文档ID生成与文档级相关性估计,无需显式奖励建模和强化学习,在MS MARCO和Natural Questions上分别提升MRR@10 7.4%和19.9%。

Comments 12 pages, 3 figures. SIGIR '25 Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval July 13--18, 2025 Padua, Italy. Code and pretrained models available at: https://github.com/kidist-amde/ddro/

详情
Journal ref
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '25), pages 1327-1338, 2025
AI中文摘要

生成式信息检索(GenIR)是一种有前景的神经检索范式,它将文档检索形式化为文档标识符(docid)生成任务,允许朝着统一的全局检索目标进行端到端优化。然而,现有的GenIR模型存在令牌级错位问题,即训练用于预测下一个令牌的模型往往无法有效捕捉文档级相关性。虽然基于强化学习的方法(如相关性反馈强化学习(RLRF))旨在通过奖励建模解决这种错位,但它们引入了显著的复杂性,需要优化辅助奖励函数,然后进行强化微调,这在计算上昂贵且往往不稳定。为了解决这些挑战,我们提出了直接文档相关性优化(DDRO),它通过成对排序的直接优化,将令牌级docid生成与文档级相关性估计对齐,无需显式的奖励建模和强化学习。在包括MS MARCO文档和Natural Questions在内的基准数据集上的实验结果表明,DDRO优于基于强化学习的方法,在MS MARCO上MRR@10提升了7.4%,在Natural Questions上提升了19.9%。这些发现凸显了DDRO通过简化优化方法增强检索效果的潜力。通过将对齐问题框架化为直接优化问题,DDRO简化了GenIR模型的排序优化流程,同时为基于强化学习的方法提供了一种可行的替代方案。

英文摘要

Generative information retrieval (GenIR) is a promising neural retrieval paradigm that formulates document retrieval as a document identifier (docid) generation task, allowing for end-to-end optimization toward a unified global retrieval objective. However, existing GenIR models suffer from token-level misalignment, where models trained to predict the next token often fail to capture document-level relevance effectively. While reinforcement learning-based methods, such as reinforcement learning from relevance feedback (RLRF), aim to address this misalignment through reward modeling, they introduce significant complexity, requiring the optimization of an auxiliary reward function followed by reinforcement fine-tuning, which is computationally expensive and often unstable. To address these challenges, we propose direct document relevance optimization (DDRO), which aligns token-level docid generation with document-level relevance estimation through direct optimization via pairwise ranking, eliminating the need for explicit reward modeling and reinforcement learning. Experimental results on benchmark datasets, including MS MARCO document and Natural Questions, show that DDRO outperforms reinforcement learning-based methods, achieving a 7.4% improvement in MRR@10 for MS MARCO and a 19.9% improvement for Natural Questions. These findings highlight DDRO's potential to enhance retrieval effectiveness with a simplified optimization approach. By framing alignment as a direct optimization problem, DDRO simplifies the ranking optimization pipeline of GenIR models while offering a viable alternative to reinforcement learning-based methods.

2503.01684 2026-05-26 nucl-th cs.LG physics.comp-ph stat.ML

An Efficient Learning Method to Connect Observables

一种连接可观测量与高效学习方法

Hang Yu, Takayuki Miyagi

发表机构 * Center for Computational Sciences, University of Tsukuba(茨川大学计算科学中心)

AI总结 提出多参数本征值问题(MEP)仿真器,通过连接不同仿真器实现从可观测量到可观测量的直接预测,并利用特征向量延续(EC)和参数矩阵模型(PMM)数据进行训练,在一维格点模拟和$^{28}$O示例中验证了性能与预测概率分布获取的简便性。

Comments 5+2 pages, 4 figures, matched published version. Shared data and toy model code in the source file (shared.zip)

详情
Journal ref
Phys. Rev. Lett. 136, 202502 (2026)
AI中文摘要

构建快速准确的替代模型是许多主题中做出稳健预测的关键要素。我们引入了一种新模型,即多参数本征值问题(MEP)仿真器。新方法连接了仿真器,并可以直接从可观测量到可观测量进行预测。我们展示了MEP仿真器可以使用来自特征向量延续(EC)和参数矩阵模型(PMM)仿真器的数据进行训练。在一维格点上的简单模拟证实了MEP仿真器的性能。以$^{28}$O为例,我们还证明了通过新仿真器可以轻松获得目标可观测量的预测概率分布。

英文摘要

Constructing fast and accurate surrogate models is a key ingredient for making robust predictions in many topics. We introduce a new model, the Multiparameter Eigenvalue Problem (MEP) emulator. The new method connects emulators and can make predictions directly from observables to observables. We present that the MEP emulator can be trained with data from Eigenvector Continuation (EC) and Parametric Matrix Model (PMM) emulators. A simple simulation on a one-dimensional lattice confirms the performance of the MEP emulator. Using $^{28}$O as an example, we also demonstrate that the predictive probability distribution of the target observables can be easily obtained through the new emulator.

2501.15131 2026-05-26 math.OC cs.LG

Split-Merge: A Difference-based Approach for Dominant Eigenvalue Problem

分裂-合并:一种基于差异的主特征值问题方法

Xiaozhi Liu, Mengmeng Song, Yong Xia

发表机构 * LMIB of the Ministry of Education, School of Mathematical Sciences, Beihang University(教育部离散数学与信息检索重点实验室,北京航空航天大学数学科学学院) National Frontiers Science Center for Industrial Intelligence and Systems Optimization, Northeastern University(工业智能与系统优化国家级前沿科学中心,东北大学)

AI总结 针对对称半正定矩阵的主特征对计算问题,提出一种基于差异的无约束优化框架,并设计分裂-合并算法,实现无矩阵、无参数迭代,收敛速度优于经典幂法。

详情
AI中文摘要

对称半正定矩阵的主特征对的计算是数值优化的基础。本文将范式从经典瑞利商转变为无约束差异公式,其全局最优解恢复主特征对。在此框架下,我们证明步长 $α\in (0, 1)$ 的常数步长梯度下降几乎必然以局部线性速率收敛到全局最优解。该分析从而将经典幂法重新解释为保守特例 $α=1/2$,并严格建立了其渐近次优性。为了推进这一一阶方案,我们基于最大化-最小化原理提出了分裂-合并算法。在分裂矩阵后,我们引入辅助向量以有效合并分解因子,得到一种无矩阵、无参数的迭代,该迭代捕获更紧的曲率信息。我们证明分裂-合并几乎必然收敛到全局最小化器,并表明该迭代展现出一种谱剥离机制,抑制目标特征空间,可能超越幂法的静态线性速率。在合成和真实数据集上的数值评估证实,我们的方法具有可扩展的效率,相比幂法实现超过 $10\times$ 的加速,性能与子空间迭代相当。

英文摘要

The computation of the dominant eigenpair for symmetric positive semidefinite matrices is fundamental in numerical optimization. This work shifts the paradigm from the classical Rayleigh quotient to an unconstrained difference formulation, whose global optimum recovers the dominant eigenpair. Within this framework, we prove that gradient descent with a constant step-size $α\in (0, 1)$ converges almost surely to the global optimum at a local linear rate. This analysis thereby reinterprets the classical power method as the conservative special case $α=1/2$ and rigorously establishes its asymptotic sub-optimality. To advance this first-order scheme, we propose the Split-Merge algorithm based on the majorization-minimization principle. After splitting the matrix, we introduce auxiliary vectors to effectively merge the decomposition factors, resulting in a matrix-free and parameter-free iteration that captures tighter curvature information. We establish that Split-Merge converges almost surely to a global minimizer, and show that the iteration exhibits a spectral peeling mechanism that suppresses the targeted eigenspace, potentially surpassing the static linear rate of power iterations. Numerical evaluations across synthetic and real-world datasets confirm that our method has scalable efficiency, achieving speed-ups exceeding $10\times$ over the power method, with performance comparable to subspace iterations.

2411.00934 2026-05-26 cs.CY cs.AI

The Meme Is the Message: Generative Memesis and AI Visuals in the 2024 USA Presidential Elections

模因即信息:生成式模因与2024年美国总统选举中的AI视觉内容

Ho-Chun Herbert Chang, Benjamin Shaman, Yung-chun Chen, Mingyue Zha, Sean Noh, Chiyu Wei, Tracy Weener, Maya Magee

发表机构 * Program in Quantitative Social Science, Dartmouth College(量化社会科学项目,达特茅斯学院) Department of Mathematics, Dartmouth College(数学系,达特茅斯学院)

AI总结 本研究通过分析Instagram图像数据集,结合计算机视觉、大语言模型和面部情感分析,发现模因格式比AI生成内容更能预测用户参与度,但AI生成的模因与人类策展结合时产生协同效应,并定义了生成式模因作为AI介导的模因传播新模式。

详情
AI中文摘要

社交媒体上的视觉内容在塑造政治话语和公民参与方面变得越来越有影响力,但由于多媒体制作成本的增加,它也限制了参与。与此同时,生成式AI的发展通过降低这些成本,为公民参与政治提供了新的方式。基于239,526张Instagram图像的数据集,我们使用结合计算机视觉、大语言模型和面部情感分析的多模态工作流程,分析了2024年美国总统选举期间合成图像的影响。结果表明,模因格式比单独的AI生成内容更能预测参与度。然而,AI生成的模因产生了显著的交互效应,表明当合成图像通过人类策展与模因结合时,参与度会协同增加。我们还描述了用户如何策展图像。党派人士以不同方式使用AI:倾向民主党的用户倾向于将其用于内部群体支持,而倾向共和党的用户则更常将其用于外部群体攻击。与真实照片相比,用户通常选择更快乐的合成面孔。我们将生成式模因定义为一种传播模式,其中模因不再是人传人,而是通过AI以定制化视觉内容为中介。我们讨论了生成式AI如何增强公民参与、内容生产与策展的分化,以及这对新技术历史和参与式文化的影响。

英文摘要

Visual content on social media has become increasingly influential in shaping political discourse and civic engagement, but it also limits participation due to the increased cost of multimedia production. In tandem, the growth of generative AI provides novel ways for citizens to participate in politics by lowering these costs. Drawing on a dataset of 239,526 Instagram images, we analyze the effects of synthetic images during the 2024 United States presidential election, using a multimodal workflow combining computer vision, large language models, and facial affect analysis. Results show that meme format is a stronger predictor of engagement than AI-generated content alone. However, AI-generated memes yield a significant interaction effect, suggesting synergistic increases in engagement when synthetic imagery is integrated with memes through human curation. We also characterize how users curate images. Partisans use AI in different ways: Democrat-leaning users tend to use it for in-group support, whereas Republican-leaning users more often employ it for out-group attacks. Users generally select happier synthetic faces compared to real photographs. We define generative memesis as a mode of communication in which memes are no longer shared person-to-person, but mediated by AI through customized visuals. We discuss how generative AI may empower civic participation, the bifurcation of content production and curation, and its implications for in the history of novel technologies and participatory culture.

2410.10652 2026-05-26 q-bio.QM cs.LG

Querying structural and functional niches on spatial transcriptomics data

查询空间转录组数据中的结构和功能生态位

Mo Chen, Minsheng Hao, Xinquan Liu, Lin Deng, Peng Liu, Chen Li, Dongfang Wang, Kui Hua, Liang Guo, Xuegong Zhang, Lei Wei

发表机构 * MOE Key Laboratory of Bioinformatics and Bioinformatics Division of BNRIST, Department of Automation, Tsinghua University(生物信息学教育部重点实验室和北京理工大学生物信息学分部,自动化系,清华大学) Center for Synthetic and Systems Biology, School of Life Sciences and School of Medicine, Tsinghua University(合成与系统生物学中心,生命科学学院和医学学院,清华大学) Department of Thoracic Surgery, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College(胸外科部门,国家癌症中心/国家癌症临床研究中心/癌症医院,中国医学科学院和北京协和医学院) Peking Union Medical College, Chinese Academy of Medical Sciences(北京协和医学院,中国医学科学院) Biomedical Pioneering Innovation Center (BIOPIC), Peking University(生物医学前瞻性创新中心(BIOPIC),北京大学) Cancer Research UK Cambridge Institute, University of Cambridge(英国癌症研究Cambridge研究所,剑桥大学) Department of Immunology, School of Basic Medical Sciences, Harbin Medical University(免疫学部门,基础医学学院,哈尔滨医科大学) Zhongguancun Academy, Beijing, China(中关村学院,北京,中国) Zhongguancun Institute of Artificial Intelligence, Beijing, China(中关村人工智能研究院,北京,中国)

AI总结 提出QueST方法,通过子图建模和对比学习查询空间转录组样本中的相似生态位,有效捕捉异质环境中的生态位结构并跨平台泛化。

详情
AI中文摘要

多细胞生物中的细胞协调形成结构和功能生态位。随着空间转录组学(ST)能够在空间背景下进行基因表达谱分析,已揭示空间生态位在生理和病理过程中作为内聚且重复出现的单位。这些观察表明,由保守生态位模式编码的普遍组织原则,并呼吁一种超越当前计算工具的基于查询的生态位分析范式。在这项工作中,我们定义了生态位查询任务,即给定感兴趣生态位(NOI),在ST样本中识别相似生态位。我们进一步开发了QueST,一种专门解决该任务的方法。QueST将每个生态位建模为子图,使用对比学习学习判别性生态位嵌入,并引入对抗训练以减轻批次效应。在模拟和基准数据集中,QueST优于为生态位查询改造的现有方法,准确捕捉异质环境中的生态位结构,并展现出跨不同测序平台的强泛化能力。应用于肾癌和肺癌中的三级淋巴结构,QueST揭示了与患者预后相关的功能不同生态位,并发现了跨癌症类型的保守和分歧空间结构。应用于组合空间扰动数据集,QueST展示了完整的从头发现导向工作流程,通过查询表征了先前未解析的肿瘤结节。这些结果表明,QueST能够跨样本进行系统、定量的空间生态位分析,为剖析健康和疾病中的空间组织结构提供了强大工具。

英文摘要

Cells in multicellular organisms coordinate to form structural and functional niches. With spatial transcriptomics (ST) enabling gene expression profiling in spatial contexts, it has been revealed that spatial niches serve as cohesive and recurrent units in physiological and pathological processes. These observations suggest universal tissue organization principles encoded by conserved niche patterns, and call for a query-based niche analytical paradigm beyond current computational tools. In this work, we defined the niche-query task, which is to identify similar niches across ST samples given a niche of interest (NOI). We further developed QueST, a specialized method for solving this task. QueST models each niche as a subgraph, uses contrastive learning to learn discriminative niche embeddings, and incorporates adversarial training to mitigate batch effects. In simulations and benchmark datasets, QueST outperformed existing methods repurposed for niche querying, accurately capturing niche structures in heterogeneous environments and demonstrating strong generalizability across diverse sequencing platforms. Applied to tertiary lymphoid structures in renal and lung cancers, QueST revealed functionally distinct niches associated with patient prognosis and uncovered conserved and divergent spatial architectures across cancer types. Applied to a combinatorial spatial perturbation dataset, QueST demonstrated a complete de novo discovery-oriented workflow, characterizing previously unresolved tumor nodules through querying. These results demonstrate that QueST enables systematic, quantitative profiling of spatial niches across samples, providing a powerful tool to dissect spatial tissue architecture in health and disease.

2409.08379 2026-05-26 cs.SE cs.AI econ.GN q-fin.EC

The Impact of Large Language Models on Open-source Innovation: Evidence from GitHub Copilot

大型语言模型对开源创新的影响:来自GitHub Copilot的证据

Doron Yeverechyahu, Raveesh Mayya, Gal Oestreicher-Singer

发表机构 * Coller School of Management, Tel Aviv University(特拉维夫大学科尔学院) Stern School of Business, New York University(纽约大学斯特恩商学院)

AI总结 利用GitHub Copilot推出的自然实验,通过三种识别策略和两种分类方法,发现LLM使开源贡献增加28%-40%,且增量贡献增长显著大于实质性贡献,表明LLM偏向于利用现有代码库而非探索新功能。

Comments JEL Classification: O31, C88, J24, O35, L86

详情
AI中文摘要

大型语言模型(LLM)正在重塑知识工作,但它们对自愿、自我指导的开源创新论坛(贡献者无管理指导地选择任务)的影响可能与组织环境中观察到的效果根本不同。我们在开源软件开发中研究这个问题,其中个人的贡献在社区层面共同推动创新。与产品创新不同,产品创新中创新的分类类型已明确,开源环境中的知识工作需要根据任务对贡献者的认知需求进行区分。新兴文献区分了实质性贡献(需要创造性地解决问题以引入新功能)和增量贡献(利用对现有代码的理解来维护和改进代码)。我们利用2021年10月GitHub Copilot推出的自然实验,其中Copilot支持Python等语言,但出于商业原因不支持R,从而在原本可比的生态系统之间创建了外生划分。使用三种互补的识别策略和两种分类方法,我们发现Copilot的可用性使开源贡献增加了28%到40%。在所有规格中,增量贡献的增长显著大于实质性贡献的增长。这种差异在活动水平较高的项目中更为明显,并在模型升级后扩大:当现有上下文有助于定义问题和约束解决方案时,LLM更有效地发挥作用,使协作创新偏向于利用现有代码库而非探索新功能。鉴于生成式AI在知识经济中的爆炸性速度,本文提供了关于LLM影响的罕见因果实地证据。

英文摘要

Large Language Models (LLMs) are reshaping knowledge work, yet their impact on voluntary, self-guided open innovation forums (contributors choose tasks without managerial direction) may differ fundamentally from effects observed in organizational settings. We study this question in open-source software development, where individuals' contributions collectively drive innovation at a community level. Unlike product innovation, where typologies for classifying innovation are well established, knowledge work in open-source settings calls for a distinction grounded in the cognitive demand a task places on the contributor. Burgeoning literature distinguishes substantive contributions, which require creative problem formulation to introduce new functionality, from incremental contributions, which draw on comprehension of existing code to maintain and refine it. We exploit a natural experiment around GitHub Copilot's launch in October 2021, where Copilot supported languages like Python while not supporting R for business reasons, creating an exogenous partition between otherwise comparable ecosystems. Using three complementary identification strategies and two classification approaches, we find that Copilot availability increases open-source contributions by 28 to 40 percent. The increase in incremental contributions is significantly larger than the increase in substantive contributions across all specifications. This disparity is more pronounced in projects with higher activity levels and widens following a model upgrade: LLMs function more effectively when existing context helps define the problem and constrain solutions, tilting collaborative innovation toward exploitation of established codebases rather than exploration of new functionality. This paper provides a rare instance of causal field evidence on LLM effects, given the speed at which GenAI has exploded across the knowledge economy.

2401.11963 2026-05-26 cs.NE cs.AI cs.LG

Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey on Hybrid Algorithms

桥接进化算法与强化学习:混合算法的全面综述

Pengyi Li, Jianye Hao, Hongyao Tang, Xian Fu, Yan Zheng, Ke Tang

发表机构 * College of Intelligence and Computing, Tianjin University(天津大学智能与计算学院) Montreal Institute of Learning Algorithms (MILA)(蒙特利尔学习算法研究所) Department of Computer Science and Engineering, Southern University of Science and Technology(南方科技大学计算机科学与工程系)

AI总结 本文全面综述了进化强化学习(ERL)领域,将进化算法(EA)与强化学习(RL)融合,系统总结了三种主要研究方向:EA辅助RL优化、RL辅助EA优化以及EA与RL协同优化,并分析了各分支解决的问题及未来挑战。

Comments New Version, add more methods

详情
AI中文摘要

进化强化学习(ERL)将进化算法(EA)和强化学习(RL)相结合用于优化,已展现出显著的性能提升。通过融合这两种方法,ERL已成为一个有前景的研究方向。本综述全面概述了ERL中的不同研究分支。具体而言,我们系统地总结了相关算法的最新进展,并确定了三个主要研究方向:EA辅助的RL优化、RL辅助的EA优化以及EA和RL的协同优化。随后,我们对每个研究方向进行了深入分析,组织了多个研究分支。我们阐明了每个分支旨在解决的问题,以及EA和RL的整合如何应对这些挑战。最后,我们讨论了各个研究方向中潜在的挑战和未来的研究方向。为了便于研究人员深入研究ERL,我们在https://github.com/yeshenpy/Awesome-Evolutionary-Reinforcement-Learning上整理了所涉及的算法和代码。

英文摘要

Evolutionary Reinforcement Learning (ERL), which integrates Evolutionary Algorithms (EAs) and Reinforcement Learning (RL) for optimization, has demonstrated remarkable performance advancements. By fusing both approaches, ERL has emerged as a promising research direction. This survey offers a comprehensive overview of the diverse research branches in ERL. Specifically, we systematically summarize recent advancements in related algorithms and identify three primary research directions: EA-assisted Optimization of RL, RL-assisted Optimization of EA, and synergistic optimization of EA and RL. Following that, we conduct an in-depth analysis of each research direction, organizing multiple research branches. We elucidate the problems that each branch aims to tackle and how the integration of EAs and RL addresses these challenges. In conclusion, we discuss potential challenges and prospective future research directions across various research directions. To facilitate researchers in delving into ERL, we organize the algorithms and codes involved on https://github.com/yeshenpy/Awesome-Evolutionary-Reinforcement-Learning.

2310.01285 2026-05-26 q-fin.CP cs.LG q-fin.MF stat.ML

Automated regime classification in multidimensional time series data using sliced Wasserstein k-means clustering

多维时间序列数据中的自动制度分类:基于切片Wasserstein k-means聚类

Qinmeng Luan, James Hamp

发表机构 * Citigroup, London, UK(伦敦英国摩根大通公司) Data Science Institute, London School of Economics, London, UK(伦敦经济学院数据科学研究所)

AI总结 提出切片Wasserstein k-means聚类方法,通过近似多维Wasserstein距离,实现多维时间序列数据的自动制度分类,并在合成数据和真实外汇数据中验证有效性。

详情
Journal ref
Data Science in Finance and Economics 2025, Volume 5, Issue 3: 387-418
AI中文摘要

最近的研究提出Wasserstein k-means(Wk-means)聚类作为对时间序列数据(特别是单维资产收益)进行制度分类的强大方法。本文首先详细研究应用于合成一维时间序列数据的Wasserstein k-means聚类算法的行为。我们通过详细研究聚类算法的动态以及超参数变化如何影响不同随机初始化的性能,扩展了先前的工作。我们计算简单的度量,发现这些度量有助于识别高质量的聚类。然后,我们将Wasserstein k-means聚类技术扩展到多维时间序列数据,通过将多维Wasserstein距离近似为切片Wasserstein距离,得到一种称为“切片Wasserstein k-means(sWk-means)聚类”的方法。我们将sWk-means聚类方法应用于多维时间序列数据中的自动制度分类问题,使用合成数据证明该方法的有效性和有效性。最后,我们以公开的外汇即期汇率数据作为案例研究,表明sWk-means方法能够识别真实多维金融时间序列中的不同市场制度。我们最后评论了该方法的一些局限性以及潜在的补充或替代方法。

英文摘要

Recent work has proposed Wasserstein k-means (Wk-means) clustering as a powerful method to classify regimes in time series data, and one-dimensional asset returns in particular. In this paper, we begin by studying in detail the behaviour of the Wasserstein k-means clustering algorithm applied to synthetic one-dimensional time series data. We extend the previous work by studying, in detail, the dynamics of the clustering algorithm and how varying the hyperparameters impacts the performance over different random initialisations. We compute simple metrics that we find to be useful in identifying high-quality clusterings. We then extend the technique of Wasserstein k-means clustering to multidimensional time series data by approximating the multidimensional Wasserstein distance as a sliced Wasserstein distance, resulting in a method we call 'sliced Wasserstein k-means (sWk-means) clustering'. We apply the sWk-means clustering method to the problem of automated regime classification in multidimensional time series data, using synthetic data to demonstrate the validity and effectiveness of the approach. Finally, we show that the sWk-means method is able to identify distinct market regimes in real multidimensional financial time series, using publicly available foreign exchange spot rate data as a case study. We conclude with remarks about some limitations of our approach and potential complementary or alternative approaches.

2605.24526 2026-05-26 cs.HC cs.AI

TRAFA: Anticipating User Actions to Reduce Errors in Procedural Tasks with Predictive Feedback

TRAFA:通过预测性反馈预见用户操作以减少程序性任务中的错误

Sassan Mokhtar, Lars Doorenbos, Fatemeh Jabbari, Marius Bock, Dominik Bach, Juergen Gall

发表机构 * University of Bonn(波恩大学) Lamarr Institute for Machine Learning and Artificial Intelligence(拉马尔人工智能与机器学习研究所)

AI总结 提出TRAFA系统,通过跟踪-预测-行动框架实时预测用户动作并触发反馈,在错误发生前干预,实验证明相比传统反应式反馈能提高任务准确性和效率。

详情
AI中文摘要

交互式辅助系统通常在动作完成后提供反馈,支持错误恢复但无法预防错误本身。我们提出TRAFA,一种用于程序性任务的实时预测性反馈系统,在错误发生前进行干预。TRAFA通过跟踪-预测-行动框架实现预测性反馈:跟踪手和物体状态,基于场景上下文预测用户运动,并在预测动作可能违反任务约束时触发反馈。我们在顺序组装场景中实例化该流程,并通过技术基准测试和对照用户研究(与传统反应式反馈对比)进行评估。结果表明,预测性反馈在保持反馈事件数量相当的同时,提高了任务准确性和效率。这些发现将反馈时机定位为系统设计的关键维度,并展示了如何将实时预测集成到交互系统中以在错误发生前预防错误。

英文摘要

Interactive assistance systems typically provide feedback after an action has been completed, supporting error recovery but not preventing the error itself. We present TRAFA, a real-time predictive feedback system for procedural tasks that intervenes before errors are committed. TRAFA operationalizes predictive feedback through a Track-Forecast-Act framework that tracks hand and object state, forecasts user motion conditioned on scene context, and triggers feedback when a predicted action is likely to violate task constraints. We instantiate this pipeline in a sequential assembly setting and evaluate it through both technical benchmarking and a controlled user study against conventional reactive feedback. Our results show that predictive feedback improves task accuracy and efficiency while maintaining a comparable number of feedback events. These findings position feedback timing as a key dimension in system design and show how real-time anticipation can be integrated into interactive systems to prevent errors before they occur.

2605.24520 2026-05-26 q-bio.GN cs.LG

AnnotateMissense: a genome-wide annotation and benchmarking framework for missense pathogenicity prediction

AnnotateMissense:一个用于错义致病性预测的全基因组注释和基准测试框架

Muhammad Muneeb, David B. Ascher

发表机构 * School of Chemistry and Molecular Biology(化学与分子生物学学院) The University of Queensland(昆士兰大学) Baker Heart and Diabetes Institute(贝克心脏病与糖尿病研究所)

AI总结 提出AnnotateMissense框架,整合多种特征,通过XGBoost模型在ClinVar数据集上实现高精度错义变异致病性预测,并生成全基因组预测结果。

详情
AI中文摘要

错义变异解读仍然具有挑战性,因为致病性取决于来自群体频率、进化保守性、转录本背景、氨基酸替代严重性、先验致病性预测因子以及蛋白质语言模型衍生特征的异质性证据。我们提出了AnnotateMissense,一个用于错义变异解读的可扩展注释、基准测试和全基因组预测框架。AnnotateMissense整合了来自dbNSFP v5.1的hg38错义变异与ANNOVAR注释、dbNSFP转录本/蛋白质描述符、AlphaMissense评分、ESM衍生特征、保守性指标、群体频率变量、已建立的致病性预测因子以及工程化的氨基酸/密码子背景特征。使用132,714个ClinVar标记的错义变异,我们在受控特征配置下对机器学习和深度学习模型进行了基准测试。完整的303特征基准集在XGBoost上实现了最强性能,在分层五折交叉验证中平均MCC=0.9411,ROC-AUC=0.9950。受限的朴素和位置导向特征集分别达到了较低的MCC最佳值0.4989和0.5113。循环控制消融实验表明,移除先验预测因子、群体频率和临床重叠证据会降低性能,而单独排除AlphaMissense和ESM衍生特征影响最小。在新观察到的致病/良性变异上的时间ClinVar验证实现了MCC=0.7613,准确率=0.8798,F1分数=0.8750。最终模型应用于90,643,830个hg38错义变异,生成AnnotateMissense致病性评分和二元预测标签。代码和输出可在https://github.com/MuhammadMuneeb007/CAGI7_Annotate_All_Missense和https://doi.org/10.5281/zenodo.19981867获取。

英文摘要

Missense variant interpretation remains challenging because pathogenicity depends on heterogeneous evidence from population frequency, evolutionary conservation, transcript context, amino acid substitution severity, prior pathogenicity predictors and protein-language-model-derived features. We present AnnotateMissense, a scalable annotation, benchmarking and genome-wide prediction framework for missense variant interpretation. AnnotateMissense integrates hg38 missense variants derived from dbNSFP v5.1 with ANNOVAR annotations, dbNSFP transcript/protein descriptors, AlphaMissense scores, ESM-derived features, conservation metrics, population-frequency variables, established pathogenicity predictors and engineered amino acid/codon-context features. Using 132,714 ClinVar-labelled missense variants, we benchmarked machine-learning and deep-learning models under controlled feature configurations. The full 303-feature benchmark set achieved the strongest performance with XGBoost, reaching mean MCC = 0.9411 and ROC-AUC = 0.9950 across stratified five-fold cross-validation. Restricted naive and location-oriented feature sets achieved lower best MCC values of 0.4989 and 0.5113, respectively. Circularity-controlled ablations showed that removing prior-predictor, population-frequency and clinically overlapping evidence reduced performance, whereas excluding AlphaMissense and ESM-derived features alone had minimal effect. Temporal ClinVar validation on newly observed pathogenic/benign variants achieved MCC = 0.7613, accuracy = 0.8798 and F1-score = 0.8750. The final model was applied to 90,643,830 hg38 missense variants to generate AnnotateMissense pathogenicity scores and binary prediction labels. Code and outputs are available at https://github.com/MuhammadMuneeb007/CAGI7_Annotate_All_Missense and https://doi.org/10.5281/zenodo.19981867.

2605.24516 2026-05-26 cs.MA cs.AI

Adaptive Punishment for Cooperation in Mixed-Motive Games

混合动机博弈中促进合作的自适应惩罚

Min Tang, Fanqi Kong, Linyuan Lü, Xue Feng

发表机构 * University of Science and Technology of China(中国科学技术大学) State Key Laboratory of General Artificial Intelligence, BIGAI(一般人工智能国家重点实验室,BIGAI)

AI总结 提出自适应惩罚合作方法(APC),通过动态惩罚概率和背叛严重程度确定惩罚强度,在迭代公共物品博弈中有效促进合作并降低惩罚成本。

详情
AI中文摘要

混合动机场景在现实多智能体交互中普遍存在,其中自私的智能体往往为了即时奖励而背叛,忽视了利他合作改善长期收益和集体福利的潜力。同伴惩罚可以阻止背叛,但作为代价高昂的二阶利他行为,其持续施加可能损害惩罚者的利益。现有方法通常难以有效实施惩罚以促进合作。为了平衡惩罚的有效性和成本,我们提出了自适应惩罚合作方法(APC),这是一种分布式方法,基于动态惩罚概率和背叛严重程度来确定惩罚强度。这种动态概率大大减少了代价高昂且无效的惩罚,同时促进了合作。为了准确评估背叛及其严重程度,我们使用了一个背叛感知模块,其学习由游戏奖励引导。理论分析和实证结果表明,APC在迭代公共物品博弈中表现有效。在实证中,APC在连续社会困境中也显著优于现有基线,学习到理性且有效的惩罚策略,通过战略性地阻止背叛来促进合作。

英文摘要

Mixed-motive scenarios are ubiquitous in real-world multi-agent interactions, where self-interested agents often defect for immediate rewards, overlooking the potential of altruistic cooperation to improve long-term gains and collective welfare. Peer punishment can deter defection, but as costly second-order altruism, its persistent imposition may undermine the punisher's interests. Existing approaches often struggle to effectively implement punishment to promote cooperation. To balance the efficacy and cost of punishment, we propose Adaptive Punishment for Cooperation (APC), a distributed method that determines punishment intensity based on both a dynamic punishment probability and the severity of defection. This dynamic probability substantially reduces costly and ineffective punishment while also promotes cooperation. To accurately assess defection and its severity, we use a defection awareness module, whose learning is guided by game reward. Theoretical analysis and empirical results show APC performs effectively in iterated public goods game. Empirically, APC also significantly outperforms existing baselines across sequential social dilemmas, learning rational and effective punishment policies that foster cooperation by strategically deterring defection.

2605.24502 2026-05-26 cond-mat.stat-mech cs.LG math.CO physics.comp-ph

Implicit Binarization via Complex Phase Dynamics in Combinatorial Optimization

组合优化中通过复相位动力学实现的隐式二值化

Khen Cohen, Mark Glass, Meir Feder, Yaron Oz

发表机构 * School of Physics and Astronomy, Tel Aviv University(特拉维夫大学物理与天文学学院) School of Electrical and Computer Engineering, Tel Aviv University(特拉维夫大学电气与计算机工程学院)

AI总结 提出一种受物理启发的连续松弛框架,通过将离散二进制变量参数化为复单位圆上的连续波状状态,隐式正则化促进收敛到离散状态,显著提升NP难组合优化问题的求解性能。

Comments 27 pages, 5 figures

详情
AI中文摘要

我们引入了一种受物理启发的连续松弛框架,该框架为NP难组合优化问题(包括二次无约束二进制优化(QUBO)、二进制稀疏编码和植入解伊辛模型)提供了显著改进的解。通过将离散二进制变量参数化为复单位圆上的连续波状状态,我们固有地平滑了高度非凸的能量景观。我们证明,将二进制变量表示为复相位揭示了一种隐式正则化机制,该机制促进向离散状态的收敛。即使在标准的实值优化框架中显式使用该正则化器,提取这一机制也能带来显著改进。实验上,该正则化比标准的实值替代方案实现了更高的基态收敛率。我们的模型在严重噪声(σ=0.25)下的大规模160x160 QUBO任务中实现了零误差,并在欠定稀疏编码中优于传统算法(OMP和LASSO),在σ=0.15时完美恢复。求解器的鲁棒性进一步通过11个严格设计的植入解基准中恢复8个精确基态配置得到验证。

英文摘要

We introduce a physics-inspired continuous relaxation framework that yields substantially improved solutions for NP-hard combinatorial optimization problems, including Quadratic Unconstrained Binary Optimization (QUBO), binary sparse coding, and planted-solution Ising models. By parameterizing discrete binary variables as continuous wave-like states on the complex unit circle, we inherently smooth highly non-convex energy landscapes. We show that representing binary variables as complex phases reveals an implicit regularization mechanism that promotes convergence toward discrete states. Extracting this mechanism yields significant improvements even within standard real-valued optimization frameworks, using this regularizer explicitly. Empirically, this regularization yields vastly higher ground-state convergence rates than standard real-valued alternatives. Our models achieved zero error in large-scale 160x160 QUBO tasks under severe noise (sigma=0.25), and outperformed traditional algorithms (OMP and LASSO) in underdefined sparse coding with perfect recovery at sigma=0.15. The solver's robustness was further validated by recovering exact ground-state configurations in 8 out of 11 rigorously engineered planted-solution benchmarks.

2605.24457 2026-05-26 eess.SY cs.LG cs.SY

Asymmetric Adaptation-based Real-time Fault Diagnosis Under Transitional Operating Conditions

基于非对称自适应的过渡工况实时故障诊断

Hongshuo Zhao, Zeyi Liu, Xiao He

发表机构 * MCC5 Group Shanghai Co. LTD(MCC5集团上海有限公司) Tsinghua University(清华大学)

AI总结 针对离线训练未覆盖的过渡工况导致分布偏移问题,提出一种结合离线域泛化与在线测试时自适应的非对称自适应故障诊断方法,通过周期原型重投影和不对称学习率策略实现快速适应并保持判别能力。

Comments 6 pages, 3 figures, Accepted by ICAIS & ISAS 2026

详情
AI中文摘要

实际工业场景中的数据流通常包含离线训练中未覆盖的过渡工况,导致显著的分布偏移。为弥合静态离线模型与动态在线数据之间的差距,本文提出了一种新颖的基于非对称自适应的故障诊断方法。具体地,在离线阶段,我们采用域泛化技术从多个稳定工况中提取域不变特征,并构建鲁棒的归一化故障原型作为参考锚点。随后,在在线推理阶段,我们设计了一种基于周期原型重投影机制的在线测试时自适应方法,以动态更新原型位置。此外,我们利用从锚点导出的几何分布来指导分类器的更新,并对特征提取器和分类器采用非对称学习率策略。所提方法确保快速适应新的过渡工况,同时保留从离线域泛化初始化继承的判别能力。实验结果表明,该机制有效利用离线泛化知识指导在线推理,显著提高了非平稳环境下的鲁棒性。

英文摘要

Data streams in real-world industrial scenarios often contain transitional operating conditions that are uncovered during offline training, leading to significant distribution shifts. To bridge the gap between static offline models and dynamic online data, a novel asymmetric adaptation-based fault diagnosis method is proposed in this paper. Specifically, in the offline stage, we employ domain generalization techniques to extract domain-invariant features from multiple stable conditions and construct robust normalized fault prototypes as reference anchors. Subsequently, during online inference, we design an online test-time adaptation method based on a periodic prototype re-projection mechanism to dynamically update prototype positions. Furthermore, we utilize the geometric distribution derived from anchors to guide the updates of classifiers and adopt an asymmetric learning rate strategy for the feature extractor and classifier. The proposed approach ensures rapid adaptation to new transitional conditions while preserving the discriminative power inherited from the offline domain generalization initialization. Experimental results demonstrate that this mechanism effectively leverages offline generalized knowledge to guide online inference, significantly improving robustness in non-stationary environments.

2605.24453 2026-05-26 cs.SE cs.AI

Code2UML: Agentic LLMs with context engineering for scalable software visualization

Code2UML: 基于上下文工程的可扩展软件可视化的智能体LLM

Alin-Gabriel Văduva, Anca-Ioana Andreescu, Simona-Vasilica Oprea, Adela Bâra

发表机构 * Bucharest University of Economic Studies(布加勒斯特经济大学)

AI总结 提出一种基于五个专门智能体和确定性IR压缩层的智能体架构,用于从源代码仓库自动生成UML图,在12个开源仓库和7种UML图上验证了高语法有效性(平均91.5%)和结构质量(平均81.7/100),且质量不随规模下降。

详情
AI中文摘要

基于大型语言模型(LLM)的代码分析工具被用于自动化软件文档任务。然而,这些方法在真实代码库中的可扩展性——其中中间表示(IR)超过LLM上下文限制——仍未充分探索。本文介绍了一种具有上下文工程的智能体架构,用于从源代码仓库自动生成UML图。它采用基于Claude Agent SDK构建的五个专门智能体的层次结构:PlannerAgent、AnalyzerAgent、DiagramAgent、CorrectorAgent和DependencyAnalyzerAgent,每个处理不同的认知子任务。一个确定性的、重要性加权的IR压缩层将完整项目IR转换为保证适合令牌限制的特定图视图,无需LLM调用且可在毫秒内完成。因此,我们在4种编程语言(Java、JavaScript、PHP、Python)的12个开源仓库和7种UML图上评估该系统,产生了84个观察结果,并在5个自动指标上进行了评估。结果表明高语法有效性(平均91.5%,其中组件图和部署图达到100%)、强关系精度(平均0.858)和一致的结构质量(平均81.7/100,跨语言方差为3.1分)。实体召回率平均为0.313,反映了有意的架构优先级而非全面覆盖。敏感性分析(31到4,578个IR实体)证实质量分数无论规模大小都保持稳定。

英文摘要

Large Language Model (LLM)-based code analysis tools are adopted to automate software documentation tasks. However, the scalability of these approaches to real codebases, where Intermediate Representations (IR) exceed LLM context limits, remains underexplored. This paper introduces an agentic architecture with context engineering for automated UML diagram generation from source code repositories. It employs a hierarchy of five specialized agents: PlannerAgent, AnalyzerAgent, DiagramAgent, CorrectorAgent and DependencyAnalyzerAgent, built on the Claude Agent SDK, each addressing a distinct cognitive subtask. A deterministic, importance-weighted IR compaction layer transforms full project IRs into diagram-specific views guaranteed to fit within token constraints, requiring no LLM calls and completing in milliseconds. Thus, we evaluate the system across 12 open-source repositories in 4 programming languages (Java, JavaScript, PHP, Python) and 7 UML diagram types, producing 84 observations assessed on 5 automated metrics. Results demonstrate high syntactic validity (mean: 91.5%, with component and deployment diagrams reaching 100%), strong relationship precision (mean: 0.858) and consistent structural quality (mean: 81.7/100, with cross-language variance of 3.1 points). Entity recall averaged 0.313, reflecting deliberate architectural prioritization over exhaustive coverage. A sensitivity analysis (31 to 4,578 IR entities) confirms that quality scores remain stable regardless of scale.

2605.24436 2026-05-26 cs.MA cs.LG cs.RO

A Reinforcement Learning Inspired Latent Yield Based Adaptive Algorithm Switching Mechanism

一种受强化学习启发的基于潜在收益的自适应算法切换机制

Jayprakash S. Nair, Jimson Mathew, Shivashankar B. Nair

发表机构 * Indian Institute of Technology Patna(印度理工学院帕纳布分校) Indian Institute of Technology Guwahati(印度理工学院古瓦哈提分校)

AI总结 针对在线或动态环境中算法选择困难的问题,提出一种受强化学习启发的潜在收益方法,通过封装奖励和惩罚触发探索与利用,实现自适应算法切换,并在排序算法和机器人避障任务中验证了有效性。

Comments Accepted and published in the Proceedings of the 29th European Conference on Applications of Evolutionary Computation (EvoApplications 2026), held as part of EvoStar 2026, Toulouse, France, April 8 to 10, 2026. Lecture Notes in Computer Science (LNCS), Springer Nature Switzerland

详情
Journal ref
Applications of Evolutionary Computation, EvoApplications 2026, LNCS, Springer Nature Switzerland, 2026
AI中文摘要

对于给定的问题实例,选择最合适的算法仍然是一项具有挑战性的任务,尤其是在问题特征随时间演变的在线或动态环境中。仅依赖瞬时性能指标可能导致反应性和不稳定的行为,通常会导致次优的算法切换。本文介绍了一种计算高效的方法,用于聚合算法在多个问题实例上的性能,该方法对实例特征的剧烈变化具有相当的免疫性。受强化学习(RL)固有特征的启发,该技术将奖励和惩罚封装到一个潜在收益中,进而触发利用和探索,从而产生自适应算法切换。所提出的技术采用受遗传算法启发的岛屿模型,以促进并行探索和算法种群之间的性能交换,这些算法种群栖息在局部库中。在排序算法和机器人避障任务上的实验评估证明了该方法的可行性和有效性,突显了其在自适应算法选择至关重要的领域中的潜力。

英文摘要

Selecting the most suitable algorithm for a given problem instance remains a challenging task, particularly in online or dynamic environments where problem characteristics evolve over time. Relying solely on instantaneous performance metrics can result in a reactive and unstable behaviour, often leading to suboptimal algorithm switching. This paper introduces a computationally efficient approach for aggregating an algorithm's performance across multiple problem instances that is fairly immune to erratic variations in instance features. Inspired by features inherent to Reinforcement Learning (RL), this technique encapsulates rewards and penalties into a latent yield that, in turn, triggers exploitation and exploration, consequently resulting in adaptive algorithm switching. The proposed technique employs island models, inspired by Genetic Algorithms, to facilitate parallel exploration and performance exchanges among algorithm populations inhabiting local repertoires. Experimental evaluations on sorting algorithms and robotic obstacle avoidance tasks demonstrate the feasibility and effectiveness of the approach, highlighting its potential in domains where adaptive algorithm selection is critical.

2605.24422 2026-05-26 stat.ML cs.LG

Clustering based on Stochastic Dominance with application for risk averters and risk seekers

基于随机占优的聚类及其对风险规避者和风险寻求者的应用

Hua Li, Xue Jia, Yilin Kang, Wing-Keung Wong

发表机构 * School of Science, Changchun University, Changchun, China(长春大学科学学院,中国长春) School of Mathematics and Science, Northeast Normal University, Changchun, China(东北师范大学数学与科学学院,中国长春)

AI总结 针对传统聚类方法无法捕捉资产间风险占优关系的问题,提出基于随机占优检验统计量的聚类分析框架,通过构造随机占优系数矩阵并改进K-means和层次聚类算法,实现面向不同风险偏好投资者的定制化资产配置。

详情
AI中文摘要

随机占优(SD)理论为选择适合不同风险偏好(即风险规避、风险寻求和风险中性)投资者资产配置需求的优质资产提供了严格框架。然而,传统的股票聚类方法通常依赖欧氏距离等几何度量,往往无法有效捕捉资产间的内在风险占优关系。为解决这一局限,本文提出一种基于SD检验统计量的创新聚类分析框架。方法上,本研究将SD理论与机器学习算法深度融合。超越传统依赖几何距离的限制,我们创新性地利用一阶、二阶和三阶SD的检验统计量构建“随机占优系数矩阵”。在此矩阵基础上,我们修改了经典的K-means和层次聚类算法。具体地,针对不同阶次的SD关系,我们推导出12种不同的算法变体。同时,我们构建了SD-SC系数和SD-DBI指数作为专门的有效性指标来评估聚类性能。实证上,我们分析了代表性发达市场(美国纳斯达克指数)和新兴市场(中国沪深100指数)的成分股数据。结果验证了所提方法的有效性和稳健性。此外,我们将聚类结果应用于单指数模型的修正和全局最小方差投资组合(GMVP)的构建。结果表明,所提方法有效促进了投资者的定制化资产配置,具有重要的理论价值和实践意义。

英文摘要

Stochastic Dominance (SD) theory provides a rigorous framework for selecting superior assets tailored to the asset allocation needs of investors with varying risk preferences (i.e., risk-averse, risk-seeking, and risk-neutral). However, traditional stock clustering methods typically rely on geometric metrics such as Euclidean distance, which often fail to effectively capture the intrinsic risk dominance relationships among assets. To address this limitation, this paper proposes an innovative clustering analysis framework based on SD test statistics. Methodologically, this study deeply integrates SD theory with machine learning algorithms. Transcending the limitations of traditional reliance on geometric distance, we innovatively utilize test statistics from first-, second-, and third-order SD to construct a "Stochastic Dominance Coefficient Matrix." Building upon this matrix, we modify the classic K-means and Hierarchical Clustering algorithms. Specifically, we derive 12 distinct algorithm variants tailored to different orders of SD relationships. Simultaneously, we construct the SD-SC coefficient and the SD-DBI index as specialized validity indices to evaluate the clustering performance. Empirically, we analyze constituent stock data from a representative developed market (the US NASDAQ Index) and an emerging market (China's CSI 100 Index). The results verify the effectiveness and robustness of the proposed method. Furthermore, we apply the clustering results to the modification of the Single Index Model and the construction of Global Minimum Variance Portfolios (GMVP). The findings demonstrate that the proposed method effectively facilitates customized asset allocation for investors, holding significant theoretical value and practical implications.