arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 4033
2511.17228 2026-06-16 quant-ph cs.LG 版本更新

Intrinsic preservation of plasticity in continual quantum learning

连续量子学习中可塑性的内在保持

Yu-Qin Chen, Shi-Xin Zhang

发表机构 * Graduate School of China Academy of Engineering Physics(中国工程物理研究院研究生部) Institute of Physics, Chinese Academy of Sciences(中国科学院物理研究所)

AI总结 量子学习模型通过其内在的物理约束(如酉变换)自然克服了经典深度学习中的可塑性丧失问题,在监督学习和强化学习等多种任务中保持长期学习能力。

Comments 17 pages, 13 figures

详情
AI中文摘要

动态现实环境中的人工智能需要具备持续学习的能力。然而,标准深度学习面临一个基本问题:可塑性丧失,即网络逐渐失去从新数据中学习的能力。在这里,我们展示量子学习模型自然地克服了这一限制,在长时间尺度上保持可塑性。我们在来自多个学习范式(包括监督学习和强化学习)以及多种数据模态(从经典高维图像到量子原生数据集)的广泛任务中系统地证明了这一优势。尽管经典模型表现出与无界权重和梯度增长相关的性能退化,但量子神经网络无论数据或任务如何都保持一致的学习能力。我们将这一优势的根源归因于量子模型的内在物理约束。与经典网络中无界权重增长导致景观崎岖或饱和不同,酉约束将优化限制在一个紧致流形上。我们的结果表明,量子计算在机器学习中的效用不仅限于潜在的加速,还为构建自适应人工智能和终身学习者提供了一条稳健的途径。

英文摘要

Artificial intelligence in dynamic, real-world environments requires the capacity for continual learning. However, standard deep learning suffers from a fundamental issue: loss of plasticity, in which networks gradually lose their ability to learn from new data. Here we show that quantum learning models naturally overcome this limitation, preserving plasticity over long timescales. We demonstrate this advantage systematically across a broad spectrum of tasks from multiple learning paradigms, including supervised learning and reinforcement learning, and diverse data modalities, from classical high-dimensional images to quantum-native datasets. Although classical models exhibit performance degradation correlated with unbounded weight and gradient growth, quantum neural networks maintain consistent learning capabilities regardless of the data or task. We identify the origin of the advantage as the intrinsic physical constraints of quantum models. Unlike classical networks where unbounded weight growth leads to landscape ruggedness or saturation, the unitary constraints confine the optimization to a compact manifold. Our results suggest that the utility of quantum computing in machine learning extends beyond potential speedups, offering a robust pathway for building adaptive artificial intelligence and lifelong learners.

2505.03201 2026-06-16 stat.ML cs.LG 版本更新

Enhancing Visual Feature Attribution via Weighted Integrated Gradients

通过加权积分梯度增强视觉特征归因

Kien Tran Duc Tuan, Tam Nguyen Trong, Son Nguyen Hoang, Khoat Than, Anh Nguyen Duc

发表机构 * Institute of Information and Communication Technology, Vietnam Academy of Science and Technology(越南科学与技术学院信息与通信技术研究所)

AI总结 针对积分梯度方法对基线选择敏感的问题,提出加权积分梯度,通过无监督准则自适应选择和加权基线,在保持公理性质的同时提升归因可靠性,实验显示在卷积和Transformer架构上最高提升36%。

详情
AI中文摘要

积分梯度(IG)是可解释AI中广泛使用的归因方法,尤其在需要可靠特征归因的计算机视觉应用中。IG的一个关键限制是其对基线(参考)图像选择的敏感性。多基线扩展如期望梯度(EG)假设基线均匀加权,隐含地认为所有基线图像信息量相等。在高维视觉模型中,这一假设常导致噪声或不稳定的解释。本文提出加权积分梯度(WG),一种通过评估和加权基线来增强归因可靠性的原则性方法。WG引入了一个无监督的基线适用性标准,实现了基于每个输入的自适应基线选择和加权。该方法在广义加权基线形式下保留了IG的核心公理性质。在预期的、基于代理的适应度-相关性单调性假设下,WG为更信息丰富的基线分配更大权重提供了概率依据。在常用图像数据集和模型上的实验表明,在我们的协议下,WG优于EG,在评估的卷积和Transformer架构上最高提升36%。这些提升伴随着额外的适应度评估成本,因此WG应被视为归因保真度的权衡,而非EG的更快替代方案。通过超越所有基线贡献相等的假设,加权积分梯度为解释计算机视觉模型提供了更清晰、更可靠的方法,提高了可解释AI的理解和实际可用性。

英文摘要

Integrated Gradients (IG) is a widely used attribution method in explainable AI, particularly in computer vision applications where reliable feature attribution is essential. A key limitation of IG is its sensitivity to the choice of baseline (reference) images. Multi-baseline extensions such as Expected Gradients (EG) assume uniform weighting over baselines, implicitly treating all baseline images as equally informative. In high-dimensional vision models, this assumption often leads to noisy or unstable explanations. This paper proposes Weighted Integrated Gradients (WG), a principled approach that evaluates and weights baselines to enhance attribution reliability. WG introduces an unsupervised criterion for baseline suitability, enabling adaptive selection and weighting of baselines on a per-input basis. The method preserves the core axiomatic properties of IG in a generalized weighted-baseline form. Under an expected, proxy-based fitness--relevance monotonicity assumption, WG provides a probabilistic justification for assigning larger weights to more informative baselines. Experiments on commonly used image datasets and models show that WG improves over EG under our protocol, with up to 36% gains across evaluated convolutional and Transformer architectures. These gains come with additional fitness-evaluation cost, so WG should be viewed as an attribution-fidelity trade-off rather than a faster alternative to EG. By moving beyond the assumption that all baselines contribute equally, Weighted Integrated Gradients offers a clearer and more reliable approach to explaining computer-vision models, improving both understanding and practical usability in explainable AI.

2505.13553 2026-06-16 cs.SE cs.LG 版本更新

Towards Functional Correctness of Large Code Models with Selective Generation

面向大型代码模型的功能正确性:选择性生成方法

Jaewoo Jeong, Taesoo Kim, Sangdon Park

发表机构 * KAIST(韩国科学技术院)

AI总结 针对代码生成模型的幻觉问题,提出利用动态代码分析自动生成单元测试,基于功能正确性评估进行选择性生成,以控制非弃权答案的错误发现率,并引入FuzzEval范式用于精确评估。

Comments ICML 2026

详情
AI中文摘要

代码生成模型的幻觉阻碍了其在需要更高安全标准的系统中的应用。解决代码幻觉的一个关键瓶颈是难以识别生成代码的功能正确性,因为其形式不自然。我们通过利用代码的可执行性质,使用动态代码分析工具自动生成单元测试来解决这一核心瓶颈。据此,我们提出了一种选择性代码生成器,它基于生成的单元测试评估的功能正确性,放弃不确定的生成,从而在理论上控制非弃权答案的正确性,即错误发现率。最后,我们建议在评估以及学习中使用生成的单元测试进行精确代码评估,称此范式为FuzzEval。我们展示了我们方法的有效性,以及代码幻觉的可控性和合理的选择效率。

英文摘要

The hallucination of code generation models hinders their applicability to systems requiring higher safety standards. One critical bottleneck in addressing code hallucination is the difficulty of identifying the functional correctness of generated code, due to its unnatural form. We address this core bottleneck by automatically generating unit tests using dynamic code analysis tools, leveraging the \emph{executable nature} of code. Accordingly, we propose a \emph{selective code generator} that abstains from uncertain generations -- based on the functional correctness evaluated by generated unit tests -- to theoretically control the correctness among non-abstained answers, \ie the false discovery rate. Finally, we propose to use generated unit tests in evaluation as well as in learning for precise code evaluation, calling this paradigm \emph{FuzzEval}. We demonstrate the efficacy of our method along with the controllability of code hallucination and reasonable selection efficiency.

2510.18189 2026-06-16 cs.GR cs.CV 版本更新

A Generalizable Light Transport 3D Embedding for Global Illumination

一种可泛化的全局光照光传输3D嵌入

Bing Xu, Mukund Varma T, Cheng Wang, Tzu-Mao Li, Lifan Wu, Bartlomiej Wronski, Ravi Ramamoorthi, Marco Salvi

发表机构 * UC San Diego and NVIDIA USA(加州大学圣迭戈分校和美国NVIDIA公司) UC San Diego USA(加州大学圣迭戈分校(美国)) NVIDIA USA(美国NVIDIA公司) UC San Diego USA and NVIDIA USA(加州大学圣迭戈分校和美国NVIDIA公司)

AI总结 提出一种可泛化的3D光传输嵌入方法,通过点云和Transformer直接预测全局光照,无需光栅化或路径追踪线索,适用于多种室内场景。

Comments SIGGRAPH 2026

详情
AI中文摘要

全局光照(GI)对于真实感渲染至关重要,但由于模拟间接光传输的复杂性,计算成本仍然很高。最近的神经方法主要依赖于逐场景优化,有时扩展到处理相机或几何体的变化。跨场景泛化的努力大多停留在2D屏幕空间,例如神经去噪或基于G-buffer的GI预测,这些方法常常遭受视角不一致和空间理解有限的问题。我们提出了一种可泛化的3D光传输嵌入,直接从3D场景配置近似全局光照,而不使用光栅化或路径追踪线索。每个场景被表示为具有几何和材质特征的点云。一个可扩展的Transformer建模全局点对点交互,将这些特征编码为神经基元。在渲染时,每个查询点通过最近邻搜索检索附近的基元,并通过交叉注意力聚合它们的潜在特征,以预测所需的渲染量。我们展示了在具有不同布局、几何体和材质的多样化室内场景中,漫反射全局光照预测的结果。为辐照度估计训练的嵌入可以通过有限的微调快速适应新的渲染任务。我们还展示了用于光泽材质空间方向辐射场估计的初步结果,并展示了归一化场如何加速无偏路径引导。该方法突显了一条将学习先验集成到渲染管线中的路径,而无需显式的光线追踪光照线索。

英文摘要

Global illumination (GI) is essential for realistic rendering but remains computationally expensive due to the complexity of simulating indirect light transport. Recent neural methods have mainly relied on per-scene optimization, sometimes extended to handle changes in camera or geometry. Efforts toward cross-scene generalization have largely stayed in 2D screen space, such as neural denoising or G-buffer based GI prediction, which often suffer from view inconsistency and limited spatial understanding. We propose a generalizable 3D light transport embedding that approximates global illumination directly from 3D scene configurations, without using rasterized or path-traced cues. Each scene is represented as a point cloud with geometric and material features. A scalable transformer models global point-to-point interactions to encode these features into neural primitives. At render time, each query point retrieves nearby primitives via nearest-neighbor search and aggregates their latent features through cross-attention to predict the desired rendering quantity. We demonstrate results on diffuse global illumination prediction across diverse indoor scenes with varying layouts, geometry, and materials. The embedding trained for irradiance estimation can be quickly adapted to new rendering tasks with limited fine-tuning. We also present preliminary results for spatial-directional radiance field estimation for glossy materials and show how the normalized field can accelerate unbiased path guiding. This approach highlights a path toward integrating learned priors into rendering pipelines without explicit ray-traced illumination cues.

2510.07063 2026-06-16 cs.HC cs.RO 版本更新

Artists' Views on Robotics Involvement in Painting Productions

艺术家对机器人参与绘画创作的看法

Francesca Cocchella, Nilay Roy Choudhury, Eric Chen, Patrícia Alves-Oliveira

发表机构 * CONTACT Unit, Italian Institute of Technology(意大利理工学院联络单位) University of Michigan(密歇根大学)

AI总结 通过八位抽象艺术家与机器人合作绘画的实证研究,发现人机协作更富趣味性和反思性,提供更大自主性,并激发克服系统限制的新策略。

Comments 10 pages, 9 figures, submitted to RAM special issue: Arts and Robotics

详情
AI中文摘要

随着机器人技术的发展,其在艺术创作中的潜力成为一个日益相关的研究课题。本研究探讨了专业抽象艺术家如何感知和体验与自主绘画机械臂的协同创作互动。八位艺术家参与了六次绘画会话——三次与人类伙伴,随后三次与机器人——并随后参加了通过反思性主题分析分析的半结构化访谈。人与人之间的互动被描述为直观、对话性和情感投入,而人与机器人的会话则感觉更有趣和反思性,提供了更大的自主性,并促使采用新颖策略来克服系统的局限性。这项工作提供了对艺术家与机器人真实体验的首批实证研究之一,强调了长期参与和多学科方法在人机协同创作中的价值。

英文摘要

As robotic technologies evolve, their potential in artistic creation becomes an increasingly relevant topic of inquiry. This study explores how professional abstract artists perceive and experience co-creative interactions with an autonomous painting robotic arm. Eight artists engaged in six painting sessions -- three with a human partner, followed by three with the robot -- and subsequently participated in semi-structured interviews analyzed through reflexive thematic analysis. Human-human interactions were described as intuitive, dialogic, and emotionally engaging, whereas human-robot sessions felt more playful and reflective, offering greater autonomy and prompting for novel strategies to overcome the system's limitations. This work offers one of the first empirical investigations into artists' lived experiences with a robot, highlighting the value of long-term engagement and a multidisciplinary approach to human-robot co-creation.

2510.06647 2026-06-16 stat.ML cs.LG 版本更新

Q-Learning with Fine-Grained Gap-Dependent Regret

具有细粒度间隙依赖遗憾的Q学习

Haochen Zhang, Zhong Zheng, Lingzhou Xue

发表机构 * Department of Statistics, The Pennsylvania State University(统计学系,宾夕法尼亚州立大学)

AI总结 针对表格型马尔可夫决策过程,提出细粒度间隙依赖遗憾界,分别改进UCB和非UCB算法,并修正了AMB算法的设计缺陷。

详情
AI中文摘要

我们研究了在情节式表格马尔可夫决策过程中无模型强化学习的细粒度间隙依赖遗憾界。现有的无模型算法实现了极小化极大最坏情况遗憾,但其间隙依赖界仍然粗糙,未能完全捕捉次优间隙的结构。我们通过为基于UCB和非UCB的算法建立细粒度间隙依赖遗憾界来解决这一限制。在基于UCB的设置中,我们开发了一个新颖的分析框架,明确分离了最优和次优状态-动作对的分析,从而为UCB-Hoeffding (Jin et al., 2018) 提供了第一个细粒度遗憾上界。为了突出该框架的通用性,我们引入了ULCB-Hoeffding,这是一种受AMB (Xu et al., 2021) 启发但结构简化的新UCB算法,它享有细粒度遗憾保证并在经验上优于AMB。在非UCB设置中,我们重新审视了唯一已知的算法AMB,并识别出其算法设计和分析中的两个关键问题:Q更新中的不当截断以及其集中论证中鞅差条件的违反。我们提出了AMB的改进版本,解决了这些问题,为非UCB方法建立了第一个严格的细粒度间隙依赖遗憾,实验表明其性能优于AMB。

英文摘要

We study fine-grained gap-dependent regret bounds for model-free reinforcement learning in episodic tabular Markov Decision Processes. Existing model-free algorithms achieve minimax worst-case regret, but their gap-dependent bounds remain coarse and fail to fully capture the structure of suboptimality gaps. We address this limitation by establishing fine-grained gap-dependent regret bounds for both UCB-based and non-UCB-based algorithms. In the UCB-based setting, we develop a novel analytical framework that explicitly separates the analysis of optimal and suboptimal state-action pairs, yielding the first fine-grained regret upper bound for UCB-Hoeffding (Jin et al., 2018). To highlight the generality of this framework, we introduce ULCB-Hoeffding, a new UCB-based algorithm inspired by AMB (Xu et al.,2021) but with a simplified structure, which enjoys fine-grained regret guarantees and empirically outperforms AMB. In the non-UCB-based setting, we revisit the only known algorithm AMB, and identify two key issues in its algorithm design and analysis: improper truncation in the $Q$-updates and violation of the martingale difference condition in its concentration argument. We propose a refined version of AMB that addresses these issues, establishing the first rigorous fine-grained gap-dependent regret for a non-UCB-based method, with experiments demonstrating improved performance over AMB.

2510.04127 2026-06-16 cs.IR cs.AI cs.CV cs.LG 版本更新

Projection and Quantisation: A Unifying View of Learning to Hash, from Random Projections to the RAG Era

投影与量化:学习哈希的统一视角,从随机投影到RAG时代

Sean Moran

发表机构 * Independent Researcher(独立研究者) London United Kingdom(伦敦英国)

AI总结 提出投影-量化-组织(PQO)框架,统一理解从局部敏感哈希到深度哈希、乘积量化、图索引及向量数据库二进制嵌入的方法,并通过可复现实验揭示量化轴上的内存-质量权衡。

Comments 80 pages, 19 figures, 22 tables. Survey. Accompanying open benchmark (BitBudget): https://github.com/sjmoran/bitbudget ; live leaderboard: https://sjmoran.github.io/bitbudget/

详情
AI中文摘要

近似最近邻(ANN)搜索支撑着大规模检索,尤其是在增强大型语言模型的检索增强生成管道中,但解决该问题的方法已在不同社区中激增,以至于很少被视为一个统一领域。我们认为它们构成一个具有三个设计选择的领域,并开发了投影-量化-组织(PQO)视角,在该视角下,局部敏感哈希、学习二进制哈希、深度端到端哈希、乘积量化、基于图的索引以及现代向量数据库的二进制嵌入都是三个耦合问题的设置:投影放置在哪里,量化阈值放置在哪里,以及如何组织生成的编码。投影然后量化的解读是已有的;我们的贡献是第三个同等重要的组织阶段,证明这三个阶段从该领域的起源到深度、乘积量化、图和检索增强时代一脉相承,以及一个可复现的测量,将视角从分类方法转向预测方法。该测量得出三个发现。首先,内存节省在量化轴上:一位编码的大小是浮点数的三十二分之一,而在短候选列表上单次全精度重排序即可完全恢复未压缩的质量。其次,视角预期的权衡顺序在嵌入增长时保持不变。第三,在有监督的情况下,八字节编码的质量比其替换的两千字节浮点数提高一倍以上。我们将这些测量结果发布为BitBudget,一个带有实时排行榜的可扩展基准,将生成式检索的“语义标识符”重新解释为量化编码,并指出随着紧凑编码重回大规模检索中心,随之而来的开放问题。

英文摘要

Approximate nearest-neighbour search underpins large-scale retrieval and retrieval-augmented generation, yet its methods are studied in communities that seldom read one another. We argue that they form one field with three design choices. We develop the projection-quantisation-organisation lens: every method places its projections, places its quantisation thresholds, and organises the resulting codes for search. We test the lens with a reproducible measurement, released as the open BitBudget benchmark, and report three findings. First, the quantisation axis delivers the largest memory savings: a one-bit code with full-precision re-ranking matches uncompressed quality for six of seven embedders, the scanned code one thirty-second of the float's size. Second, the orderings the lens anticipates, including a learned-embedding regime where binary codes overtake an inverted-file product quantiser at a matched byte budget, recur as the embedding is enlarged. Third, given class labels, an eight-byte supervised code more than doubles the retrieval quality of the two-kilobyte task-agnostic float it replaces. We also recast the semantic identifiers of generative retrieval as quantisation codes. The main contribution is a single, tested account of compact-code search, from random projections to the retrieval-augmented era.

2509.07561 2026-06-16 cs.MA cs.RO 版本更新

Bio-inspired decision making in robot swarms under biases

偏差下机器人群体中的生物启发式决策

Raina Zakir, Timoteo Carletti, Marco Dorigo, Andreagiovanni Reina

发表机构 * IRIDIA, Université Libre de Bruxelles(布鲁塞尔自由大学IRIDIA实验室) Department of Mathematics and Namur Institute for Complex Systems, naXys, University of Namur(纳慕尔大学数学系和复杂系统纳慕尔研究所) Centre for the Advanced Study of Collective Behaviour, Universität Konstanz(康斯坦茨大学集体行为高级研究所以) Department of Computer and Information Science, Universität Konstanz(康斯坦茨大学计算机与信息科学系) Department of Collective Behaviour, Max Planck Institute of Animal Behaviour(动物行为Max Planck研究所集体行为系)

AI总结 研究在存在非社会偏差时,直接切换与交叉抑制两种意见动力学机制对机器人群体决策性能的影响,发现交叉抑制在偏差条件下更优。

详情
AI中文摘要

最小化机器人群体提供了一种可扩展、鲁棒且成本效益高的方法来执行复杂任务,有望改变医疗、灾难响应和环境监测等领域的应用。然而,协调这种去中心化系统仍然是一个基本挑战,特别是当机器人在通信、计算和内存方面受限时。在我们的研究中,单个机器人在感知环境时经常出错,但群体能够快速可靠地就$n$个离散选项中的最佳选项达成共识。我们比较了两种典型的意见动力学机制——直接切换和交叉抑制——它们是简单而有效的集体信息处理规则,在从神经群体到昆虫群体的生物系统中广泛存在。我们通过考虑影响意见动力学的非社会偏差,推广了现有的平均场模型。使用直接切换的群体在没有非社会动力学时能可靠地选择最佳选项,但一旦引入此类偏差,其性能下降,常常导致决策死锁。相比之下,受生物启发的交叉抑制在广泛的偏差条件下实现了更快、更一致、更准确、更鲁棒和更可扩展的决策。我们的研究结果为最小化群体的协调提供了理论和实践见解,并扩展到了生物学和工程学中广泛类别的去中心化决策系统。

英文摘要

Minimalistic robot swarms offer a scalable, robust, and cost-effective approach to performing complex tasks with the potential to transform applications in healthcare, disaster response, and environmental monitoring. However, coordinating such decentralised systems remains a fundamental challenge, particularly when robots are constrained in communication, computation, and memory. In our study, individual robots frequently make errors when sensing the environment, yet the swarm can rapidly and reliably reach consensus on the best among $n$ discrete options. We compare two canonical mechanisms of opinion dynamics -- direct-switch and cross-inhibition -- which are simple yet effective rules for collective information processing observed in biological systems across scales, from neural populations to insect colonies. We generalise the existing mean-field models by considering asocial biases influencing the opinion dynamics. While swarms using direct-switch reliably select the best option in absence of asocial dynamics, their performance deteriorates once such biases are introduced, often resulting in decision deadlocks. In contrast, bio-inspired cross-inhibition enables faster, more cohesive, accurate, robust, and scalable decisions across a wide range of biased conditions. Our findings provide theoretical and practical insights into the coordination of minimal swarms and offer insights that extend to a broad class of decentralised decision-making systems in biology and engineering.

2509.06108 2026-06-16 cs.CG cs.LG 版本更新

Using Reinforcement Learning to Optimize the Global and Local Crossing Number

使用强化学习优化全局和局部交叉数

Timo Brand, Henry Förster, Stephen Kobourov, Daniel Kohrt, Robin Schukrafft, Markus Wallinger, Johannes Zink

发表机构 * Technical University of Munich, Heilbronn, Germany(慕尼黑技术大学(海因斯贝格)) John Cabot University, Rome, Italy(约翰·卡博特大学) Technical University of Munich, Garching, Germany(慕尼黑技术大学(戈林根))

AI总结 将图绘制视为单玩家优化游戏,利用强化学习通过移动顶点减少边交叉,提出一种优化全局或局部交叉数的策略,在局部交叉数最小化上具有竞争力。

详情
AI中文摘要

图绘制关注图的算法可视化。一个好的图绘制易于阅读并有助于解决图上的任务。已确定好的图绘制中出现的几个属性。这些属性包括低交叉数、边之间的大角度、短边以及描绘对称性。其中许多属性是可明确度量的指标。这使我们认识到图绘制可以看作一个游戏。在本文中,我们研究一个单玩家优化游戏,其中玩家迭代移动直线图绘制的顶点以减少边交叉。该游戏自然产生于图绘制挑战赛的自动赛道,其中解决方案通过重复执行局部顶点移动获得。我们将此过程形式化为一个具有完全信息的游戏,并研究强化学习是否能发现有效的策略来玩这个游戏。我们的强化学习代理观察顶点的局部几何和结构上下文,并选择一个移动方向,目标是减少全局或局部交叉数,即总交叉数或每条边的最大交叉数。我们将所得策略与现有方法和标准基准图上的既定交叉最小化启发式算法进行比较。虽然我们的方法在最小化全局交叉数方面未超越最先进的方法,但在最小化局部交叉数方面具有竞争力且通常更优。

英文摘要

Graph drawing concerns the algorithmic visualization of graphs. A good drawing of a graph is easy to read and facilitates solving tasks on the graph. Several properties have been identified to occur in good drawings of graphs. Such properties include a low number of crossings, large angles between edges, short edges, and depicting symmetries. Many of these properties are explicitly measurable metrics. This brings us to the insight that graph drawing can be seen as a game. In this paper, we study a single-player optimization game in which the player iteratively moves vertices of a straight-line graph drawing to reduce edge crossings. This game arose naturally from the automatic track of the Graph Drawing Challenge, where solutions are obtained by repeatedly performing local vertex movements. We formalize this process as a game with full information and investigate whether reinforcement learning can discover effective strategies for playing it. Our reinforcement-learning agent observes the local geometric and structural context of a vertex and selects a movement direction with the goal of reducing either the global or the local crossing number, that is, the total number of crossings or the maximum number of crossings per edge. We compare the resulting strategies to existing methods and established crossing-minimization heuristics on standard benchmark graphs. While our approach does not out-compete state-of-the-art methods for minimizing the global crossing number, it is competitive and often superior for minimizing the local crossing number.

2505.13986 2026-06-16 math.OC cs.AI cs.LG 版本更新

RIDGECUT: Learning Graph Partitioning with Rings and Wedges

RIDGECUT:基于环与楔形结构的图分割学习

Qize Jiang, Angelo Zangari, Linsey Pang, Alice Gatti, Mahima Aggarwal, Giovanna Vantini, Xiaosong Ma, Weiwei Sun, Sourav Medya, Sanjay Chawla

发表机构 * College of Computer Science and Artificial Intelligence, Shanghai Key Laboratory of Data Science(计算机科学与人工智能学院,上海数据科学重点实验室) University of Illinois Chicago(伊利诺伊大学芝加哥分校) PayPal Inc.(PayPal公司) Center for AI Safety(人工智能安全中心) Qatar Computing Research Institute(卡塔尔计算研究所) Hamad Bin Khalifa University(哈马德·本·卡西姆大学) Computing and Mathematical Sciences (CMS) Division(计算与数学科学(CMS)部门) Mohamed bin Zayed University of Artificial Intelligence (MBZUAI)(Mohamed bin Zayed人工智能大学(MBZUAI)) Fudan University(复旦大学)

AI总结 提出RidgeCut框架,通过将动作空间约束为环与楔形结构,利用强化学习解决归一化割问题,在交通网络上实现结构感知分割,降低归一化割值并展现强泛化能力。

Comments Extended version of the paper accepted at KDD 2026

详情
AI中文摘要

强化学习通过学习跨实例泛化的启发式方法,在图的组合优化问题上展现出潜力。然而,如何有效地将领域知识融入强化学习框架进行图分割仍然具有挑战性,因为现有方法通常依赖于无约束的节点级动作,导致动作空间大且探索效率低。在本文中,我们提出RidgeCut,一种强化学习框架,通过约束动作空间来在归一化割问题中实现结构感知分割。以交通网络为动机示例,我们引入了一个利用城市道路拓扑领域知识的新概念——其中自然分割通常呈现为同心环和径向楔形。通过将图转换为线性或圆形表示,我们的方法能够使用基于变换器的策略并通过近端策略优化进行高效学习。RidgeCut产生的分割不仅与预期的空间布局一致,而且与现有方法相比实现了更低的归一化割值。在合成和真实交通图上的实验结果表明,RidgeCut在跨图大小的归纳泛化方面始终优于现有方法。尽管以道路网络为动机,RidgeCut为将结构先验嵌入到图分割的强化学习框架中提供了一种通用机制。

英文摘要

Reinforcement learning (RL) has shown promise for combinatorial optimization problems on graphs by learning heuristics that generalize across instances. However, effectively incorporating domain knowledge into RL frameworks for graph partitioning remains challenging, as existing approaches typically rely on unconstrained node-level actions that lead to large action spaces and inefficient exploration. In this paper, we propose RidgeCut, an RL framework that constrains the action space to enforce structure-aware partitioning in the Normalized Cut problem. Using transportation networks as a motivating example, we introduce a novel concept that leverages domain knowledge about urban road topology -- where natural partitions often take the form of concentric rings and radial wedges. By transforming the graph into linear or circular representations, our method enables the use of transformer-based policies and efficient learning via Proximal Policy Optimization. The resulting partitions from RidgeCut are not only aligned with expected spatial layouts but also achieve lower normalized cuts compared to existing methods. Experimental results on synthetic and real-world traffic graphs demonstrate that RidgeCut consistently outperforms existing methods while exhibiting strong inductive generalization across graph sizes. Although motivated by road networks, RidgeCut provides a general mechanism for embedding structural priors into RL frameworks for graph partitioning.

2508.03867 2026-06-16 math.AG cs.LG stat.ML 版本更新

Constraining the outputs of ReLU neural networks

约束ReLU神经网络的输出

Yulia Alexandr, Guido Montúfar

发表机构 * University of California, Los Angeles(加州大学洛杉矶分校) Max Planck Institute for Mathematics in the Sciences(马克斯·普朗克数学研究所)

AI总结 通过引入与ReLU网络相关的代数簇,利用激活区域内的秩约束推导多项式方程,刻画网络可表示的函数,并研究簇达到预期维度的条件。

Comments 33 pages, 4 figures

详情
AI中文摘要

我们引入了一类与ReLU神经网络自然相关的代数簇,这些代数簇源于网络输出在输入空间激活区域上的分段线性结构,以及在参数空间上的分段多线性结构。通过分析每个激活区域内网络输出的秩约束,我们推导出刻画网络可表示函数的多项式方程。我们进一步研究了这些簇达到预期维度的条件,从而深入理解ReLU网络的表达能力和结构特性。

英文摘要

We introduce a class of algebraic varieties naturally associated with ReLU neural networks, arising from the piecewise linear structure of their outputs across activation regions in input space, and the piecewise multilinear structure in parameter space. By analyzing the rank constraints on the network outputs within each activation region, we derive polynomial equations that characterize the functions representable by the network. We further investigate conditions under which these varieties attain their expected dimension, providing insight into the expressive and structural properties of ReLU networks.

2507.17804 2026-06-16 astro-ph.HE astro-ph.CO astro-ph.IM cs.LG hep-ph 版本更新

On the Energy Distribution of the Galactic Center Excess' Sources

银河系中心过量辐射源的能谱分布

Florian List, Yujin Park, Nicholas L. Rodd, Eve Schoen, Florian Wolf

发表机构 * Department of Astrophysics, University of Vienna(维也纳大学天体物理系) Theory Group, Lawrence Berkeley National Laboratory(伯克利劳伦斯国家实验室理论组) Berkeley Center for Theoretical Physics, University of California(加州大学伯克利分校理论物理中心) University of California, Berkeley(加州大学伯克利分校) Lawrence Berkeley National Laboratory(伯克利劳伦斯国家实验室)

AI总结 利用基于神经网络模拟的推理方法联合分析空间和能谱数据,发现银河系中心过量辐射若由点源贡献,所需源数量比之前估计高两个数量级,支持其可能为暗物质湮灭产生的弥散辐射。

Comments 7+22 pages, 2+22 figures; v2: journal version

详情
AI中文摘要

银河系中心过量辐射(GCE)可能预示着湮灭暗物质的发现。但与此结论相悖的分析表明,在发射的空间结构内存在暗弱点源的证据。由于技术限制,这些分析纯粹基于空间信息,丢弃了所有可能将过量辐射与天体物理背景区分开来的能谱信息。在这里,我们证明基于神经网络模拟的推理方法可以联合分析空间和能谱数据。这一改进意义深远:能量信息使假定的点源显著变暗,表明GCE本质上是弥散的,或者由异常大量的源组成。定量而言,对于我们的最佳拟合背景模型,过量辐射基本上与暗物质预测的泊松发射一致。如果由点源引起,我们的中值预测为$\mathcal{O}(10^5)$个源,或在90%置信度下超过35,000个,两者都比早期GCE点源分析所偏好的数百个源高出几个数量级,尽管背景系统学允许的变化可能将所需源数量减少大约一个数量级。

英文摘要

The Galactic Center Excess (GCE) may yet herald the discovery of annihilating dark matter. Weighing against that conclusion are analyses showing evidence for dim point sources within the spatial structure of the emission. Due to technical limitations these analyses are purely spatial with all spectral information that could disentangle the excess from astrophysical backgrounds discarded. Here, we demonstrate that a neural network simulation-based inference approach can jointly analyze the spatial and spectra data. The addition is profound: energy information drives the putative point sources to be significantly dimmer, indicating either the GCE is truly diffuse in nature or made of an exceptionally large number of sources. Quantitatively, for our best fit background model, the excess is essentially consistent with Poisson emission as predicted by dark matter. If due to point sources, our median prediction is $\mathcal{O}(10^5)$ sources, or more than 35,000 at 90\% confidence, both orders of magnitude larger than the hundreds preferred by earlier point-source analyses of the GCE, although variations allowed by background systematics could reduce the required number of sources by roughly an order of magnitude.

2504.11775 2026-06-16 stat.ML cs.CY cs.LG q-fin.RM 版本更新

Discrimination-free Insurance Pricing with Privatized Sensitive Attributes

基于隐私化敏感属性的无歧视保险定价

Tianhe Zhang, Suhan Liu, Peng Shi

发表机构 * Department of Risk and Insurance, University of Wisconsin-Madison(风险与保险系,威斯康星大学麦迪逊分校) Department of Statistics and Operations Research, University of North Carolina-Chapel Hill(统计与运筹系,北卡罗来纳大学教堂山分校)

AI总结 针对保险公司无法直接获取敏感属性(如性别、种族)的公平定价问题,提出利用隐私化(加噪)敏感属性估计无歧视保费的方法,并建立理论保证与实证验证。

详情
AI中文摘要

公平性已成为保险定价中的重要关注点,因为保险公司越来越依赖机器学习模型来预测预期损失。同时,监管和隐私约束通常限制保险公司访问或使用敏感属性(如性别或种族)。最近的精算研究通过无歧视保费的概念来解决这一背景下的公平性问题,该概念消除了敏感属性的直接和间接影响,同时保持精算一致性。然而,实施这种方法通常需要访问敏感属性本身,而在实践中可能无法获得。本文研究了当敏感属性仅以隐私化或噪声扰动形式被观测时,无歧视保险保费的估计问题。我们考虑一个多方数据设置,其中保险公司观测非敏感属性和结果,而一个可信第三方持有通过隐私机制生成的隐私化敏感属性。在此框架内,我们开发了仅使用隐私化属性估计无歧视保费的统计方法。我们研究了两种实际相关的情况:隐私机制已知和其噪声水平未知。对于这两种情况,我们为所提出的估计量建立了理论保证。数值实验和实证应用表明,所提出的方法能够在尊重隐私和监管约束的同时实现公平的保险定价。

英文摘要

Fairness has become an important concern in insurance pricing as insurers increasingly rely on machine learning models to predict expected losses. At the same time, regulatory and privacy constraints often restrict insurers' ability to access or use sensitive attributes such as gender or race. Recent actuarial research addresses fairness in this context through the concept of the discrimination-free premium, which removes both the direct and indirect effects of sensitive attributes while preserving actuarial consistency. However, implementing this approach typically requires access to the sensitive attributes themselves, which may not be available in practice. This paper studies the estimation of discrimination-free insurance premiums when sensitive attributes are observed only in privatized or noise-perturbed form. We consider a multi-party data setting in which insurers observe non-sensitive attributes and outcomes, while a trusted third party holds privatized sensitive attributes generated through a privacy mechanism. Within this framework, we develop statistical methods for estimating discrimination-free premiums using only the privatized attributes. We study two settings of practical relevance: when the privacy mechanism is known and when its noise level is unknown. For both cases, we establish theoretical guarantees for the proposed estimators. Numerical experiments and empirical applications demonstrate that the proposed approach enables fair insurance pricing while respecting privacy and regulatory constraints.

2506.20686 2026-06-16 q-bio.BM cs.DC cs.LG cs.PF 版本更新

MegaFold: Efficient Training of Next-Generation 3D Attention Protein Models on Cross-Platform GPUs

MegaFold: 跨平台GPU上高效训练下一代3D注意力蛋白质模型

Hoa La, Ahan Gupta, Alex Morehead, Jianlin Cheng, Minjia Zhang

发表机构 * UIUC SSAIL Lab(UIUC SSAIL实验室)

AI总结 针对AlphaFold3类模型因3D注意力机制导致训练效率低的问题,提出MegaFold系统,通过高效内核、分片策略、算子融合和流水线优化,在NVIDIA和AMD GPU上实现更长序列训练和加速。

Comments 13 pages, 12 figures

详情
AI中文摘要

最近,生物分子建模的进展受到AlphaFold3(AF3)等模型的推动,这些模型将科学信息引入Transformer架构。与Transformer不同,AF3风格模型的一个定义性特征是它们在二维成对表示上的3D注意力,这产生的张量的计算和内存成本随序列长度呈立方增长。因此,尽管参数数量适中,AF3风格模型的训练成本远高于同等大小的Transformer,并且受到GPU内存容量的严重限制。我们的特征分析表明,3D注意力从根本上改变了训练工作负载,导致巨大的3D注意力图、复杂的算子间依赖、内核碎片化和繁重的主机端数据管道,这些与LLM训练截然不同,导致现代GPU系统的利用率低下。此外,由于3D注意力引入的复杂跨层算子间依赖,现有的GPU优化未能充分应对这些挑战。受这些挑战的启发,我们引入了MegaFold,一个新颖的跨平台系统,用于高效训练下一代3D注意力蛋白质模型。MegaFold结合了内存高效的3D注意力内核、用于二次表示的通信高效分片策略、关键执行路径的融合算子实现,以及一个确定性感知的主机-设备流水线,消除了预处理停顿。在NVIDIA H200和AMD MI250 GPU上的评估表明,MegaFold能够在32个GPU上训练长达3.36倍的序列长度,同时将端到端执行时间减少高达1.73倍(NVIDIA)和1.62倍(AMD)。

英文摘要

Recent advances in biomolecular modeling have been catalyzed by models such as AlphaFold3 (AF3), which introduce science-informed changes to the transformer architecture. Unlike transformers, a defining characteristic of AF3-style models is their 3D attention over 2D pairwise representations which produces tensors whose computation and memory costs scale cubically with sequence length. As a result, despite moderate parameter counts, AF3-style models are far more expensive to train than size-equivalent transformers, and are severely constrained by GPU memory capacity. Our characterization shows 3D attention fundamentally changes the training workload, causing massive 3D attention maps, complex inter-operator dependencies, kernel fragmentation, and heavy host-side data pipelines which differ substantially from LLM training, leading to poor utilization on modern GPU systems. Moreover, existing GPU optimizations do not adequately address these challenges due to complex cross-layer inter-operator dependencies introduced by 3D attention. Motivated by these challenges, we introduce MegaFold, a novel cross-platform system for efficient training of next-generation 3D-attention protein models. MegaFold combines a memory-efficient 3D-attention kernel, a communication-efficient sharding strategy for quadratic representations, fused operator implementations for critical execution paths, and a determinism-aware host-device pipeline that eliminates preprocessing stalls. Evaluation on both NVIDIA H200 and AMD MI250 GPUs shows that MegaFold enables training with up to 3.36$\times$ longer sequence lengths on 32 GPUs while reducing end-to-end execution time by up to 1.73$\times$ (NVIDIA) and 1.62$\times$ (AMD).

2411.19567 2026-06-16 cs.SE cs.RO 版本更新

DynNPC: Finding More Violations Induced by ADS in Simulation Testing through Dynamic NPC Behavior Generation

DynNPC:通过动态NPC行为生成在仿真测试中发现更多由ADS引发的违规

You Lu, Yifan Tian, Dingji Wang, Bihuan Chen, Xin Peng

发表机构 * College of Computer Science and Artificial Intelligence, Fudan University(计算机科学与人工智能学院,复旦大学)

AI总结 提出DynNPC框架,让NPC车辆在仿真执行中根据交通信号和自车行为动态生成驾驶策略,以生成更多由自动驾驶系统(ADS)引发的违规场景,提升测试效率。

Comments Accepted by TOSEM 2026

详情
AI中文摘要

最近,许多仿真测试方法被提出,用于生成多样化的驾驶场景以测试自动驾驶系统(ADS)。然而,先前方法生成的场景中NPC车辆的行为是预定义并在仿真执行前变异的,忽略了交通信号和自车(Ego)车辆的行为。因此,它们发现的大量违规是由NPC车辆的不现实行为引发的,并未揭示ADS的缺陷。此外,迭代变异过程中NPC行为的巨大场景搜索空间限制了先前方法的效率。为解决这些限制,我们提出了一种新颖的基于场景的测试框架DynNPC,以生成更多由ADS引发的违规场景。具体来说,DynNPC允许NPC车辆在仿真执行期间根据交通信号和自车车辆的实时行为,使用不同的驾驶策略动态生成行为。我们将DynNPC与最先进的基于场景的测试方法进行比较。评估结果表明,DynNPC在发现更多由ADS引发的违规场景方面具有有效性和高效性。

英文摘要

Recently, a number of simulation testing approaches have been proposed to generate diverse driving scenarios for autonomous driving systems (ADSs) testing. However, the behaviors of NPC vehicles in these scenarios generated by previous approaches are predefined and mutated before simulation execution, ignoring traffic signals and the behaviors of the Ego vehicle. Thus, a large number of the violations they found are induced by unrealistic behaviors of NPC vehicles, revealing no bugs of ADSs. Besides, the vast scenario search space of NPC behaviors during the iterative mutations limits the efficiency of previous approaches. To address these limitations, we propose a novel scenario-based testing framework, DynNPC, to generate more violation scenarios induced by the ADS. Specifically, DynNPC allows NPC vehicles to dynamically generate behaviors using different driving strategies during simulation execution based on traffic signals and the real-time behavior of the Ego vehicle. We compare DynNPC with state-of-the-art scenario-based testing approaches. Our evaluation has demonstrated the effectiveness and efficiency of DynNPC in finding more violation scenarios induced by the ADS.

2401.14283 2026-06-16 stat.ML cs.LG 版本更新

Information Leakage Detection through Approximate Bayes-optimal Prediction

通过近似贝叶斯最优预测的信息泄露检测

Pritha Gupta, Marcel Wever, Eyke Hüllermeier

发表机构 * University of Potsdam(波恩大学) University of Hanover(汉诺威大学) Ludwig-Maximilians-University Munich(慕尼黑大学)

AI总结 提出基于统计学习与信息论的理论框架,通过自动机器学习近似贝叶斯预测器的对数损失和准确率来估计互信息,从而检测信息泄露,在合成和真实OpenSSL TLS服务器数据集上优于现有方法。

Comments Accepted at Information Sciences

详情
AI中文摘要

在当今数据驱动的世界中,公开可用信息的激增因信息泄露(IL)问题而引发安全担忧。IL涉及通过可观察的系统信息无意中将敏感信息暴露给未经授权的方。传统的统计方法依赖于估计可观察信息与秘密信息之间的互信息(MI)来检测IL,面临维度灾难、收敛性、计算复杂性和MI误估计的挑战。尽管有效,新兴的基于监督机器学习的方法检测IL仅限于二元系统敏感信息,并且缺乏全面的框架。为了解决这些局限性,我们利用统计学习理论和信息论建立了一个理论框架,以准确量化和检测IL。使用自动机器学习,我们证明通过近似通常未知的贝叶斯预测器的对数损失和准确率,可以准确估计MI。基于此,我们展示了如何有效估计MI以检测IL。在考虑合成和真实OpenSSL TLS服务器数据集的实证研究中,我们的方法优于最先进的基线方法。

英文摘要

In today's data-driven world, the proliferation of publicly available information raises security concerns due to the information leakage (IL) problem. IL involves unintentionally exposing sensitive information to unauthorized parties via observable system information. Conventional statistical approaches rely on estimating mutual information (MI) between observable and secret information for detecting ILs, face challenges of the curse of dimensionality, convergence, computational complexity, and MI misestimation. Though effective, emerging supervised machine learning based approaches to detect ILs are limited to binary system sensitive information and lack a comprehensive framework. To address these limitations, we establish a theoretical framework using statistical learning theory and information theory to quantify and detect IL accurately. Using automated machine learning, we demonstrate that MI can be accurately estimated by approximating the typically unknown Bayes predictor's log-loss and accuracy. Based on this, we show how MI can effectively be estimated to detect ILs. Our method performs superior to state-of-the-art baselines in an empirical study considering synthetic and real-world OpenSSL TLS server datasets.

2505.08774 2026-06-16 q-bio.BM cs.LG 版本更新

Generative Molecular Design with Steerable and Granular Synthesizability Control

具有可引导和粒度合成性控制的生成式分子设计

Jeff Guo, Víctor Sabanza-Gil, Olha Semenenko, Oleksii Hrabovskyi, Mykola Protopopov, Anna Kapeliukha, Oleksandr Mosia, Sofiia Hatych, Diana Alieksieieva, Tom Nelis, Patrick Molliet, Helena Solé-Àvila, Valentas Olikauskas, Nina Aregger, Irina Morozova, Joseph Schmidt, Zlatko Jončev, Olga Tarkhanova, Petro Borysko, Jerome Waser, Bruno Correia, Jeremy Luterbacher, Philippe Schwaller

发表机构 * Laboratory of Artificial Chemical Intelligence (LIAC)(人工化学智能实验室) NCCR Catalysis(催化联合研究所) Laboratory of Sustainable and Catalytic Processing (LPDC)(可持续与催化加工实验室) CHEMSPACE LLC Enamine Ltd.(Enamine有限公司) Taras Shevchenko National University of Kyiv(基塔斯·谢甫琴科基辅国立大学) V. P. Kukhar Institute of Bioorganic Chemistry and Petrochemistry(V. P. Kukhar生物有机化学与石油化学研究所) Palladin Institute of Biochemistry(Palladin生物化学研究所) Laboratory of Catalysis and Organic Synthesis (LCSO)(催化与有机合成实验室) Laboratory of Protein Design and Immunoengineering (LPDI)(蛋白质设计与免疫工程实验室)

AI总结 提出统一合成约束分子设计与超大规模虚拟筛选的生成框架,通过可引导和粒度合成性控制,生成满足多参数优化目标且具有预测合成路径的分子,在BRD4和Wee1靶点上验证了有效性。

详情
AI中文摘要

设计既具有最佳性质又易于合成的分子是药物发现中的核心挑战。现有考虑合成性的工作可以联合输出生成分子的预测合成路线。然而,在解决合成难易程度以及灵活纳入所需反应约束方面,关注甚少。另一方面,虚拟筛选搜索可商购化合物,但在扩展到超大规模(十亿级及以上)化学空间时带来挑战。在这里,我们提出一个生成式设计框架,通过可引导和粒度合成性控制,统一了合成约束分子设计与超大规模虚拟筛选。生成的分子满足任意多参数优化目标,其预测合成路线满足混合匹配约束:包括或排除特定反应、纳入特定构建模块以及最小化合成路线长度。在针对BRD4的端到端内部活动中,我们设计了可用特定选定反应和构建模块合成的分子,合成了所有六个选定化合物,并鉴定了两个微摩尔级结合剂。我们进一步证明,反应控制能够有效导航超大规模按需化学空间,以识别性质最优的候选分子。通过将我们的框架应用于Chemspace的Freedom 4.0按需空间(1420亿分子),我们在单个消费级GPU(仅8 GB GPU内存)上生成了约32万分子(库的0.00023%),并在60个合成候选物中鉴定出一个微摩尔级Wee1结合剂。因此,单一统一框架能够生成新颖的可合成分子并检索目录就绪候选物,为缓解合成性瓶颈提供了灵活解决方案。

英文摘要

Designing molecules that are both property-optimal and readily synthesizable is a central challenge in drug discovery. Existing works that do consider synthesizability can jointly output predicted synthesis routes for generated molecules. However, there has been minimal attention in addressing the ease of synthesis and with flexibility to incorporate desired reaction constraints. On the other hand, virtual screening searches for commercially available compounds, but imposes challenges when scaling to ultra-large (billion-size and beyond) chemical spaces. Here, we propose a generative design framework that unifies synthesis-constrained molecular design and ultra-large-scale virtual screening through steerable and granular synthesizability control. Generated molecules satisfy arbitrary multi-parameter optimization objectives with predicted synthesis routes satisfying mix-and-match constraints: including or avoiding certain reactions, incorporating specific building blocks, and minimizing synthesis route length. In an end-to-end in-house campaign targeting BRD4, we designed molecules synthesizable with specific selected reactions and building blocks, synthesized all six selected compounds, and identified two micromolar binders. We further demonstrate that reaction control enables efficient navigation of ultra-large make-on-demand chemical spaces to identify property-optimal candidates. By applying our framework to Chemspace's Freedom 4.0 make-on-demand space (142 billion molecules), we generated ~320k molecules (0.00023% of the library) on a single consumer-grade GPU (with only 8 GB GPU memory) and identified a micromolar Wee1 binder amongst 60 synthesized candidates. The single unified framework thus enables generating novel synthesizable molecules and retrieving catalogue-ready candidates, offering a flexible solution to mitigating the synthesizability bottleneck.

2505.05647 2026-06-16 eess.SP cs.CV 版本更新

A New k-Space Model for Non-Cartesian Fourier Imaging

一种用于非笛卡尔傅里叶成像的新k空间模型

Chin-Cheng Chan, Justin P. Haldar

发表机构 * USC Center for Advanced Research Computing(USC高级研究计算中心) Signal and Image Processing Institute(信号与图像处理研究所)

AI总结 针对传统基于体素的傅里叶成像模型计算成本高、收敛慢且易产生伪影的问题,提出一种基于傅里叶域基展开的新模型,在非笛卡尔MRI重建中实现更优图像质量和更低计算复杂度。

详情
AI中文摘要

在过去的几十年中,使用基于模型的方法重建傅里叶成像数据一直很流行,这些方法可以轻松地融入物理约束和先进的正则化/机器学习先验。最常见的建模方法是将连续图像表示为平移的“体素”基函数的线性组合。尽管这种基于体素的模型已被广泛研究和部署,但它存在长期以来的局限性,包括高计算成本、慢收敛和易产生伪影。在这项工作中,我们从新的角度重新审视该模型,识别出可能之前被忽视的新问题(包括不良近似、环绕和零空间特性)。我们的见解促使我们提出一种新模型,该模型对先前方法的局限性(旧的和新的)更具鲁棒性。具体来说,新模型基于傅里叶域基展开,而不是标准的图像域体素方法。在非笛卡尔MRI重建背景下呈现的示例结果表明,新模型能够改善图像质量(减少伪影)和/或降低计算复杂度(更快的计算和更好的收敛)。

英文摘要

For the past several decades, it has been popular to reconstruct Fourier imaging data using model-based approaches that can easily incorporate physical constraints and advanced regularization/machine learning priors. The most common modeling approach is to represent the continuous image as a linear combination of shifted "voxel" basis functions. Although well-studied and widely-deployed, this voxel-based model is associated with longstanding limitations, including high computational costs, slow convergence, and a propensity for artifacts. In this work, we reexamine this model from a fresh perspective, identifying new issues that may have been previously overlooked (including undesirable approximation, wrap-around, and nullspace characteristics). Our insights motivate us to propose a new model that is more resilient to the limitations (old and new) of the previous approach. Specifically, the new model is based on a Fourier-domain basis expansion rather than the standard image-domain voxel-based approach. Illustrative results, which are presented in the context of non-Cartesian MRI reconstruction, demonstrate that the new model enables improved image quality (reduced artifacts) and/or reduced computational complexity (faster computations and improved convergence).

2502.11201 2026-06-16 cs.DB cs.AI 版本更新

Bridging the Gap: Enabling Natural Language Queries for NoSQL Databases through Text-to-NoSQL Translation

弥合差距:通过文本到NoSQL翻译实现NoSQL数据库的自然语言查询

Jinwei Lu, Jiawei Lu, Chen Zhang, Zhiqian Qin, Haodi Zhang, Yuanfeng Song, Raymond Chi-Wing Wong

发表机构 * University of Science and Technology of China(中国科学技术大学) Tsinghua University(清华大学) The University of Hong Kong(香港大学)

AI总结 本文研究Text-to-NoSQL任务,提出TEND基准和SAG求解器,用于将自然语言请求翻译为MongoDB聚合管道,验证了无模式文档推理的独特挑战。

详情
AI中文摘要

NoSQL数据库是核心数据基础设施,但对其的自然语言访问仍不成熟:正确的查询生成必须恢复非关系数据模型如何表示实体、嵌套路径、数组、缺失字段和动态键。本文研究Text-to-NoSQL,将自然语言请求翻译为可执行的NoSQL查询,实例化为对无模式文档存储的MongoDB聚合管道。我们提出TEND(Text-to-NoSQL Dataset的缩写),一个执行验证的基准,包含11个数据库上的1,210个MongoDB原生任务。据我们所知,TEND是第一个数据库世界设计为MongoDB原生的Text-to-NoSQL基准:专家手动定义集合边界、嵌套数组、可选和稀疏路径、多态形状以及动态键约定;这些世界填充真实数据并通过冻结的MongoDB执行验证,因此TEND评估无模式文档推理而非SQL到MQL的迁移。我们进一步引入SAG(Schema-as-Data Grounding)求解器,该求解器在受限MQL生成、执行接地修复和结果一致性选择之前,从存储文档证据中诱导路径和值接地。评估使用受限列容忍执行准确率(EXC)作为主要指标,辅以分级结果集F1和互斥执行结果分解。实验表明,在NL2SQL上表现强劲的LLM在TEND上大幅下降,验证了Text-to-NoSQL作为一个独特的无模式文档推理问题。

英文摘要

NoSQL databases are core data infrastructure, yet natural-language access to them remains underdeveloped: correct query generation must recover how a non-relational data model represents entities, nested paths, arrays, missing fields, and dynamic keys. This paper studies Text-to-NoSQL, translating natural-language requests into executable NoSQL queries, instantiated with MongoDB aggregation pipelines over schema-less document stores. We present TEND, short for Text-to-NoSQL Dataset, an execution-verified benchmark with 1,210 MongoDB-native tasks across 11 databases. To our knowledge, TEND is the first Text-to-NoSQL benchmark whose database worlds are MongoDB-native by design: experts manually define collection boundaries, nested arrays, optional and sparse paths, polymorphic shapes, and dynamic-key conventions; these worlds are populated with real data and verified through frozen MongoDB execution, so TEND evaluates schema-less document reasoning rather than SQL-to-MQL transfer. We further introduce SAG, a Schema-as-Data Grounding solver that induces path and value grounding from stored-document evidence before bounded MQL generation, execution-grounded repair, and result-consistency selection. Evaluation uses bounded column-tolerant execution accuracy (EXC) as the headline metric, complemented by a graded result-set F1 and a mutually exclusive execution-outcome decomposition. Experiments show that LLMs with strong NL2SQL performance degrade substantially on TEND, validating Text-to-NoSQL as a distinct schema-less document reasoning problem.

2407.02362 2026-06-16 cs.AR cs.AI cs.LG 版本更新

Mitigating scalability challenges in LUT-based neural networks via pruning optimisations

通过剪枝优化缓解基于LUT的神经网络的可扩展性挑战

Xuqi Zhu, Huaizhi Zhang, JunKyu Lee, Jiacheng Zhu, Chandrajit Pal, Sangeet Saha, Klaus D. McDonald-Maier, Xiaojun Zhai

发表机构 * School of Computer Science and Electronic Engineering, University of Essex(埃塞克斯大学计算机科学与电子工程学院)

AI总结 针对LUT矩阵乘法可扩展性差的问题,提出集成剪枝策略的LUT-MU架构,在FPGA上实现最高1.6倍吞吐量和4.2倍能效提升。

详情
AI中文摘要

现代深度神经网络严重依赖大量的乘加运算,这构成了主要的计算成本。为了解决这个问题,基于查找表(LUT)的矩阵乘法已成为减少神经网络中乘加运算计算成本和时间的有效替代方案。然而,由于LUT矩阵乘法的固有限制,基于LUT的神经网络仍然面临可扩展性挑战。为了缓解这些可扩展性限制,本文提出了一种可扩展且节能的基于LUT的近似矩阵乘法单元(LUT-MU),通过将剪枝策略集成到MADDNESS算法(一种基于LUT的矩阵乘法方法)中,构成神经网络的基本组件。随着矩阵乘法中问题规模和精度要求的增加,我们提出的LUT-MU架构有效约束了资源扩展。案例研究表明,将我们的LUT-MU部署在神经网络架构中,包括全连接层(MNIST)和ResNets(CIFAR-10、ImageNet)——在XCZU7EV和XCZU19EG FPGA上——与主流的基于CUDA的网络实现相比,产生了高达1.6倍的吞吐量提升和4.2倍的能效提升,与领先的量化神经网络实现相比,能效提升1.8倍,且对精度影响适中。与基于原始MADDNESS的神经网络相比,我们的LUT-MU根据MADDNESS的不同分辨率配置设置,节省了1.3到2.6倍的资源。

英文摘要

Modern deep neural networks heavily rely on a large number of multiply-accumulate operations, which constitute the predominant computational cost. To address this, Look-Up Table (LUT)-based matrix multiplications have emerged as a promising alternative for reducing the computational cost and time of the multiply-accumulate operations in a neural network. However, the LUT-based neural network still faces the scalability challenge due to the inherent limitations of LUT-based matrix multiplication. To mitigate these scalability limitations, this paper proposes a scalable and energy-efficient LUT-based approximate matrix multiplication unit (LUT-MU) constituting the basic component of the neural networks by integrating a pruning strategy on the MADDNESS algorithm, a LUT-based matrix multiplication methodology. With increasing problem size and precision demands in matrix multiplication, our proposed LUT-MU architecture effectively constrains resource expansion. The case study shows that deploying our LUT-MU in neural network architectures, including fully connected layers (MNIST) and ResNets (CIFAR-10, ImageNet)-on XCZU7EV and XCZU19EG FPGAs, produces up to $1.6 \times$ throughput improvement and $4.2 \times$ energy efficiency gains over mainstream CUDA-based network implementations, and $1.8\times$ energy efficiency compared to leading quantised neural network implementations, with moderate impact on accuracy. Compared to original MADDNESS-based neural networks, our LUT-MU shows $1.3$ to $2.6\times$ resource savings based on various resolution configuration settings of MADDNESS.

2405.15768 2026-06-16 stat.ML cs.AI cs.LG 版本更新

Canonical Variates in Wasserstein Metric Space

Wasserstein度量空间中的典型变量

Jia Li, Lin Lin

发表机构 * Department of Statistics, The Pennsylvania State University(宾夕法尼亚州立大学统计学系) Department of Biostatistics and Bioinformatics, Duke University(杜克大学生物统计学与生物信息学系)

AI总结 针对分布数据分类问题,提出基于Wasserstein距离的Fisher比最大化降维方法,通过迭代优化算法实现,实验证明能显著提升分类性能。

Comments single space 39 pages, 10 figures

详情
AI中文摘要

在本文中,我们处理由向量空间上的分布(而非单个点)表示的实例的分类问题。我们考虑基于成对距离的分类算法,特别是分布之间的Wasserstein度量。我们研究的核心是在Wasserstein度量空间中进行降维以提高分类准确性。我们引入了一种基于最大化Fisher比(定义为类间变异与类内变异之比)原理的新方法。该比值最大化的方向被称为判别坐标或典型变量轴。在实践中,类间变异和类内变异被定义为分布对之间的平均平方Wasserstein距离,这些分布对要么属于同一类,要么属于不同类。该比值优化通过一种迭代算法实现,该算法在向量空间中的最优传输和最大化步骤之间交替进行。进行了实证研究以评估算法的收敛性;实验结果表明,降维技术显著提高了分类性能。此外,新方法优于基于从分布数据派生的向量表示运行的成熟算法。它对实例如何由分布总结的变化(例如高斯混合模型表示中的分量数量)也表现出鲁棒性。

英文摘要

In this paper, we address the classification of instances represented by distributions on a vector space rather than single points. We consider classification algorithms based on pairwise distances, specifically, the Wasserstein metric between distributions. Central to our investigation is dimension reduction within the Wasserstein metric space to enhance classification accuracy. We introduce a novel approach grounded in the principle of maximizing Fisher's ratio, defined as the quotient of between-class variation to within-class variation. The directions in which this ratio is maximized are termed discriminant coordinates or canonical variates axes. In practice, both between-class and within-class variations are defined as the average squared Wasserstein distances between pairs of distributions, with the pairs either belonging to the same class or to different classes. This ratio optimization is achieved through an iterative algorithm, which alternates between optimal transport and maximization steps within the vector space. Empirical studies are conducted to assess the algorithm's convergence; and experimental results demonstrate that the dimension reduction technique substantially enhances classification performance. Moreover, the new method outperforms well-established algorithms that operate on vector representations derived from distributional data. It also exhibits robustness to variations in how instances are summarized by distributions, such as the number of components in a Gaussian mixture model (GMM) representation.

2405.02369 2026-06-16 cs.NE cs.AI cs.LG 版本更新

No One-Size-Fits-All Neurons: Task-based Neurons for Artificial Neural Networks

没有万能神经元:面向任务的人工神经网络神经元

Feng-Lei Fan, Meng Wang, Hang-Cheng Dong, Jianwei Ma, Tieyong Zeng

发表机构 * Department of Data Science, City University of Hong Kong(城市大学数据科学系) School of Mathematics, Harbin Institute of Technology(哈尔滨工业大学数学系) School of Instrumentation, Harbin Institute of Technology(哈尔滨工业大学仪器系) School of Earth and Space Sciences, Peking University(北京大学地球与空间科学学院) Institute for Advanced Study, Beijing Normal-Hong Kong Baptist University(北京师范大学-香港 Baptist大学高级研究院)

AI总结 受大脑神经元任务特异性的启发,提出一种两阶段框架设计任务导向神经元,通过多项式基函数引入归纳偏置,在合成数据、经典基准和实际应用中性能优于现有模型。

Comments 8 pages, 4 figures

详情
AI中文摘要

在过去十年中,许多成功的网络都采用了新颖的架构,这些架构几乎无一例外地使用相同类型的神经元。最近,越来越多的深度学习研究受到NeuroAI理念和人类大脑中观察到的神经元多样性的启发,从而提出了新颖的人工神经元设计。设计性能良好的神经元代表了相对于设计性能良好的神经架构的一个新维度。从生物学角度看,大脑并不依赖一种在所有方面都普遍适用的单一类型神经元。相反,在我们的大脑中,神经元通常是基于任务的。在本研究中,我们探讨以下问题:既然人脑是一个基于任务的神经元使用者,那么人工网络设计能否从基于任务的架构设计转向基于任务的神经元设计?由于方法论上不存在万能神经元,在相同结构下,基于任务的神经元由于对任务具有内在的归纳偏置,相比现有的通用神经元可以增强特征表示能力。具体来说,我们提出了一个用于原型化基于任务神经元的两阶段框架。作为初始步骤,我们使用多项式作为基函数来评估所提出的框架。实验上,在合成数据、经典基准和实际应用上的系统实验结果表明,所提出的基于任务的神经元设计不仅可行,而且相比其他最先进模型具有竞争力的性能。

英文摘要

In the past decade, many successful networks are on novel architectures, which almost exclusively use the same type of neurons. Recently, more and more deep learning studies have been inspired by the idea of NeuroAI and the neuronal diversity observed in human brains, leading to the proposal of novel artificial neuron designs. Designing well-performing neurons represents a new dimension relative to designing well-performing neural architectures. Biologically, the brain does not rely on a single type of neuron that universally functions in all aspects. Instead, in our brain, neurons are often task-based. In this study, we address the following question: since the human brain is a task-based neuron user, can the artificial network design go from the task-based architecture design to the task-based neuron design? Since methodologically there are no one-size-fits-all neurons, given the same structure, task-based neurons can enhance the feature representation ability relative to the existing universal neurons due to the intrinsic inductive bias for the task. Specifically, we propose a two-step framework for prototyping task-based neurons. As the initial step, we evaluate the proposed framework using polynomials as base functions. Empirically, systematic experimental results on synthetic data, classic benchmarks, and real-world applications show that the proposed task-based neuron design is not only feasible but also delivers competitive performance over other state-of-the-art models.

2309.07401 2026-06-16 math.NA cs.AI cs.NA 版本更新

Multi-Grade Deep Learning for Partial Differential Equations with Applications to the Burgers Equation

多级深度学习用于偏微分方程及其在Burgers方程中的应用

Yuesheng Xu, Taishan Zeng

发表机构 * Department of Mathematics and Statistics, Old Dominion University(数学与统计学系,老 Dominion 大学) School of Mathematical Science, South China Normal University(数学科学学院,华南师范大学)

AI总结 提出两阶段多级深度学习方法,通过渐进式分级训练浅层网络拟合目标函数,再微调部分层,有效解决非线性PDE优化难题,在Burgers方程上误差降低达60倍。

详情
AI中文摘要

深度神经网络在求解偏微分方程方面显示出巨大潜力,但其深层架构带来了复杂、大规模、非凸的优化挑战。非线性PDE,如粘性Burgers方程,由于陡峭梯度和激波类解而加剧了这些困难。为此,我们提出了一种两阶段多级深度学习方法。在第一阶段,浅层网络逐级渐进训练,从低频到高频分量拟合目标函数;先前学习的级被冻结,每个新的残差块仅训练以最小化剩余逼近误差。第二阶段解冻并重新训练选定层,以第一阶段网络为初始化,实现可解释、稳定的层次细化,同时减轻优化复杂性。此外,我们从理论上证明,在适当的优化策略下,TS-MGDL中的每一级和每一阶段都单调地减少损失函数。在一维、二维和三维粘性Burgers方程上的数值实验表明,TS-MGDL显著优于单级学习,预测误差降低高达60倍。

英文摘要

Deep neural networks (DNNs) show great promise for solving partial differential equations (PDEs), but their deep architectures introduce complex, large-scale, non-convex optimization challenges. Nonlinear PDEs, like the viscous Burgers' equation, compound these difficulties due to steep gradients and shock-like solutions. To address this, we propose a two-stage multi-grade deep learning (TS-MGDL) method. In the first stage, shallow networks are trained progressively grade by grade to fit the target function from low- to high-frequency components; previously learned grades are frozen, and each new residual block is trained solely to minimize the remaining approximation error. The second stage unfreezes and retrains selected layers using the first-stage network as initialization, achieving an interpretable, stable hierarchical refinement while mitigating optimization complexity. Furthermore, we theoretically prove that each grade and stage in TS-MGDL monotonically reduces the loss function under an appropriate optimization strategy. Numerical experiments on 1D, 2D, and 3D viscous Burgers' equations demonstrate that TS-MGDL significantly outperforms single-grade learning (SGL), reducing predictive errors by up to a factor of 60.

2502.06178 2026-06-16 math.OC cs.LG stat.ML

Bayesian Optimization by Kernel Regression and Density-based Exploration

基于核回归和密度探索的贝叶斯优化

Tansheng Zhu, Hongyu Zhou, Ke Jin, Xusheng Xu, Qiufan Yuan, Lijie Ji

发表机构 * Zhiyuan College, Shanghai Jiao Tong University, Shanghai 200240, P. R. China(上海交通大学紫阳学院) School of Mathematical Sciences, Shanghai Jiao Tong University, Shanghai 200240, P. R. China(上海交通大学数学科学学院) Shanghai Institute of Aerospace Systems Engineering, Shanghai 201109, P. R. China(上海航天系统工程研究院) Department of Mathematics, Shanghai University, Shanghai 200444, P. R. China(上海大学数学系) Newtouch Center for Mathematics of Shanghai University, Shanghai University, Shanghai 200444, P. R. China(上海大学数学中心)

AI总结 该研究提出了一种新的贝叶斯优化算法BOKE,通过核回归和密度探索结合,减少计算成本至二次复杂度,并在理论和实验上证明了其收敛性和有效性。

详情
AI中文摘要

贝叶斯优化在优化昂贵评估的黑盒函数时非常有效,但因高斯过程的每次迭代三次计算复杂度而面临显著的计算挑战,导致总时间复杂度与迭代次数的四次方成正比。为了解决这一限制,我们提出了一种新的算法,即基于核回归和密度探索的贝叶斯优化(BOKE)。BOKE利用核回归进行高效的函数近似,核密度用于探索,并将它们整合到置信界标准中以指导优化过程,从而将计算成本降低到二次。我们的理论分析严格建立了在噪声评估下的BOKE全局收敛性。通过广泛的数值实验,在合成和现实优化任务中,我们证明了BOKE不仅在与高斯过程方法和其他基线方法相比具有竞争力,而且表现出优越的计算效率。这些结果突显了BOKE在资源受限环境中的有效性,为工程应用中的优化问题提供了一种实用的方法。

英文摘要

Bayesian optimization is highly effective for optimizing expensive-to-evaluate black-box functions, but it faces significant computational challenges due to the cubic per-iteration cost of Gaussian processes, which results in a total time complexity that is quartic with respect to the number of iterations. To address this limitation, we propose a novel algorithm, Bayesian optimization by kernel regression and density-based exploration (BOKE). BOKE uses kernel regression for efficient function approximation, kernel density for exploration, and integrates them into the confidence bound criteria to guide the optimization process, thus reducing computational costs to quadratic. Our theoretical analysis rigorously establishes the global convergence of BOKE under noisy evaluations. Through extensive numerical experiments on both synthetic and real-world optimization tasks, we demonstrate that BOKE not only performs competitively compared to Gaussian process-based methods and several other baseline methods but also exhibits superior computational efficiency. These results highlight BOKE's effectiveness in resource-constrained environments, providing a practical approach for optimization problems in engineering applications.

2603.25777 2026-06-16 physics.plasm-ph cs.AI

Challenges and opportunities for AI to help deliver fusion energy

人工智能在实现聚变能源中的挑战与机遇

Adriano Agnello, Helen Brooks, Cyd Cowley, Iulia Georgescu, Alex Higginbottom, Richard Pearson, Tara Shears, Melanie Windridge

发表机构 * STFC Hartree Centre(英国科学与技术创新中心) UK Atomic Energy Authority(英国原子能局) digiLab Solutions(digiLab解决方案) Institute of Physics(物理研究所) Zenithon AI(Zenithon人工智能) Eindhoven University of Technology(埃因霍温理工大学) Oliver Lodge Laboratory(Oliver Lodge实验室) University of Liverpool(利物浦大学) Fusion Energy Insights(聚变能源洞察)

AI总结 本文探讨了人工智能在聚变能源研发中的应用潜力与挑战,强调需通过跨领域合作与稳健方法提升AI应用效果,同时指出并非所有聚变问题都适合AI解决。

Comments Submitted to Plasma Physics and Confined Fusion

详情
Journal ref
Plasma Physics and Controlled Fusion 68 063701 (2026)
AI中文摘要

人工智能工具在聚变研究中的应用具有巨大潜力,若能实现可控核聚变,将带来全球性效益。然而,使用AI面临诸多挑战,这些挑战可通过在现有方法中引入负责任和稳健的方法加以缓解。为此,需要聚变领域专家与AI开发者之间紧密、长期的合作,并意识到并非所有聚变研究问题都最适合用AI工具解决。2025年4月,学术界、工业界、UKAEA和STFC专家在《经济学人》FusionFest活动上讨论了AI如何推动聚变能源研发。本文是对圆桌讨论的扩展和更新总结,提供了更多背景和实例。

英文摘要

There is great potential for the application of AI tools in fusion research, and substantial worldwide benefit if fusion power is realised. However, using AI comes with its own challenges, many of which can be mitigated if responsible and robust methodologies are built into existing approaches. To do that requires close, long-term collaborations between fusion domain experts and AI developers and awareness of the fact that not all problems in fusion research are best tackled with AI tools. In April 2025, experts from academia, industry, UKAEA and STFC discussed how AI can be used to advance R&D in fusion energy at the first edition of The Economist FusionFest event. This Perspective is an expanded and updated summary of the round table discussion, providing more context and examples.

2605.06184 2026-06-16 cs.SE cs.LG cs.LO cs.PL

Teaching LLMs Program Semantics via Symbolic Execution Traces

Jonas Bayer, Stefan Zetzsche, Olivier Bouissou, Remi Delmas, Michael Tautschnig, Soonho Kong

发表机构 * University of Cambridge(剑桥大学) Amazon Web Services(亚马逊网络服务)

详情
英文摘要

We introduce an evaluation framework of 500 C verification tasks across five property types (memory safety, overflow, termination, reachability, data races) built on SV-COMP 2025, and evaluate 14 models across six families. We find that high overall accuracy masks a critical weakness: while most models reliably confirm properties hold, violation detection varies widely and degrades sharply with program length. To close this gap, we train on formal verification artifacts: running the Soteria symbolic execution engine on generic open-source C code and using the resulting traces for continued pretraining of Qwen3-8B. Just ${\sim}$3,000 bug traces combined with chain-of-thought reasoning at inference time improve violation detection by over 17 percentage points, producing one of the most balanced accuracy profiles among evaluated models. On violation detection, the trained 8B model outperforms the 4$\times$ larger Qwen3-32B without thinking and approaches it in overall accuracy. The interaction between trace training and chain-of-thought is superadditive: neither alone provides meaningful gains, but their combination does. Improvements transfer across all five property types, including ones the training traces do not target. Our 28 configurations confirm the gains stem from trace semantics, not code volume, and that trace curation and format matter.

2602.21954 2026-06-16 physics.soc-ph cs.RO

The Swarm Intelligence Freeway-Urban Trajectories (SWIFTraj) Dataset -- Part II: A Graph-Based Approach for Trajectory Connection

Xinkai Ji, Pan Liu, Ying Yang, Yu Han

发表机构 * Hong Kong University of Science and Technology - Guangzhou(香港科技大学(广州))

详情
英文摘要

In Part I of this companion paper series, we introduced SWIFTraj, a new open-source vehicle trajectory dataset collected using a unmanned aerial vehicle (UAV) swarm. The dataset has two distinctive features. First, by connecting trajectories across consecutive UAV videos, it provides long-distance continuous trajectories, with the longest exceeding 4.5 km. Second, it covers an integrated traffic network consisting of both freeways and their connected urban roads. Obtaining such long-distance continuous trajectories from a UAV swarm is challenging, due to the need for accurate time alignment across multiple videos and the irregular spatial distribution of UAVs. To address these challenges, this paper proposes a novel graph-based approach for connecting vehicle trajectories captured by a UAV swarm. An undirected graph is constructed to represent flexible UAV layouts, and an automatic time alignment method based on trajectory matching cost minimization is developed to estimate optimal time offsets across videos. To associate trajectories of the same vehicle observed in different videos, a vehicle matching table is established using the Hungarian algorithm. The proposed approach is evaluated using both simulated and real-world data. Results from real-world experiments show that the time alignment error is within three video frames, corresponding to approximately 0.1 s, and that the vehicle matching achieves an F1-score of about 0.99. These results demonstrate the effectiveness of the proposed method in addressing key challenges in UAV-based trajectory connection and highlight its potential for large-scale vehicle trajectory collection.

2604.22795 2026-06-16 eess.SY cs.LG cs.SY

Load constrained wind farm flow control through multi-objective multi-agent reinforcement learning

基于多目标多智能体强化学习的负载约束风电场流动控制

Teodor Åstrand, Marcus Binder Nilsen, Iasonas Tsaklis, Tuhfe Göçmen, Pierre-Elouan Réthoré, Nikolay Dimitrov

发表机构 * Department of Wind and Energy Systems, Technical University of Denmark(丹麦技术大学风能与能源系统系)

AI总结 提出多智能体强化学习框架,结合独立软演员-评论家架构和数据驱动代理模型,在风电场流动控制中通过形状奖励函数约束损伤等效载荷增量,实现功率提升与负载控制的多目标优化。

Comments Submitted to Journal of Physics: Conference Series (Torque 2026). This is the Accepted Manuscript version of an article accepted for publication in Journal of Physics: Conference Series. IOP Publishing Ltd is not responsible for any errors or omissions in this version of the manuscript or any version derived from it. This Accepted Manuscript is published under a CC BY licence

详情
Journal ref
J. Phys.: Conf. Ser. 3224 032065 (2026)
AI中文摘要

本研究提出了一种用于负载约束风电场流动控制(WFFC)的多智能体强化学习(MARL)框架。虽然尾流偏转可以提升风电场总功率,但通常会增加下游风机的结构载荷。为了解决这一问题,我们将独立软演员-评论家(I-SAC)架构与数据驱动的局部入流扇区平均代理模型相结合,以实时估计损伤等效载荷(DELs)。通过将这些估计值纳入形状奖励函数,训练特定风机的智能体在相对于基线控制器遵守特定载荷增加阈值($Δ_{max}$)为10%、20%和30%的同时最大化发电量。该框架在WindGym环境中实现,使用带有动态尾流蜿蜒(DWM)模型的DYNAMIKS流动求解器来捕捉非稳态尾流物理特性。结果表明,MARL智能体成功学习了协作策略,优先考虑功率增益,同时主动回避高DEL控制策略。

英文摘要

This study presents a multi-agent reinforcement learning (MARL) framework for load-constrained wind farm flow control (WFFC). While wake steering can enhance total wind farm power, it often introduces increased structural loads on downstream turbines. To address this, we integrate an Independent Soft Actor-Critic (I-SAC) architecture with a data-driven, local inflow sector-averaged surrogate model to provide real-time estimates of Damage Equivalent Loads (DELs). By incorporating these estimates into a shaped reward function, turbine-specific agents are trained to maximize power production while adhering to specific load-increase thresholds ($Δ_{max}$) of 10%, 20%, and 30% relative to a baseline controller. The framework is implemented within the WindGym environment using the DYNAMIKS flow solver with Dynamic Wake Meandering (DWM) model to capture non-stationary wake physics. Results indicate that the MARL agents successfully learn collaborative policies that prioritise power gain while actively retreating from high-DEL control strategies.

2511.12635 2026-06-16 cs.SE cs.AI cs.LG

LLM4SCREENLIT: Recommendations on Assessing the Performance of Large Language Models for Screening Literature in Systematic Reviews

Lech Madeyski, Barbara Kitchenham, Martin Shepperd

发表机构 * University of Kent(肯特大学) University of Leicester(利兹大学) University of Birmingham(伯明翰大学)

Comments 34 pages, 6 figures

详情
Journal ref
Information and Software Technology 198 (2026) 108204
英文摘要

Context: Large language models (LLMs) are increasingly used to screen literature for systematic reviews (SRs), but the standard confusion-matrix metrics used to evaluate them can mislead under the imbalanced, cost-asymmetric conditions of screening. Objective: We develop and justify LLM4SCREENLIT-practical recommendations for researchers conducting LLM-screening evaluations and for editors and reviewers assessing such studies-differentiated by study type (retrospective benchmarking vs deployment for a specific SR). Method: Using Delgado-Chaves et al. (2025), an 18-LLM benchmark across three biomedical SRs, as a motivating example, we reviewed 28 additional papers and extracted their reported metrics. We propose a Weighted Matthews Correlation Coefficient (WMCC) that integrates MCC's chance-correction with asymmetric misclassification costs, and validated it on three software-engineering (SE) reanalyses, the largest covering 9 LLMs x 24 SE secondary studies (34,528 articles). Results: Across the 29 papers, only 10% reported MCC, only 24% reported full confusion matrices, and none of the five papers claiming workload savings priced false-negative cost. In the largest SE reanalysis, MCC and WMCC disagree on the best LLM in 55% of evaluable studies; in the most striking 9,695-article SE study, the Accuracy-best LLM loses 63.3% of relevant evidence (Lost Evidence), the MCC-best 43.9%, but the WMCC-best only 5.8%. Sensitivity analysis (median crossover at w~=2.7, all <7) supports w=10 as a conservative default. Conclusions: SR-screening evaluations should prioritize Lost Evidence and use cost-sensitive WMCC alongside MCC for ranking. Reporting must include the full confusion matrix and treat unclassifiable outputs as positives requiring human review. Designs should be leakage-aware, with non-LLM baselines when the study aims to inform SR practice and labels are available.

2603.11729 2026-06-16 cs.DS cs.AI cs.RO

Adapting Dijkstra for Buffers and Unlimited Transfers

为缓冲区和无限换乘调整Dijkstra算法

Denys Katkalo, Andrii Rohovyi, Toby Walsh

发表机构 * University of Oxford(牛津大学)

AI总结 本文提出Transfer Aware Dijkstra (TAD)算法,通过扫描完整行程序列而非单条边,解决了带缓冲区时间的无限换乘路径规划中传统Dijkstra过滤失效的问题,并在伦敦和瑞士网络上实现比MR快两倍以上的速度且保持最优性。

Comments v4: clarified RAPTOR description in the Background section

详情
AI中文摘要

近年来,基于RAPTOR的算法被认为是无需预处理即可处理无限换乘路径规划的最先进技术。然而,这一地位很大程度上源于路由研究的演进,其中基于Dijkstra的解决方案被基于时间表的算法取代,而缺乏系统性的比较。在这项工作中,我们重新审视了经典的基于Dijkstra的无限换乘公共交通路由方法,并证明时间依赖Dijkstra (TD-Dijkstra) 优于MR。然而,高效的TD-Dijkstra实现依赖于在预处理期间过滤被支配的连接,这假设乘客总是可以切换到更快的连接。我们表明,当站点有缓冲区时间时,这种过滤是不合理的,因为它无法区分可能继续等待的坐席乘客和必须遵守缓冲区的换乘乘客。为了解决这一限制,我们引入了Transfer Aware Dijkstra (TAD),这是一种修改后的算法,它扫描整个行程序列而不是单个边,从而正确处理缓冲区时间,同时保持相对于MR的性能优势。我们在伦敦和瑞士网络上的实验表明,与MR相比,我们可以在有和没有缓冲区时间的两个网络上实现超过两倍的速度提升,同时产生最优结果。

英文摘要

In recent years, RAPTOR based algorithms have been considered the state-of-the-art for path-finding with unlimited transfers without preprocessing. However, this status largely stems from the evolution of routing research, where Dijkstra-based solutions were superseded by timetable-based algorithms without a systematic comparison. In this work, we revisit classical Dijkstra-based approaches for public transit routing with unlimited transfers and demonstrate that Time-Dependent Dijkstra (TD-Dijkstra) outperforms MR. However, efficient TD-Dijkstra implementations rely on filtering dominated connections during preprocessing, which assumes passengers can always switch to a faster connection. We show that this filtering is unsound when stops have buffer times, as it cannot distinguish between seated passengers who may continue without waiting and transferring passengers who must respect the buffer. To address this limitation, we introduce Transfer Aware Dijkstra (TAD), a modification that scans entire trip sequences rather than individual edges, correctly handling buffer times while maintaining performance advantages over MR. Our experiments on the London and Switzerland networks show that we can achieve more than a twofold speedup over MR while producing optimal results on both networks, with and without buffer times.