arXivDaily arXiv每日学术速递 周一至周五更新

科学与医疗

AI for Science

科学智能、蛋白质、分子、药物、材料、气象、物理和数学 AI。

今日/当前日期收录 478 信号源:cs.LG, q-bio, physics, cond-mat, math, stat.ML

1. 物理仿真 19 篇

2604.16897 2026-06-19 physics.chem-ph quant-ph 版本更新 85%

Ultrafast nonadiabatic dynamics of tetraphenylsubstituted nitrogen-based heterocycles

四苯基取代氮杂环的超快非绝热动力学

Javier Hernández-Rodríguez, Alberto Martín Santa Daría, Susana Gómez-Carrasco, Sandra Gómez

专题命中 物理仿真 :模拟四苯基氮杂环的激发态弛豫动力学

AI总结 通过表面跳跃混合量子-经典轨迹模拟,研究四苯基吡嗪和四苯基吡咯的激发态弛豫路径,揭示固态发光增强与双态发射差异的机制。

详情
AI中文摘要

四苯基吡嗪(TPP)和2,3,4,5-四苯基-1H-吡咯(TePP)是带有四个苯基取代基的密切相关杂环化合物,其结构相似性使其成为比较分子内柔性如何影响气相和固态中激发态弛豫和发射的有用配对。TPP是典型的固态发光增强(SLE)发射体,在分子聚集时量子产率显著增加。相反,TePP在溶液和固态中显示出相似的量子产率,具有双态发射(DSE)特征。这种行为表明,在孤立分子体系中,分子内旋转已经受到显著阻碍,这与我们之前对TPP和其他固态发射体的观察结果一致(Hernández-Rodríguez等人,ChemPhysChem,2024,25,e202400563)。为了揭示这种对比行为背后的激发态动力学,我们采用表面跳跃方法对TPP和TePP的单分子进行了混合量子-经典轨迹模拟。在TD-B3LYP-D3/def2-SVP水平上包含了12个单重态,该水平之前已与耦合簇方法进行了基准测试。模拟的可观测值,如气相超快电子衍射(GUED)和时间分辨荧光(TR-FL)信号,使我们能够剖析两种系统在气相中不同的失活路径,同时提供关于这些路径在溶液和固态环境中如何演化的机制性见解。

英文摘要

Tetraphenylpyrazine (TPP) and 2,3,4,5-tetraphenyl-1H-pyrrole (TePP) are closely related heterocycles bearing four phenyl substituents, whose structural similarity makes them a useful pair for comparing how intramolecular flexibility influences excited-state relaxation and emission in the gas phase and in the solid state. TPP is a prototypical solid-state luminescence enhancement (SLE) emitter, exhibiting a markedly increased quantum yield upon molecular aggregation. In contrast, TePP displays similar quantum yields in solution and solid state, characteristic of dual-state emission (DSE). This behaviour indicates that intramolecular rotations are already significantly hindered in the isolated-molecule regime, consistent with our previous observations for TPP and other solid-state emitters (Hernández-Rodríguez et al., ChemPhysChem, 2024, 25, e202400563). To unravel the excited-state dynamics underlying this contrasting behaviour, we performed mixed quantum-classical trajectory simulations on a single molecule of TPP and TePP employing the surface-hopping method. Twelve singlet states were included at the TD-B3LYP-D3/def2-SVP level, which were previously benchmarked against coupled cluster methods. Simulated observables such as gas phase ultrafast electron diffraction (GUED) and time-resolved fluorescence (TR-FL) signals allow us to dissect the distinct deactivation pathways operating in both systems in the gas phase, while also providing mechanistic insight into how these pathways are expected to evolve in solution and solid-state environments.

2604.11774 2026-06-19 hep-ex physics.ins-det 版本更新 85%

Neutron Reconstruction via Blips in Liquid Argon Time Projection Chambers

液氩时间投影室中通过闪烁点进行中子重建

Miguel Hernandez Morquecho, Bryce Littlejohn, Paola Sala, Linyan Wan

专题命中 物理仿真 :液氩时间投影室中子重建

AI总结 提出基于模拟的概念验证,利用中子非弹性散射产生的孤立MeV级能量沉积(闪烁点)在LArTPC中重建中子方向和能量,并探索其改善中微子-反中微子区分等物理研究的应用。

Comments 19 pages + 6 pages appendix; Accepted for publication in Physical Review D

详情
AI中文摘要

中微子相互作用中,中子是重要的末态粒子,但在当前大多数中微子LArTPC物理分析中,中子未被考虑或重建。本文在通用LArTPC探测器中,基于模拟进行了中子重建的概念验证研究。利用中子非弹性散射产生的孤立MeV级能量沉积(闪烁点),并结合已发表实验结果中的真实闪烁点响应,我们展示了识别中子以及重建亚GeV中微子相互作用中末态中子系统方向和能量的能力。随后,我们探讨了如何利用中子相关闪烁点属性来改进中微子相互作用的物理研究,例如增强大气中微子和反向喇叭电流束中微子中的中微子-反中微子区分。这项简单研究初步量化了LArTPC的中子重建能力,我们预期随着闪烁点重建、识别和分类算法以及中子建模的未来进展,该能力将得到提升。

英文摘要

Neutrons are important final-state particles in neutrino interactions, yet they are not considered or reconstructed in most current neutrino LArTPC physics analyses. In this paper, we present a simulation-based proof-of-concept study of neutron reconstruction in a generic LArTPC detector. Leveraging isolated, MeV-scale energy deposits, or blips, from neutron inelastic scattering, and using realistic blip response from published experimental results, we demonstrate the capability to identify neutrons and to reconstruct the direction and energy of the final-state neutron system in sub-GeV neutrino interactions. We then explore how neutron-related blip attributes can be used to improve physics studies of neutrino interactions, such as enhancing neutrino-antineutrino separation in atmospheric neutrinos and reverse-horn-current beam neutrinos. This simple study provides an initial quantification of LArTPC neutron reconstruction capabilities, which we expect to improve with future advancements in blip reconstruction, identification, and classification algorithms, as well as the modeling of neutrons.

2601.02149 2026-06-19 cond-mat.mes-hall cond-mat.dis-nn cs.AI 版本更新 85%

AI-enhanced tuning of quantum dot Hamiltonians toward Majorana modes

基于人工智能的量子点哈密顿量调优以实现马约拉纳模式

Mateusz Krawczyk, Jarosław Pawłowski

发表机构 * Institute of Theoretical Physics, Wrocław University of Science and Technology(理论物理研究所,沃林大学技术学院)

专题命中 物理仿真 :AI调谐量子点哈密顿量实现马约拉纳模式

AI总结 本文提出基于神经网络的模型,通过学习量子点模拟器的工作区域,利用输运测量自动调优设备以获得马约拉纳模式。模型在无监督条件下训练于导电图合成数据,采用融合马约拉纳零模关键性质的物理引导损失函数。

Comments 12 pages, 8 figures, 2 tables

Journal ref Phys. Rev. Applied 25, 064032 (2026)

详情
AI中文摘要

我们提出了一种基于神经网络的模型,能够学习量子点模拟器广泛的工作区域,并利用此知识通过输运测量自动调优这些设备,以在结构中获得马约拉纳模式。模型在无监督条件下训练于导电图合成数据,采用融合马约拉纳零模关键性质的物理引导损失函数。我们展示了通过适当训练,深度视觉变换器网络可以高效记忆哈密顿量参数与导电图之间的关系,并利用此提出量子点链参数更新,驱动系统进入拓扑相。从参数空间的广泛初始调谐范围开始,单步更新足以生成非平凡零模。此外,通过启用迭代调优过程——系统在每一步获得更新的导电图——我们证明该方法可以处理参数空间更大的区域。

英文摘要

We propose a neural network-based model capable of learning the broad landscape of working regimes in quantum dot simulators, and using this knowledge to autotune these devices - based on transport measurements - toward obtaining Majorana modes in the structure. The model is trained in an unsupervised manner on synthetic data in the form of conductance maps, using a physics-informed loss that incorporates key properties of Majorana zero modes. We show that, with appropriate training, a deep vision-transformer network can efficiently memorize relation between Hamiltonian parameters and structures on conductance maps and use it to propose parameters update for a quantum dot chain that drive the system toward topological phase. Starting from a broad range of initial detunings in parameter space, a single update step is sufficient to generate nontrivial zero modes. Moreover, by enabling an iterative tuning procedure - where the system acquires updated conductance maps at each step - we demonstrate that the method can address a much larger region of the parameter space.

2604.06001 2026-06-19 physics.comp-ph cs.LG 版本更新 85%

A deep learning framework for jointly solving transient Fokker-Planck equations with arbitrary parameters and initial distributions

一种联合求解具有任意参数和初始分布的瞬态Fokker-Planck方程的深度学习框架

Xiaolong Wang, Jing Feng, Qi Liu, Chengli Tan, Yuanyuan Liu, Yong Xu

发表机构 * School of Mathematics and Statistics, Shaanxi Normal University(陕西师范大学数学与统计学院) School of Mathematics and Statistics, Northwestern Polytechnical University(西北工业大学数学与统计学院) MOE Key Laboratory for Complexity Science in Aerospace, Northwestern Polytechnical University(航空复杂科学教育部重点实验室,西北工业大学) School of Science, Xi’an University of Posts and Telecommunications(西安邮电大学理学院) Department of Systems and Control Engineering, Institute of Science Tokyo(东京科学大学系统与控制工程系)

专题命中 物理仿真 :深度学习求解瞬态Fokker-Planck方程

AI总结 提出基于深度学习的伪解析概率解(PAPS),通过单次训练同时求解任意多模态初始分布、系统参数和时间点的瞬态FPE,速度比GPU加速蒙特卡洛快四个数量级。

详情
AI中文摘要

高效求解Fokker-Planck方程(FPE)是分析复杂参数化随机系统的核心。然而,当前数值方法缺乏跨不同条件的并行计算能力,严重限制了全面的参数探索和瞬态分析。本文引入一种基于深度学习的伪解析概率解(PAPS),通过单次训练过程,同时求解任意多模态初始分布、系统参数和时间点的瞬态FPE解。核心思想是通过高斯混合分布(GMD)统一初始、瞬态和稳态分布,并开发一个约束保持自编码器,将受约束的GMD参数双射映射到无约束的低维潜在表示。在该表示空间中,可以建模跨不同初始条件和系统参数的全局瞬态动力学。在典型系统上的大量实验表明,所提出的PAPS在保持高精度的同时,推理速度比GPU加速的蒙特卡洛模拟快四个数量级。这种效率提升使得以前难以实现的实时参数扫描和随机分岔的系统研究成为可能。通过将表示学习与物理信息瞬态动力学解耦,我们的工作为多维参数化随机系统的概率建模建立了一个可扩展的范式。

英文摘要

Efficiently solving the Fokker-Planck equation (FPE) is central to analyzing complex parameterized stochastic systems. However, current numerical methods lack parallel computation capabilities across varying conditions, severely limiting comprehensive parameter exploration and transient analysis. This paper introduces a deep learning-based pseudo-analytical probability solution (PAPS) that, via a single training process, simultaneously resolves transient FPE solutions for arbitrary multi-modal initial distributions, system parameters, and time points. The core idea is to unify initial, transient, and stationary distributions via Gaussian mixture distributions (GMDs) and develop a constraint-preserving autoencoder that bijectively maps constrained GMD parameters to unconstrained, low-dimensional latent representations. In this representation space, the panoramic transient dynamics across varying initial conditions and system parameters can be modeled by a single evolution network. Extensive experiments on paradigmatic systems demonstrate that the proposed PAPS maintains high accuracy while achieving inference speeds four orders of magnitude faster than GPU-accelerated Monte Carlo simulations. This efficiency leap enables previously intractable real-time parameter sweeps and systematic investigations of stochastic bifurcations. By decoupling representation learning from physics-informed transient dynamics, our work establishes a scalable paradigm for probabilistic modeling of multi-dimensional, parameterized stochastic systems.

2604.04173 2026-06-19 math-ph hep-th math.MP quant-ph 版本更新 85%

Spatial Localization of Relativistic Quantum Systems: The Commutativity Requirement and the Locality Principle. Part II: A Model from Local QFT

相对论量子系统的空间局域化:交换性要求与局域性原理。第二部分:来自局域QFT的模型

Valter Moretti

专题命中 物理仿真 :量子场论中构造空间局域化可观测量

AI总结 在标准量子场论中,利用应力-能量-动量张量与测试函数的涂抹,构造了闵可夫斯基时空中的正能相对论空间局域化可观测量,给出了条件局域化可观测量的交换性恢复。

Comments 87 pages, no figures, some typos/errors fixed, and some results improved

详情
AI中文摘要

本文是两部分研究的第二部分。我们在标准量子场论中,利用涂抹适当测试函数的应力-能量-动量张量,构造了闵可夫斯基时空中的正能相对论空间局域化可观测量。对于每个固定的类时方向,该构造在类空超曲面上给出正算子值测度(POVM),在每个n粒子扇区上定义良好,并满足排除探测概率超光速传播的相对论因果性条件。这些可观测量由局域或准局域场论量构建,从而为早期启发式提议提供了严格版本。在单粒子扇区中,该构造简化为作者先前引入的可观测量,并且在适当的归一化和居中假设下,其一阶矩给出牛顿-维格纳位置算子。由于Reeh-Schlieder定理阻止了正规排序的应力-能量-动量张量在全Fock空间上为正,我们使用量子能量不等式获得控制偏离正性的下界。这导致有下界的正则化算子族,近似局域化效应。最后,我们通过修正的局域能量算子定义有限实验室的条件局域化可观测量。根据Haag对偶性,相应的条件POVM属于局域冯·诺依曼代数,并且对于因果分离的区域可交换,符合Araki-Haag-Kastler框架。结果表明,在有限时空区域的条件测量中,局域化可观测量的交换性得以恢复。

英文摘要

This paper is the second and final part of a two-part study. We construct positive-energy relativistic spatial localization observables in Minkowski spacetime within standard quantum field theory, using the stress--energy--momentum tensor smeared with suitable test functions. For each fixed timelike direction, the construction gives positive operator-valued measures (POVMs) on spacelike hypersurfaces, well defined on every $n$-particle sector and satisfying a relativistic causality condition excluding superluminal propagation of detection probabilities. The observables are built from local or quasi-local field-theoretic quantities, thus providing a rigorous version of earlier heuristic proposals. In the one-particle sector, the construction reduces to the observable previously introduced by the author, and its first moment gives the Newton--Wigner position operator under appropriate normalization and centering assumptions. Because the Reeh--Schlieder theorem prevents the normally ordered stress--energy--momentum tensor from being positive on the full Fock space, we use quantum energy inequalities to obtain lower bounds controlling deviations from positivity. This leads to regularized operator families, bounded from below, which approximate the localization effects. Finally, we define conditional localization observables for finite laboratories through modified local energy operators. By Haag duality, the corresponding conditional POVMs belong to local von Neumann algebras and commute for causally separated regions, in accordance with the Araki--Haag--Kastler framework. The results show how commutativity of localization observables is recovered for conditional measurements in finite spacetime regions.

2602.14621 2026-06-19 math.OC 版本更新 85%

Extragradient methods for mean field games of controls and mean field type FBSDEs

控制平均场博弈与平均场类型正倒向随机微分方程的超梯度方法

Charles Meynard

专题命中 物理仿真 :提出数值方案求解平均场博弈方程,属于数学优化与物理仿真。

AI总结 提出一种基于超梯度方法的数值方案,用于求解由单调向量场驱动的耦合平均场正倒向随机微分方程,并证明在强单调性假设下近似解指数收敛。

详情
AI中文摘要

本文提出一种数值方案,用于求解由单调向量场驱动的耦合平均场正倒向随机微分方程。该方案基于超梯度方法的改编,通过将解刻画为希尔伯特空间中单调变分不等式的零点。我们首先在控制平均场博弈的背景下介绍该过程,并强调其与虚拟博弈的联系。在足够强的单调性假设下,我们证明了近似解序列指数快速收敛。然后,我们将该方法及主要结果推广到不一定源于最优控制的一般正倒向随机微分方程系统。

英文摘要

In this paper we present a numerical scheme to solve coupled mean field forward-backward stochastic differential equations driven by monotone vector fields. This is based on an adaptation of so called extragradient methods by characterizing solutions as zeros of monotone variational inequalities in a Hilbert space. We first introduce the procedure in the context of mean field games of controls and highlight its connection to the fictitious play. Under sufficiently strong monotonicity assumptions, we demonstrate that the sequence of approximate solutions converges exponentially fast. Then we extend the method and main results to general forward backward systems of stochastic differential equations that do not necessarily stem from optimal control.

2603.10336 2026-06-19 math.OC 版本更新 85%

A Globally Convergent Flow for Time-Dependent Mean Field Games and a Solver-Agnostic Framework for Inverse Problems

时间依赖平均场博弈的全局收敛流与逆问题的求解器无关框架

Hanwei Yan, Xianjin Yang, Jingguo Zhang

专题命中 物理仿真 :提出全局收敛流求解时间依赖平均场博弈。

AI总结 提出Hessian-Riemannian流用于时间依赖平均场博弈的全局收敛求解,并构建求解器无关的逆问题框架,通过双层优化和伴随梯度实现参数估计。

详情
AI中文摘要

平均场博弈(MFGs)描述了大量策略交互主体的极限行为。本文针对MFGs的两个数值挑战:全局收敛的正向求解器和逆问题的求解器无关方法。对于正向问题,我们将先前为静态MFGs开发的Hessian-Riemannian流(HRF)扩展到时间依赖MFGs。我们首先在空间和时间上离散化系统,然后直接在所得的有限维问题上构造流。所提出的流利用Lasry-Lions单调性,保留初始密度和终端值函数,并保持密度的正性和质量。在标准假设下,我们证明了HRF的全局收敛性,并展示了如何从其极限恢复完全离散化的时间依赖MFG系统的解。对于逆问题,我们将参数估计表述为双层问题,其中外层问题更新未知系数,内层问题求解离散化的MFG系统。外层目标的梯度通过在内层解处对离散化MFG系统求导获得,而不是通过特定正向求解器的迭代求导。这产生了一个求解器无关的框架,采用伴随梯度下降和高斯-牛顿加速。关于静态和时间依赖MFGs的数值实验证明了所提出方法的有效性。

英文摘要

Mean field games (MFGs) describe the limiting behavior of large populations of strategically interacting agents. This paper addresses two numerical challenges for MFGs: globally convergent forward solvers and solver-agnostic methods for inverse problems. For the forward problem, we extend the Hessian--Riemannian flow (HRF), previously developed for stationary MFGs, to time-dependent MFGs. We first discretize the system in space and time and then construct the flow directly on the resulting finite-dimensional problem. The proposed flow exploits Lasry--Lions monotonicity, preserves the initial density and terminal value function, and maintains positivity and mass of the density. Under standard assumptions, we prove global convergence of the HRF and show how to recover a solution of the full discretized time-dependent MFG system from its limit. For the inverse problem, we formulate parameter estimation as a bilevel problem in which the outer problem updates unknown coefficients and the inner problem solves the discretized MFG system. Gradients of the outer objective are obtained by differentiating the discretized MFG system at the inner solution, rather than differentiating through the iterations of a particular forward solver. This yields a solver-agnostic framework with adjoint-based gradient descent and Gauss--Newton acceleration. Numerical experiments on stationary and time-dependent MFGs demonstrate the effectiveness of the proposed methods.

2602.15687 2026-06-19 cond-mat.soft 版本更新 85%

Flexoelectricity-driven softening of bend elasticity leads to spontaneous chiral symmetry breaking in a polar fluid

挠曲电效应驱动的弯曲弹性软化导致极性流体中自发手性对称性破缺

Aitor Erkoreka, Josu Martinez-Perdiguero, Luka Cmok, Ema Hanžel, Jordan Hobbs, Calum J. Gibb, Richard J. Mandle, Nerea Sebastián, Alenka Mertelj

专题命中 物理仿真 :研究极性流体中自发手性对称性破缺的物理机制

AI总结 研究通过实验和理论揭示极性流体中自发手性对称性破缺的机制,发现挠曲电耦合引起的弯曲弹性软化是形成螺旋结构的关键。

Comments 8 pages, 8 figures

详情
AI中文摘要

最近观察到的极性流体中自发手性对称性破缺的起源是一个未解决的问题,并提出了关于由非手性分子组成的系统中如何出现螺旋结构的基本问题。我们报道了接近这种相变时弯曲弹性的软化,表明电极化与弯曲变形之间的挠曲电耦合是负责的机制,可能源于组成的高度极性分子的弯曲形状。

英文摘要

The origin of recently observed spontaneous chiral symmetry breaking in polar fluids is an unsolved problem, and poses fundamental questions as to how heliconical structures emerge in systems composed of achiral molecules. We report on the softening of bend elasticity close to such phase transition, showing that flexoelectric coupling between the electric polarization and the bend deformation is the responsible mechanism, presumably arising from the bent shape of the constituent highly polar molecules.

2512.04615 2026-06-19 quant-ph cond-mat.str-el 85%

Ground state energy and phase transitions of Long-range XXZ using VQE

使用VQE的长程XXZ模型的基态能量与相变

Mrinal Dev, Shraddha Sharma

专题命中 物理仿真 :使用VQE求解量子物理模型,属于物理仿真

AI总结 利用变分量子本征求解器(VQE)通过设计对相位敏感的ansatz电路,基于基态能量误差行为识别长程XXZ链的无穷阶相变边界。

详情
AI中文摘要

变分量子本征求解器(VQE)已被广泛用于寻找没有解析解且经典计算困难的哈密顿量的基态能量。在我们的工作中,我们使用VQE来识别无穷阶相变的相变边界。我们使用长程XXZ(LRXXZ)链进行研究。为了探测无穷阶相变,我们提出利用从VQE获得的基态能量。这一想法基于以下论点:VQE需要一个ansatz电路;因此,VQE的准确性将依赖于这个ansatz电路。我们设计了这个电路,使得估计的基态能量对其评估所在的相位敏感。这是通过施加在优化过程中净自旋保持恒定的约束来实现的。因此,ansatz在某个相位中工作良好,在该相位中它给出相对较小的随机误差,正如预期的那样,而在其他相位中,ansatz失败,基态能量计算误差较大。通过使用VQE识别基态能量误差行为的这些变化,我们能够确定相边界。使用精确对角化,我们还比较了该模型在两个相变边界上的能量梯度和能隙的行为。此外,通过增加优化电路的深度,我们还准确评估了耦合常数J等于-1时LRXXZ链的基态能量。

英文摘要

The variational quantum eigen solver (VQE), has been widely used to find the ground state energy of different Hamiltonians with no analytical solutions and are classically difficult to compute. In our work, we have used VQE to identify the phase transition boundary for an infinite order phase transition. We use long-range XXZ (LRXXZ) chain for our study. In order to probe infinite order phase transition, we propose to utilise the ground state energy obtained from VQE. The idea rests on the argument that VQE requires an ansatz circuit; therefore, the accuracy of the VQE will rely on this ansatz circuit. We have designed this circuit such that the estimated ground state energy is sensitive to the phase it is evaluated in. It is achieved by applying the constraint that the net spin remains constant throughout the optimisation process. Consequently, the ansatz works in a certain phase where it gives relatively small random error, as it should, when compared to the error in ground state energy calculations of the other phases, where the ansatz fails. By identifying these changes in the behaviour of the error in ground state energy using VQE, we were able to determine the phase boundaries. Using exact diagonalisation, we also compare the behaviour of the energy gradient and energy gap across both the phase transition boundaries for this model. Further, by increasing the depth of the optimisation circuit, we also accurately evaluate the ground energy of the LRXXZ chain for the value of coupling constant, J equal to -1

2601.01690 2026-06-19 physics.optics physics.app-ph physics.comp-ph 版本更新 85%

Quantum Nonlinearity for Optical Neural Computing

用于光学神经计算的量子非线性

Qingyi Zhou, Jungmin Kim, Yutian Tao, Guoming Huang, Ming Zhou, Zewei Shao, Zongfu Yu

专题命中 物理仿真 :量子非线性用于光学神经计算,属于物理仿真

AI总结 提出嵌入量子发射体的逆向设计纳米光子结构,利用量子发射体的饱和特性实现强非线性,通过物理感知训练实现全光神经网络的非线性分类和强化学习,并建立量化非线性与网络表达能力的框架。

Comments Main text: 11 pages, 4 figures; Supplementary: 36 pages, 26 figures

详情
AI中文摘要

深度神经网络的快速扩展以不可持续的功耗为代价。虽然光学神经网络提供了一种替代方案,但其能力仍受限于缺乏高效的光学非线性。为了解决这一问题,我们提出了一种光学神经计算架构,通过将量子发射体嵌入逆向设计的纳米光子结构中。由于量子发射体的可饱和性,与传统材料相比,它们表现出极强的非线性。通过物理感知训练,我们数值证明了所提出的架构可以在全光神经网络中解决复杂任务,包括非线性分类和强化学习。为了在不同平台之间进行公平比较,我们引入了一个框架,将非线性与网络的表达能力定量联系起来。分析表明,我们的量子激活在纳瓦每平方微米的强度下工作,比传统光学材料的非线性阈值低七个数量级。展望大型语言模型,我们估算了非线性限制的光功率,该功率随模型大小呈次线性增长。我们的结果表明,量子纳米光子学可能为可持续的人工智能推理提供一条途径。

英文摘要

The rapid scaling of deep neural networks comes at the cost of unsustainable power consumption. While optical neural networks offer an alternative, their capabilities remain constrained by the lack of efficient optical nonlinearities. To address this, we propose an optical neural computing architecture by embedding quantum emitters in inverse-designed nanophotonic structures. Due to their saturability, quantum emitters exhibit exceptionally strong nonlinearity compared with conventional materials. Using physics-aware training, we numerically demonstrate that the proposed architecture can solve complex tasks, including nonlinear classification and reinforcement learning, within all-optical neural networks. To enable fair comparison across different platforms, we introduce a framework that quantitatively links nonlinearity to a network's expressive power. Analysis shows that our quantum activation operates at $\text{nW}/μ\text{m}^2$ intensity, which is seven orders of magnitude below the nonlinearity threshold of conventional optical materials. Looking ahead to large language models, we estimate the nonlinearity-limited optical power, which scales sublinearly with model size. Our results indicate that quantum nanophotonics may provide a route toward sustainable AI inference.

2512.14415 2026-06-19 quant-ph 85%

Ground State Energy via Adiabatic Evolution and Phase Measurement for a Molecular Hamiltonian on an Ion-Trap Quantum Computer

通过绝热演化和相位测量估算分子哈密顿量在离子阱量子计算机上的基态能量

Ludwig Nützel, Michael J. Hartmann, Henrik Dreyer, Etienne Granet

专题命中 物理仿真 :在离子阱量子计算机上估算分子基态能量

AI总结 本文通过绝热态制备和噪声鲁棒的迭代量子相位估算方法,研究了离子阱量子计算机在H3+分子六量子位编码中的基态能量测量,改进了经典Hartree-Fock能量并揭示了漏泄误差对化学精度的主要影响。

详情
AI中文摘要

估算分子基态能量是量子计算的核心应用,需要准确的量子态制备和高效的能量读出。理解硬件噪声对这些实验的影响至关重要,以区分低影响的误差、可缓解的误差和必须在硬件层面减少的误差。我们在一个离子阱量子计算机上运行了一个态制备和能量测量协议,没有将计算任务非可扩展地卸载到经典计算机上,并展示了漏泄误差是化学精度的主要障碍。更具体地说,我们应用绝热态制备来制备六量子位编码的H3+分子的基态,并利用噪声鲁棒的迭代量子相位估算变体提取其能量。我们的结果优于经典Hartree-Fock能量。分析硬件噪声对结果的影响,我们发现尽管相干和非相干噪声影响较小,但硬件结果主要受漏泄误差影响。在没有漏泄误差的情况下,噪声数值模拟显示,即使包含射频噪声,我们的实验设置也能接近化学精度。这些见解突显了未来算法和硬件开发中针对漏泄抑制的重要性。

英文摘要

Estimating molecular ground-state energies is a central application of quantum computing, requiring both the preparation of accurate quantum states and efficient energy readout. Understanding the effect of hardware noise on these experiments is crucial to distinguish errors that have low impact, errors that can be mitigated, and errors that must be reduced at the hardware level. We ran a state preparation and energy measurement protocol on an ion-trap quantum computer, without any non-scalable off-loading of computational tasks to classical computers, and show that leakage errors are the main obstacle to chemical accuracy. More specifically, we apply adiabatic state preparation to prepare the ground state of a six-qubit encoding of the H3+ molecule and extract its energy using a noise-resilient variant of iterative quantum phase estimation. Our results improve upon the classical Hartree-Fock energy. Analyzing the effect of hardware noise on the result, we find that while coherent and incoherent noise have little influence, the hardware results are mainly impacted by leakage errors. Absent leakage errors, noisy numerical simulations show that with our experimental settings we would have achieved close to chemical accuracy, even shot noise included. These insights highlight the importance of targeting leakage suppression in future algorithm and hardware development.

2510.21290 2026-06-19 math.NA cs.NA 版本更新 85%

A Variational Framework for the Complexity of PDE Solutions

偏微分方程解复杂性的变分框架

Juan Esteban Suarez Cardona, Holger Boche, Gitta Kutyniok

专题命中 物理仿真 :基于变分框架分析PDE解的可计算性和复杂性。

AI总结 提出基于最小二乘变分公式和梯度流的框架,从优化角度分析PDE解的可计算性和复杂性,建立多项式时间逼近与复杂性爆炸的充分条件。

详情
AI中文摘要

偏微分方程是描述物理现象的基本数学模型,但大多数实际感兴趣的PDE需要数值近似。这些方法的可行性受到现有计算模型的限制。由于数字计算机是数值计算的主要实现,而图灵机定义了其理论极限,因此PDE解的可计算性具有根本意义。它提供了一个严格的框架来区分有效可解的方程与那些编码了不可判定或不可计算行为的方程。一旦可计算性确立,复杂性理论量化了近似PDE解所需的资源。在这项工作中,我们提出了一个基于最小二乘变分公式和相关梯度流的新框架,从优化角度分析PDE解的可计算性和复杂性。我们的方法通过离散梯度流近似PDE解算子,将PDE性质(如强制性、椭圆性和凸性)与解复杂性联系起来。在此设置下,我们刻画了依赖于表示和离散化的充分条件,用于PDE允许多项式时间逼近的情形,以及出现复杂性爆炸(即多项式时间输入数据产生超多项式复杂性的解)的情形。总之,本文开发了一个用于分析PDE解类可计算性和计算复杂性的变分框架。结果展示了PDE结构和解正则性如何通过建立可计算性和复杂性界限的充分条件来影响其复杂性。除了理论刻画,该框架为有效数值方法提供了指导,并有助于理解数字计算在PDE问题上的局限性。

英文摘要

Partial Differential Equations (PDEs) are fundamental mathematical models for describing physical phenomena, yet most PDEs of practical interest require numerical approximations. The feasibility of such methods is constrained by existing computational models. Since digital computers are the primary realizations of numerical computations, and Turing machines define their theoretical limits, computability of PDE solutions is of fundamental significance. It provides a rigorous framework to distinguish equations that are effectively solvable from those that encode undecidable or non-computable behavior. Once computability is established, complexity theory quantifies the resources required to approximate PDE solutions. In this work, we present a novel framework based on least-squares variational formulations and associated gradient flows to analyze the computability and complexity of PDE solutions from an optimization perspective. Our approach approximates PDE solution operators via discrete gradient flows, linking PDE properties, such as coercivity, ellipticity, and convexity, to solution complexity. Within this setting, we characterize representation- and discretization-dependent sufficient conditions for regimes where PDEs admit polynomial-time approximations, as well as regimes exhibiting complexity blowup, where polynomial-time input data produce solutions with super-polynomial complexity. In summary, this paper develops a variational framework for analyzing computability and computational complexity of PDE solution classes. The results show how PDE structure and solution regularity influence their complexity, by establishing sufficient conditions for computability and complexity bounds. Beyond the theoretical characterization, the framework provides guidelines for effective numerical methods and contributes to understanding the limitations of digital computation for PDE problems.

2511.22558 2026-06-19 gr-qc hep-th math-ph math.MP 版本更新 85%

A Universal Smarr Formula via Coupling Constants

通过耦合常数的通用Smarr公式

Kamal Hajian, Bayram Tekin, Onur Ucanok

专题命中 物理仿真 :提出引力理论中耦合常数作为热力学变量的通用Smarr公式。

AI总结 提出将引力理论中所有有量纲耦合常数视为热力学变量,通过引入辅助标量场和规范场,使Smarr公式和第一定律得到一致扩展,实现黑洞热力学的通用表述。

Comments 20 pages, published version with some typos removed

Journal ref Eur.Phys.J.C 86 (2026) 5, 541

详情
AI中文摘要

在包含物质场和高阶导数修正的引力理论中,除非所有有量纲耦合被一致地纳入,否则标准的Smarr公式往往失效。传统上,诸如宇宙学常数或高阶导数项的系数被视为理论的不变特征,因此被排除在热力学相空间之外。在我们最近的工作中,我们发展了一个完全通用的框架,将每个这样的耦合提升为黑洞解的一个动力学、自由变化的参数。这是通过为每个耦合引入一个辅助标量场和规范场来实现的,通过这些场,耦合作为与涌现规范对称性的全局部分相关联的守恒电荷出现。相应的共轭变量自然地作为在黑洞视界处评估的电势出现。结果,第一定律和Smarr关系获得了额外的、系统确定的贡献,产生了黑洞热力学的一致且通用的扩展。我们通过重新审视文献中的几个黑洞例子来证明这一构造的有效性,在这些例子中,即使将宇宙学常数视为热力学变量,Smarr公式仍然不一致。我们的分析表明,只有通过这种广义方式包含所有有量纲耦合,才能获得内部一致的Smarr关系,从而为真正通用的黑洞热力学表述提供基础。

英文摘要

In gravitational theories containing matter fields and higher-derivative corrections, the standard Smarr formula often fails unless all dimensionful couplings are incorporated consistently. Traditionally, parameters such as the cosmological constant or the coefficients of higher-derivative terms are regarded as immutable features of the theory and therefore excluded from the thermodynamic phase space. In our recent work, we developed a fully general framework that promotes every such coupling to a dynamical, freely varying parameter of black hole solutions. This is accomplished by introducing, for each coupling, an auxiliary scalar and gauge field, through which the coupling appears as a conserved charge associated with the global sector of an emergent gauge symmetry. The corresponding conjugate variables naturally arise as electric potentials evaluated at the black hole horizon. As a result, the first law and the Smarr relation acquire additional, systematically determined contributions, yielding a consistent and universal extension of black hole thermodynamics. We illustrate the validity of this construction by revisiting several black hole examples in the literature where the Smarr formula remains inconsistent even after treating the cosmological constant as a thermodynamic variable. Our analysis shows that only by including all dimensionful couplings in this generalized manner can one obtain an internally consistent Smarr relation, thereby providing the foundation for a truly universal formulation of black hole thermodynamics.

2511.18341 2026-06-19 cond-mat.str-el 版本更新 85%

Phase Structure and Machine Learning Identification in One Dimensional Systems with Power Law Correlated Disorder and Long Range Hopping

具有幂律关联无序和长程跳跃的一维系统中的相结构与机器学习识别

Mohammad Pouranvari

专题命中 物理仿真 :研究一维无序系统的相结构,结合机器学习识别。

AI总结 研究一维紧束缚模型,其中位势具有幂律空间关联(指数α),跳跃振幅按|i-j|^{-β}衰减。通过大规模精确对角化,结合谱统计、态密度分析和能量分辨局域化指标,构建(α,β)平面上的完整相图,揭示稳健的迁移边和多重谱共存区域,并利用监督自编码器验证相分类。

Journal ref Sci Rep 16, 17720 (2026)

详情
AI中文摘要

我们研究了一个一维紧束缚模型,其中在位势$\{\varepsilon_i\}$具有幂律空间关联(指数$\alpha$),跳跃振幅按$t_{ij}\sim |i-j|^{-\beta}$衰减。这个双参数族在短程安德森型无序、具有常规跳跃的关联无序以及具有非平凡离域化趋势的长程跳跃模型之间连续插值。通过大规模精确对角化,我们结合谱统计、态密度分析和能量分辨局域化指标(如参与比、单粒子纠缠熵、能级间距比$r$以及几何与算术态密度之比),构建了$(\alpha,\beta)$平面上的完整相图。从这些可观测量中,我们定义了相指示函数,以紧凑地量化整个谱上的局域化行为。我们的分析揭示了稳健的迁移边以及局域态、扩展态、共振态和临界态之间的多重谱共存区域。通过基于显式平滑代价函数的有限尺寸标度,我们能够提取临界指数并描绘$(\alpha,\beta)$参数空间中的转变线。为了验证和补充这些基于物理的诊断,我们采用了一个监督自编码器,直接从原始特征学习本征态结构的高层表示,并可靠地再现由指示函数定义的相分类。这些方法共同提供了由关联无序和长程跳跃驱动的谱转变的一致且自洽的图像,为表征长程一维系统中的迁移边建立了统一框架。

英文摘要

We investigate a one-dimensional tight-binding model in which onsite potentials $\{\varepsilon_i\}$ exhibit power-law spatialcorrelations (with exponent $α$) and the hopping amplitudes decay as $t_{ij}\sim |i-j|^{-β}$. This two-parameter family interpolates continuously between short-range Anderson-like disorder, correlated disorder with conventional hopping, and long-range hopping models with nontrivial delocalization tendencies. Using large-scale exact diagonalization, we construct a comprehensive phase map in the $(α,β)$ plane by combining spectral statistics, density-of-states analysis, and energy-resolved localization indicators such as the participation ratio, single-particle entanglement entropy, level-spacing ratio $r$, and the ratio of the geometric to arithmetic density of states. From these observables we define phase-indicator functions that compactly quantify localization behavior across the spectrum. Our analysis reveals robust mobility edges and multiple regimes of spectral coexistence between localized, extended, resonant, and critical states. Finite-size scaling, implemented via an explicit smoothness-based cost function, enables extraction of critical exponents and delineation of transition lines across the $(α,β)$ parameter space. To validate and complement these physics-based diagnostics, we employ a supervised autoencoder that learns high-level representations of eigenstate structure directly from raw features and reliably reproduces the phase classification defined by the indicator functions. Together, these approaches provide a coherent and internally consistent picture of the spectral transitions driven by correlated disorder and long-range hopping, establishing a unified framework for characterizing mobility edges in long-range one-dimensional systems.

2509.11951 2026-06-19 math.NA cs.NA math.AP 版本更新 85%

X-ray imaging from nonlinear waves: numerical reconstruction of a cubic nonlinearity

非线性波X射线成像:三次非线性的数值重建

Suvi Anttila, Markus Harju, Teemu Tyni

专题命中 物理仿真 :非线性波方程反问题数值重建,X射线成像。

AI总结 针对2+1维非线性波动方程的反边界值问题,提出基于Radon变换的直接数值重建方法,通过谱正则化稳定数值微分,实现从边界测量恢复势函数。

Comments 26 pages, 10 figures. Revised version based on peer-review feedback with improvements to Theorem 1, an addition of Theorem 2, and an additional figure in the time-dependent case

详情
AI中文摘要

我们研究了$2+1$维非线性波动方程的反边界值问题。目标是利用实值波从相关的Dirichlet-to-Neumann映射中恢复未知势$q(x, t)$。我们提出了一种直接数值重建方法,用于$q$的Radon变换,然后可以使用标准的X射线断层扫描技术反演以确定$q$。我们的实现引入了一种谱正则化程序,以稳定重建中所需的数值微分步骤,提高了对边界数据噪声的鲁棒性。我们给出了噪声测量正则化谱微分的严格证明和最优稳定性估计,这可能具有独立的意义。数值实验证明了从非线性波的边界测量中恢复势的可行性,并说明了基于Radon重建的优势。

英文摘要

We study an inverse boundary value problem for the nonlinear wave equation in $2 + 1$ dimensions. The objective is to recover an unknown potential $q(x, t)$ from the associated Dirichlet-to-Neumann map using real-valued waves. We propose a direct numerical reconstruction method for the Radon transform of $q$, which can then be inverted using standard X-ray tomography techniques to determine $q$. Our implementation introduces a spectral regularization procedure to stabilize the numerical differentiation step required in the reconstruction, improving robustness with respect to noise in the boundary data. We give rigorous justification and optimal stability estimates for the regularized spectral differentiation of noisy measurements, which may be of independent interest. Numerical experiments demonstrate the feasibility of recovering potentials from boundary measurements of nonlinear waves and illustrate the advantages of the Radon-based reconstruction.

2508.01391 2026-06-19 cond-mat.soft cond-mat.mtrl-sci cond-mat.stat-mech 85%

Force and geometric signatures of the creep-to-failure transition in a granular pile

颗粒堆中蠕变-破坏过渡的力与几何特征

Qing Hao, Luca Montoya, Elena Lee, Luke K. Davis, Cacey Stevens Bester

专题命中 物理仿真 :研究颗粒堆蠕变破坏的力学机制,属于物理仿真。

AI总结 研究通过实验探讨颗粒堆中蠕变与破坏的特征,分析力网络和空隙几何结构的变化,揭示蠕变-破坏过渡的力学与几何机制。

详情
AI中文摘要

颗粒蠕变是由于颗粒尺度相互作用无序性导致的颗粒堆中缓慢的亚屈服运动。尽管蠕变在无序材料中普遍存在,但如何基于力和相互作用预测蠕变-破坏阶段仍不明确。为此,我们通过实验研究准二维颗粒堆中的蠕变与破坏,量化颗粒运动和颗粒尺度接触力网络。通过控制外部扰动,研究颗粒重组、力网络和空隙的出现与演变,以揭示蠕变和破坏的特征。令人惊讶的是,力链结构在无明显颗粒运动时仍保持动态。我们发现力链的移动预示着更大的雪崩级破坏。我们将这些力特征与堆中空隙的几何结构联系起来。总体而言,我们的新实验和分析加深了对颗粒系统蠕变-破坏过渡的机械和几何理解。

英文摘要

Granular creep is the slow, sub-yield movement of constituents in a granular packing due to the disordered nature of its grain-scale interactions. Despite the ubiquity of creep in disordered materials, it is still not understood how to best predict the creep-to-failure regime based on the forces and interactions among constituents. To address this gap, we perform experiments to explore creep and failure in quasi two-dimensional piles of photoelastic disks, allowing the quantification of both grain movements and grain-scale contact force networks. Through controlled external disturbances, we investigate the emergence and evolution of grain rearrangements, force networks, and voids to illuminate signatures of creep and failure. Surprisingly, the force chain structure remains dynamic even in the absence of observable particle motion. We find that shifts in force chains provide an indication to larger, avalanche-scale disruptions. We connect these force signatures with the geometry of the voids in the pile. Overall, our novel experiments and analyses deepen our mechanical and geometric understanding of the creep-to-failure transition in granular systems.

2507.18770 2026-06-19 cond-mat.mes-hall cond-mat.str-el quant-ph 版本更新 85%

Propagating Collective Spin-valley Modes in Twisted WSe2

扭曲WSe2中的传播性集体自旋谷模式

Richen Xiong, Yi Guo, Chenxin Qin, Taige Wang, Fanzhao Yin, Samuel L. Brantly, Youngjoon Choi, Junhang Qi, Jinfei Zhou, Zihan Zhang, Melike Erdi, Kenji Watanabe, Takashi Taniguchi, Shu Zhang, Seth Ariel Tongay, Andrea F. Young, Liang Fu, Chenhao Jin

专题命中 物理仿真 :扭曲WSe2中集体模式研究,属于物理仿真。

AI总结 通过超快成像技术在扭曲WSe2中发现了两种不同速度的传播性集体模式,快模式与IVC态的Goldstone模式一致,慢模式为有隙振幅模式,首次在凝聚态系统中成像了超流体的自旋谷类比集体模式。

Journal ref Nature Physics 22 877-883 (2026)

详情
AI中文摘要

中性集体模式的出现是关联量子相的一个标志,但通常在实验上难以探测。在二维平带系统中,电荷响应已被深入研究,而中性激发仍 largely 未被探索。特别是,谷间相干态(IVC)由于自发破缺的谷U(1)对称性而具有中性Goldstone模式。尽管IVC态已被提出作为石墨烯和半导体系统的统一主题,但其定义特征——中性Goldstone模式——在实验中仍然 elusive。在这里,我们通过一种新颖的超快成像技术,研究了扭曲WSe2莫尔超晶格中中性模式的空间和时间分辨输运。我们在中等角度(3.5至4度)和大角度(约5度)扭曲WSe2的范霍夫奇点(VHS)附近发现了两种具有非常不同速度的新传播性集体模式。快速传播模式的速度约为3 km/s,与IVC态的Goldstone模式一致,而慢速模式可能是一个有隙振幅模式。它们可以被理解为超流体集体模式的自旋谷类比,其传播首次在凝聚态系统中被成像。我们的研究展示了一种探测量子材料中电荷中性模式的新方法,并为莫尔超晶格中电荷与自旋谷物理之间的相互作用提供了关键见解。

英文摘要

The emergence of neutral collective modes is a hallmark of correlated quantum phases but is often challenging to probe experimentally. In two-dimensional flatband systems, charge responses have been intensively investigated yet neutral excitations remain largely unexplored. In particular, intervalley coherent state (IVC) features a neutral Goldstone mode due to spontaneously broken valley U(1) symmetry. While IVC state has been proposed as a unifying theme across graphene and semiconductor based systems, its defining feature, the neutral Goldstone mode, remains elusive in experiment. Here we investigate space and time resolved transport of neutral modes in twisted WSe2 moire superlattices through a novel ultrafast imaging technique. We uncover two new propagating collective modes with very different velocities, which emerge near the van Hove singularity (VHS) in both intermediate (3.5 to 4 degree) and large (around 5 degree) angle twisted WSe2. The fast-propagating mode has a large speed of about 3 km/s and is consistent with a Goldstone mode for an IVC state, while the slow-moving mode is likely a gapped amplitude mode. They can be understood as the spin-valley analogues of collective modes of a superfluid, whose propagation is imaged for the first time in a condensed matter system. Our study demonstrates a powerful new approach for probing charge-neutral modes in quantum materials and offers key insights into the interplay between charge and spin-valley physics in moire superlattices.

2606.20053 2026-06-19 cs.LG 新提交 80%

Comparative Study of Neural Surrogate Architectures for Autoregressive Prediction of Internal Battery States

用于电池内部状态自回归预测的神经代理架构比较研究

Gihyun Lee, Thorben Menne, Simon Olma, Jakob Hilgert, Sangyoung Park

发表机构 * IAV GmbH(IAV公司)

专题命中 物理仿真 :用神经网络代理预测电池内部状态,属于科学智能。

AI总结 系统比较四种神经网络架构(MLP、ResNet、U-Net、FNO)作为自回归状态转移算子,预测锂离子电池DFN模型内部状态,发现U-Net因多尺度空间归纳偏置在精度和速度上最优。

Comments 8 pages, 5 figures

详情
AI中文摘要

Doyle-Fuller-Newman (DFN) 模型以高保真度解析锂离子电池的内部电化学状态。然而,其控制方程的数值求解对于实时部署而言计算成本过高,限制了从单个电池到电池组及车队规模应用的可扩展性。虽然机器学习代理可以通过GPU加速大幅降低推理延迟,但现有大多数方法学习的是特定操作条件下的解近似,而非可泛化的状态演化动力学。本文系统比较了四种神经网络架构(MLP、ResNet、U-Net、FNO),它们被构建为自回归状态转移算子,可预测广泛操作条件下的完整DFN内部状态。为确保受控的架构比较,所有模型在统一框架下训练,采用多步展开和电流条件化,隔离了空间归纳偏置的影响。结果表明,U-Net的多尺度特征层次在300步自回归展开后,所有内部状态变量的平均最终步nRMSE达到3%,同时相比数值求解器实现了5.38倍的加速。这些发现强调了空间归纳偏置是代理性能的关键决定因素,推动了用于下一代电池管理系统和数字孪生的内部状态可观测性代理的发展。

英文摘要

The Doyle-Fuller-Newman (DFN) model resolves internal electrochemical states in lithium-ion batteries with high fidelity. However, the numerical solution of its governing equations is computationally prohibitive for real-time deployment, limiting scalability from individual cells to pack and fleet-scale applications. While machine learning surrogates can substantially reduce inference latency through GPU acceleration, most existing approaches learn solution approximations tied to specific operating conditions rather than learning generalizable state-evolution dynamics. This work presents a systematic comparison of four neural network architectures (MLP, ResNet, U-Net, FNO) formulated as autoregressive state-transition operators that predict full DFN internal states across a wide range of operating conditions. To ensure a controlled architectural comparison, all models are trained under a unified framework using multi-step unrolling and current-conditioning, isolating the impact of spatial inductive bias. Results demonstrate that the U-Net's multi-scale feature hierarchy achieves a mean final-step nRMSE of 3% averaged across all internal state variables after 300-step autoregressive rollouts, while providing a 5.38x speed-up over the numerical solver. These findings highlight spatial inductive bias as a critical determinant of surrogate performance, advancing the development of surrogates for internal state observability for next-generation battery management systems and digital twins.

2606.20015 2026-06-19 cs.LG 新提交 80%

Adaptive Distance-Aware Trunk Deep Operator Learning for Long-Span Roadway Bridges

自适应距离感知主干深度算子学习用于大跨度公路桥梁

Bilal Ahmed, Diab W. Abueidda, Waleed El-Sekelly, Tarek Abdoun, Mostafa E. Mobasher

发表机构 * Urban Engineering Department , addressline= New York University Abu Dhabi , country= United Arab Emirates organization= National Center for Supercomputing Applications , addressline= University of Illinois at Urbana-Champaign , country= United States of America organization= Department of Structural Engineering , addressline= Mansoura University , country= Mansoura, Egypt

专题命中 物理仿真 :深度算子学习预测桥梁结构响应,属于科学智能。

AI总结 提出自适应主干DeepONet框架,通过KNN构建荷载相关学习域、距离感知特征和刚度-informed Schur补全重建,实现大跨度桥梁局部响应高精度快速预测,相对误差低于5%,速度提升约60倍。

Comments 39 pages, 26 figures

详情
AI中文摘要

大跨度公路桥梁在车辆荷载下表现出高度局部化的结构响应,使得重复有限元分析在影响面生成和结构数字孪生等应用中计算成本高昂。现有的科学机器学习方法难以准确捕捉这些局部响应。为解决这一挑战,本研究提出了一种自适应主干DeepONet用于大型桥梁系统的局部结构响应预测。该框架利用KNN策略动态构建荷载相关的学习域,使网络聚焦于结构影响区域。主干网络进一步通过距离感知特征增强,这些特征编码了荷载与结构节点之间的几何关系。通过刚度-informed Schur补全公式引入基于物理的全场重建,使得自适应节点上的预测能够扩展到整个结构域。为了实现可扩展训练,使用降阶等效壳模型生成响应数据,该模型保留了主要的全局行为,同时显著降低了计算成本。该框架在基准桥梁模型和真实世界的Mussafah桥上进行了验证。结果表明,该方法实现了有限元级别的精度,相对误差低于5%,同时将总响应评估时间(包括全场重建)减少了约60倍;排除后处理重建步骤,AD-DeepONet推理比有限元快四个数量级。此外,该框架能够在任意车辆荷载配置下快速生成全场响应、影响线和影响面,显示出在大规模桥梁分析和数字孪生应用中的巨大潜力。

英文摘要

Long-span roadway bridges exhibit highly localized structural responses under vehicular loading, making repeated FE analysis computationally expensive for applications such as influence surface generation and structural digital twins. Existing SciML approaches struggle to accurately capture these localized responses. To address this challenge, this study proposes an adaptive-trunk DeepONet for localized structural response prediction in large-scale bridge systems. The framework dynamically constructs a load-dependent learning domain using a KNN strategy, allowing the network to focus on structural influence zones. The trunk network is further enhanced using distance-aware features that encode the geometric relationship between the load and structural nodes. A physics-based full-field reconstruction is incorporated through a stiffness-informed Schur complement formulation, enabling predictions at adaptive nodes to be extended to the entire structural domain. To enable scalable training, response data are generated using a reduced-order equivalent shell model that preserves the dominant global behavior while significantly reducing computational cost. The proposed framework is validated on both a benchmark bridge model and the real-world Mussafah Bridge. Results show that the method achieves FEM-level accuracy with relative errors below 5%, while reducing the total response evaluation time (including full-field reconstruction) by approximately 60x; excluding the post-processing reconstruction step, the AD-DeepONet inference is up to four orders of magnitude faster than FEM. In addition, the framework enables rapid generation of full-field responses, influence lines, and influence surfaces under arbitrary vehicular loading configurations, demonstrating strong potential for large-scale bridge analysis and digital twin applications.

2. 材料化学 4 篇

2603.09855 2026-06-19 physics.plasm-ph 85%

Sparse identification of effective microparticle interaction potential in dusty plasma from simulation data

稀疏识别有效微粒相互作用势在等离子体中的应用

Zachary Brooks Howe, Lorin Swint Matthews, Truell Hyde, Luca Guazzotto, Evdokiya Kostadinova

专题命中 材料化学 :稀疏识别微粒相互作用势,等离子体物理。

AI总结 本文提出利用SINDy方法从模拟数据中稀疏识别微粒相互作用势,用于预测等离子体相变和结构形成。

Comments 11 pages, 4 figures. This work has been submitted to the Physics of Plasmas for possible publication

详情
AI中文摘要

识别粒子相互作用势是等离子体、胶体和智能材料中的关键任务,有助于表征结构形成并预测相变。随着机器学习方法的发展,该相互作用可以从粒子位置数据中提取,从而得到通用表达式,适用于不同系统。稀疏回归等方法旨在提供可解释的模型,避免因过拟合导致的不必要的复杂性。本文展示了使用稀疏非线性动力学识别(SINDy)方法结合弱公式,从两个尘粒在Yukawa(屏蔽库仑)势下的简单模拟数据中学习运动方程。讨论了这些方法在实验等离子体数据中的应用,特别是模拟数据和玻璃箱实验在射频放电重力环境和直流放电微重力环境中的应用,如Plasmakristall-4(PK-4)实验。

英文摘要

Identification of the particle interaction potential is a challenging and important task in dusty plasma, colloids, and smart materials as it allows the characterization of structure formation and helps predict phase transitions. With the advent of machine learning methods, this interaction can be extracted from particle position data, leading to a generalizable expression which is applicable in different systems. Methods such as sparse regression aim to provide a physically interpretable model that can generalize well, while avoiding unnecessary complexity due to overfitting. In this work, we present the use of the Sparse Identification of Nonlinear Dynamics (SINDy) with the weak formulation to learn equations of motion for noisy data from simple simulations of two dust particles interacting with a Yukawa (shielded Coulomb) potential. The application of these methods to experimental dusty plasma data is discussed, particularly in the case of simulation data and glass box experiments in RF discharge gravity environments and DC discharge microgravity environments, such as the Plasmakristall-4 (PK-4) experiment.

2602.20573 2026-06-19 cs.LG 版本更新 85%

MolGraphBench: A Benchmark of GNN Architectures for Molecular Regression Tasks

MolGraphBench:用于分子回归任务的GNN架构基准测试

Rajan, Ishaan Gupta

发表机构 * Rajan 1 Ishaan Gupta 2

专题命中 材料化学 :分子回归任务GNN基准测试,化学信息学。

AI总结 提出MolGraphBench基准,比较四种GNN模型在分子回归任务上的性能,发现GCN和GIN为最优架构,并指出GNN层类型应作为可调超参数。

Comments 14 pages, 5 figures and 4 tables

详情
AI中文摘要

分子通常表示为SMILES字符串,可以轻松转换为手工设计的描述符或指纹(FP)用于分子性质预测。研究表明,SMILES可以转换为分子图 $G = (V, E)$,其中原子为节点 $(V)$,键为边 $(E)$。这些分子图随后可用于训练图神经网络(GNN)模型。尽管近年来GNN(现有和新架构)在分子性质预测中的应用激增,但仍缺乏严格的基准测试。我们提出了MolGraphBench,一个包含四种常用GNN模型的全面基准测试,用于分子性质预测。基准测试结果表明,基于绝对性能、训练效率、迁移学习和预测质量,图卷积网络(GCN)和图同构网络(GIN)是分子图回归任务的最优GNN架构。研究还表明,在融合(GNN-FP)框架中,分子指纹具有非互补性。此外,我们的GNN模型在三个数据集上取得了优于或与当前最先进GNN基线相当的性能(B3DB上GCN的RMSE为0.518,FreeSolv上GIN-FP的RMSE为1.022,RT数据集上GIN的MAE为63.783)。本研究的发现表明,GNN层类型应被视为可调超参数,而非固定设计选择,以实现更优性能。

英文摘要

Molecules are often represented as SMILES strings, which can be readily converted to hand-crafted descriptors or fingerprints (FP) for molecular property prediction. Research has demonstrated that SMILES can be converted to molecular graphs $G = (V, E)$, with atoms as nodes $(V)$ and bonds as edges $(E)$. These molecular graphs can subsequently be used to train graph neural networks (GNN) models. Despite the recent surge in application of GNN (existing and novel architectures) for molecular property prediction, a rigorous benchmark is still lacking. We propose MolGraphBench, a comprehensive benchmark of four commonly used GNN models for molecular property prediction. Benchmarking results demonstrate graph convolutional network (GCN) and graph isomorphism networks (GIN) as the optimal GNN architectures for molecular graph regression tasks, based on absolute performance, training efficiency, transfer learning and prediction quality. The study also indicates the non-complementary nature of molecular fingerprints in the fusion (GNN-FP) framework. Furthermore, our GNN models achieved performance superior or comparable performance to current state-of-the-art GNN baselines across three datasets (GCN with RMSE of $0.518$ on B3DB, GIN-FP with RMSE of $1.022$ on FreeSolv and GIN with MAE of $63.783$ on RT datasets). Findings from this study indicate that type of GNN-layer, should be treated as a tunable hyperparameter rather than a fixed design choice to achieve superior performance.

2411.06778 2026-06-19 cond-mat.str-el 85%

Unraveling Intertwined Orders in the Strongly Correlated Kagome Metal CsCr3Sb5

解析强关联kagome金属CsCr3Sb5中的交织秩序

Liangyang Liu, Yidian Li, Hengxin Tan, Yi Liu, Kuanglv Sun, Ying Shi, Yuxin Zhai, Hao Lin, Guanghan Cao, Xianhui Chen, Tao Wu, Binghai Yan, Guang-Ming Zhang, Luyi Yang

专题命中 材料化学 :研究Kagome金属中电荷密度波与交织秩序

AI总结 研究通过超快光学技术揭示CsCr3Sb5中的电荷密度波相变,并发现三态Potts型各向异性秩序,揭示多轨道平带退简并现象。

Journal ref National Science Review 16, nwag044 (2026)

详情
AI中文摘要

尽管在扭曲系统中已广泛研究了平带相关现象,但源自kagome晶格材料内在平带相互作用产生的有序态仍鲜有探索。新发现的kagome金属CsCr3Sb5提供了一个独特的平台,其费米面多轨道平带导致压电超导、反铁磁、结构相变和密度波秩序的复杂相互作用。本文利用超快光学技术,提供了强谱学证据证明CsCr3Sb5中的电荷密度波相变,澄清了先前的歧义。关键地,我们识别出旋转对称性破缺,表现为三态Potts型各向异性。通过弹性电阻测量直接证明了该秩序的电子起源,因为旋转对称性破缺的E2g成分在相变温度附近表现出发散行为。这种奇异的各向异性源于多轨道平带退简并,类似于某些铁基超导体的现象。本研究开创了在费米面平带系统中研究超快动力学的先河,为强关联系统中多种基本激发之间的相互作用提供了新见解。

英文摘要

While correlated phenomena of flat bands have been extensively studied in twisted systems, the ordered states that emerge from interactions in the intrinsic flat bands of kagome lattice materials remain largely unexplored. The newly discovered kagome metal CsCr3Sb5 offers a unique and rich platform for this research, as its multi-orbital flat bands at the Fermi surface result in a complex interplay of pressurized superconductivity, antiferromagnetism, a structural phase transition, and density wave orders. Here, using ultrafast optical techniques, we provide strong spectroscopic evidence for a charge density wave transition in CsCr3Sb5, resolving previous ambiguities. Crucially, we identify rotational symmetry breaking that manifests as a three-state Potts-type nematicity. Our elastoresistance measurements directly demonstrate the electronic origin of this order, as the rotational-symmetry-breaking E2g component of the elastoresistance shows a divergent behaviour around the transition temperature. This exotic nematicity results from the lifting of degeneracy of the multi-orbital flat bands, akin to phenomena seen in certain iron-based superconductors. Our study pioneers the investigation of ultrafast dynamics in flat-band systems at the Fermi surface, offering new insights into the interactions between multiple elementary excitations in strongly correlated systems.

2601.18600 2026-06-19 cond-mat.mtrl-sci cond-mat.mes-hall 版本更新 85%

On-surface dehydrogenative lateral homo-coupling and aromatization of n-octane on Pt(111)

正辛烷在Pt(111)上的表面脱氢横向自偶联与芳构化

D. Arribas, E. Tosi, V. Villalobos-Vilda, B. Cirera, I. Palacio, A. Sáez-Coronado, P. Lacovig, A. Baraldi, L. Bignardi, S. Lizzit, C. Sanchez-Sanchez, A. Gutiérrez, J. A. Martín-Gago, M. Garnica, J. I. Martínez, P. L. de Andres, P. Merino

专题命中 材料化学 :表面催化芳构化与偶联反应

AI总结 利用扫描隧道显微镜和第一性原理计算,研究了正辛烷在Pt(111)表面热诱导芳构化及分子间脱氢偶联反应,揭示了环芳构化和拉链式C-C键形成机制。

Comments 24 pages, 1 scheme, 3 figures

详情
AI中文摘要

脂肪烃(如正构烷烃)是碳原子的天然丰富来源。特别令人感兴趣的是从脂肪族反应物形成环状和芳香族产物。结合扫描隧道显微镜和从头算计算,我们研究了线性正辛烷分子在催化Pt(111)表面上的热诱导芳构化以及它们在600 K以上温度下的分子间自偶联反应。单个正辛烷分子的环芳构化需要线性吸附物在脱氢前弯曲,并形成分子内C-C键,产生吸附的苯环。此外,Pt(111)表面通过引发化学吸附的正辛烷分子脱氢甲基末端之间C-C键的形成,然后以拉链式方式沿碳骨架传播,催化了自偶联反应。我们的发现为生成芳香族产物和稳定的表面多环物种的多相催化过程提供了分子层面的见解。

英文摘要

Aliphatic hydrocarbons, such as normal alkanes, constitute a naturally abundant source of carbon atoms. Of special interest is the formation of cyclic and aromatic products from aliphatic reactants. Combining scanning tunneling microscopy and ab initio calculations, we investigate the thermal induced aromatization of linear n octane molecules on the catalytic Pt(111) surface and the reactions of intermolecular homocoupling between them at temperatures above 600 K. The cycloaromatization of individual n octane molecules requires bending the linear adsorbates prior to their dehydrogenation and the formation of an intramolecular C-C bond, yielding adsorbed benzene rings. In addition, the Pt(111) surface catalyzes a homocoupling reaction by initiating the formation of a C-C bond between the dehydrogenated methyl ends of the chemisorbed n octane molecules and then propagating along the carbon backbone in a zipper like fashion. Our findings provide molecular level insight into the heterogeneous catalytic processes underlying the generation of aromatic products and stable on surface polycyclic species.

3. 气象气候 1 篇

2601.18182 2026-06-19 physics.ao-ph physics.data-an 85%

A strictly geostrophic product of sea-surface velocities from the SWOT fast-sampling phase

从SWOT快速采样阶段严格地转流产物的海面速度

Takaya Uchida, Badarvada Yadidya, Vadim Bertrand, Jia-Xian Chang, Brian Arbic, Jay Shriver, Julien Le Sommer

专题命中 气象气候 :利用动态模式分解从SWOT卫星数据提取地转流,属于海洋气象研究。

AI总结 本文提出利用动态模式分解方法从SWOT轨道中提取地转成分,提供涡度和应变的联合概率密度函数及SSHa谱,以解决地转平衡在测高观测中的应用问题。

Comments 25 pages with double spacing, 4 figures

详情
AI中文摘要

尽管地转平衡仍是提取海面高度异常(SSHa)速度信息的最简单和最实用的平衡方法,但海洋学界仍存在疑问,即这种平衡在SWOT卫星测高观测中的应用程度如何。鉴于SWOT的有限时间分辨率,许多研究倾向于声称空间滤波后的SSHa场对应地转成分,这引入了选择空间尺度的模糊性。本文基于最近的内部潮(IT)校正发展(Yadidya等,2025)和Lapo等(2025)引入的动力学模式分解(DMD)方法,从SWOT一天重复轨道中稳健地提取与次惯性频率相关的地转成分;我们将全球数据集作为公共产品分发。我们提供了涡度和应变的联合概率密度函数(PDF)以及几个交叉区域的SSHa谱。

英文摘要

While geostrophy remains the simplest and most practical balance to extract velocity information from sea-surface height anomaly (SSHa), confusions remain within the oceanographic community to what extent this balance can be applied to altimetric observations with the launch of the Surface Water and Ocean Topography (SWOT) satellite. Given the limited temporal resolution of SWOT, many studies have resorted to claiming that the spatially filtered SSHa fields correspond to the geostrophic component. This introduces the ambiguity of which spatial scale to choose. Here, we build upon the recent developments in internal tide (IT) corrections (Yadidya et al., 2025) and apply a dynamic mode decomposition (DMD)-based method introduced by Lapo et al. (2025) to robustly extract the geostrophic component associated with sub-inertial frequencies from the SWOT one-day-repeat orbit; we distribute the global dataset as a public good. We provide the joint probability density function (PDF) of vorticity and strain, and spectra of SSHa at a few cross-over regions.

4. 其他科学智能 4 篇

2606.20191 2026-06-19 stat.ML stat.ME 新提交 80%

AK-MCS-C2 : Active Kriging Monte Carlo Simulation method with conformal certification for failure probability estimation

AK-MCS-C2: 具有共形认证的主动克里金蒙特卡洛模拟方法用于失效概率估计

Edgar Jaber, Vincent Chabridon, Mathilde Mougeot

专题命中 其他科学智能 :主动学习框架用于结构可靠性失效概率估计

AI总结 提出一种结合主动克里金蒙特卡洛模拟与共形预测的主动学习框架,通过自适应交叉共形策略和J+GP共形估计器,在少量样本下提供无分布假设的预测误差保证,提高极限状态面附近样本分类可靠性,从而提升失效概率估计的准确性和鲁棒性。

详情
AI中文摘要

我们提出了一种新颖的主动学习框架,用于结构可靠性分析中的失效概率估计,该框架将主动克里金蒙特卡洛模拟与共形预测相结合。所提出的方法采用了一种自适应交叉共形策略,专门针对小样本设置和基于J+GP共形估计器的克里金代理模型设计。与标准的AK-MCS方法不同,所提出的框架对预测误差提供了无分布假设的保证,从而对极限状态面附近的样本进行更可靠的分类。这种改进的不确定性量化增强了失效概率估计的准确性和鲁棒性,特别是在这种效率至关重要的罕见事件区域。可重复的数值结果说明了该方法的有效性,并在公认的基准测试上将其与经典方法进行了比较。

英文摘要

We introduce a novel active-learning framework for failure probability estimation in structural reliability analysis that integrates Active Kriging Monte Carlo simulation with conformal prediction. The proposed approach employs an adaptive cross-conformal strategy specifically designed for small-sample settings and kriging surrogate models using the J+GP conformal estimator. Unlike standard AK-MCS methods, the proposed framework provides distribution-free guarantees on prediction errors, leading to more reliable classification of samples near the limit-state surface. This improved uncertainty quantification enhances both the accuracy and robustness of failure probability estimates, especially for rare-event regimes where such efficiency is crucial. Reproducible numerical results illustrate the effectiveness of the method and also compare it to classical approaches on well-established benchmarks.

2606.19540 2026-06-19 stat.ME stat.CO stat.ML 新提交 80%

Overfitted high-dimensional matrix factorizations via adaptive spectral shrinkage

通过自适应谱收缩的过拟合高维矩阵分解

Lorenzo Mauri, David B. Dunson

专题命中 其他科学智能 :提出EigenBayes方法用于高维因子模型,应用基因组学

AI总结 提出EigenBayes方法,通过谱估计和自适应经验贝叶斯校准超参数,实现快速且具有不确定性量化的过拟合因子模型,在数值实验和基因组学应用中优于现有方法。

详情
AI中文摘要

因子模型是分析高维数据以提取低秩信号和估计协方差的常用方法。它们将协方差矩阵分解为低秩分量和对角分量之和。一个关键问题是如何选择潜在维度$k$,当因子模型仅近似成立且信噪比较低时,这尤其具有挑战性。贝叶斯过拟合因子模型指定$k$的上界,并依赖结构化收缩先验有效去除多余分量。这类方法流行且有效,但计算成本高。我们提出了一种更快的\texttt{EigenBayes}方法,基于潜在因子的谱估计和关键超参数的自适应经验贝叶斯校准,提供有效的不确定性量化。得到的后验分布可跨结果分解且解析可处理,绕过了马尔可夫链蒙特卡洛。我们证明\texttt{EigenBayes}能适应每个结果和潜在维度的信噪比,同时将多余的潜在分量收缩至零。我们建立了良好的渐近性质,并在数值实验和基因组学应用中展示了强大的实证性能,其中EigenBayes优于最先进的替代方法。

英文摘要

Factor models are popular approaches for analyzing high-dimensional data to extract low-rank signals and estimate covariances. They decompose the covariance matrix as the sum of low-rank and diagonal components. A key issue is how to choose the latent dimension $k$, which is particularly challenging when the factor model only holds approximately and in low signal-to-noise scenarios. Bayesian overfitted factor models specify an upper bound on $k$ and rely on structured shrinkage priors to effectively remove extra components. Such approaches are popular and effective, but computationally expensive. We propose a much faster \texttt{EigenBayes} approach that provides valid uncertainty quantification, based on spectral estimation of latent factors and adaptive empirical Bayes calibration of key hyperparameters. The resulting posterior distribution factorizes across outcomes and is analytically tractable, bypassing Markov chain Monte Carlo. We show that \texttt{EigenBayes} adapts to the signal-to-noise ratio of each outcome and latent dimension, while shrinking superfluous latent components to zero. We establish favorable asymptotic properties and demonstrate strong empirical performance in numerical experiments and a genomics application, where EigenBayes outperforms state-of-the-art alternatives.

2606.19739 2026-06-19 q-bio.NC 新提交 80%

Robust probabilistic measurement of structural-functional module consistency in infant brain development

婴儿大脑发育中结构-功能模块一致性的鲁棒概率测量

Lingbin Bian, Feihong Liu, Qian Wang, Han Zhang, Dinggang Shen, the UNC/UMN Baby Connectome Project Consortium

专题命中 其他科学智能 :婴儿脑网络结构-功能一致性概率测量方法

AI总结 提出基于随机模块的概率方法,鲁棒测量婴儿大脑结构-功能模块一致性,发现0-5岁间一致性下降,初级脑区一致性更高。

详情
AI中文摘要

脑网络通常被划分为模块,用于分析其在神经影像学研究的群体分析中功能分离的角色。这里,我们引入脑网络中的随机模块,用于在受试者群体中对结构-功能模块一致性(SFMC)进行鲁棒的概率测量。具体而言,随机模块可被视为一个脑区在受试者间可能被分配到群体级子网络的机会,其特征为该脑区的分配概率。这种新方法在评估脑网络中的非均匀模块方面有两个优势。首先,它可以鲁棒地评估脑结构模块与功能模块之间的一致性,而两者的群体规模不必相同;其次,它能够考虑群体中模块的个体间变异性。此外,与传统的结构-功能耦合方法相比,我们的基于随机模块的方法揭示了结构与功能之间耦合的更显著下降,表明更强的发育重组。我们使用婴儿连接组项目(BCP)数据集的结果显示,SFMC在0至5岁期间下降,并且在初级脑区(如视觉区域)较高,而在更高级的认知区域(包括与注意力、控制和默认模式网络相关的区域)较低。

英文摘要

Brain network is commonly divided into modules for analyzing their functionally segregated roles for group-level analysis in neuroimaging studies. Here, we introduce stochastic modules within brain networks for a robust probabilistic measurement of structural-functional module consistency (SFMC) in a group of subjects. Specifically, a stochastic module can be regarded as the chance of a brain region across subjects potentially being assigned to a group-level sub-network, characterized as an assignment probability for this brain region. This novel method has two advantages for evaluating inhomogeneous modules in brain networks. The first is that it can robustly evaluate the consistency between brain structural and functional modules whose population sizes are not necessary the same, and the second is that it is able to take into account the inter-individual variability of the modules for the groups. Moreover, compared with the conventional structural-functional coupling approach, our stochastic module-based method reveals a more pronounced decline in the coupling between structure and function, indicating stronger developmental reorganization. Our results using the dataset from Baby Connectome Project (BCP) show that the SFMC decreases from 0 to 5 years old, and is greater in primary brain regions, such as visual areas, while lower in more advanced cognitive regions, including those related to attention, control, and default mode network.

2606.19560 2026-06-19 cs.LG 新提交 80%

Understanding Key Features of Time Series Foundation Models from Epidemic Forecasting

从流行病预测理解时间序列基础模型的关键特征

Alireza Jafari, Judy Fox, Geoffrey C. Fox, Madhav Marathe, Aniruddha Adiga

发表机构 * Department of Computer Science, School of Engineering and Applied Science, University of Virginia(弗吉尼亚大学工程与应用科学学院计算机科学系) School of Data Science, University of Virginia(弗吉尼亚大学数据科学学院) Biocomplexity Institute, University of Virginia(弗吉尼亚大学生物复杂性研究所) Department of Electrical and Computer Engineering, School of Engineering and Applied Science, University of Virginia(弗吉尼亚大学工程与应用科学学院电气与计算机工程系)

专题命中 其他科学智能 :评估时间序列模型用于流行病预测,属于科学智能

AI总结 系统评估多种时间序列模型在流感预测中的表现,发现混合专家模型性能最优,预训练在长时域提升显著,而LLM方法效果较差。

Comments 15 pages, 2 figures, 9 tables

详情
AI中文摘要

季节性流感每年感染数百万人,并在美国造成大量发病和死亡,因此准确的短期预测成为核心公共卫生需求。可靠的流行病时间序列预测可以为疫苗接种时机、医院人员配备和资源分配提供信息,然而现代预测架构在传染病监测数据上的比较行为仍未得到充分表征。我们通过系统评估区域流感预测来填补这一空白,使用流感样疾病监测和流感相关住院时间序列,在时间泛化和空间泛化设置下进行1-4周提前预测。我们比较了经典神经网络架构、基于数值的Transformer模型、预训练时间序列基础模型和基于LLM的预测方法。在各项任务中,我们证明融合多个预训练预测器的混合专家模型实现了最强的整体性能,表明异质预训练表示提供了互补的预测信息。我们的结果进一步表明,基于数值的Transformer模型产生可靠的预测,而预训练在更长时域上提供最大增益,特别是当预训练领域与流感动力学机制一致时。相比之下,基于LLM的时间序列方法在此设置下表现不如数值预测器。最后,我们研究了住院信息作为辅助协变量和预训练源的作用。住院信号在特定设置中提供了互补的改进,并阐明了额外的监测流如何增强多时域预测的鲁棒性。这些发现为流感防范的模型选择、预训练策略和辅助信号使用提供了可操作的指导。

英文摘要

Seasonal influenza infects millions of people and causes substantial morbidity and mortality in the United States each year, making accurate short-term forecasting a core public-health need. Reliable forecasts of epidemic time series can inform vaccination timing, hospital staffing, and resource allocation, yet the comparative behavior of modern forecasting architectures on infectious-disease surveillance data remains insufficiently characterized. We address this gap through a systematic evaluation of regional influenza forecasting using influenza-like illness surveillance and influenza-associated hospitalization time series under both temporal and spatial generalization settings for 1-4-week-ahead prediction. We compare classical neural network architectures, numerical transformer-based models, pretrained time series foundation models, and LLM-based forecasting approaches. Across tasks, we demonstrate that a mixture-of-experts model that fuses multiple pretrained forecasters achieves the strongest overall performance, indicating that heterogeneous pretrained representations provide complementary predictive information. Our results further show that numerical transformer-based models produce reliable forecasts, while pretraining provides the largest gains at longer horizons, particularly when the pretraining domain is mechanistically aligned with influenza dynamics. In contrast, LLM-based time series methods underperform relative to numerical forecasters in this setting. Finally, we examine hospitalization information as both an auxiliary covariate and a pretraining source. Hospitalization signals provide complementary improvements in selected settings and clarify when additional surveillance streams enhance the robustness of multi-horizon forecasting. These findings provide actionable guidance on model selection, pretraining strategy, and auxiliary-signal use for influenza preparedness.

5. AI制药 2 篇

2606.19624 2026-06-19 cs.LG 新提交 80%

MassSpecGym in the Wild: Uncovering and Correcting Evaluation Pitfalls in AI-Driven Molecule Discovery

MassSpecGym in the Wild: 揭示并纠正AI驱动分子发现中的评估陷阱

Hongxuan Liu, Roman Bushuiev, Ivy Lightheart, Mrunali Manjrekar, Anton Bushuiev, Magdalena Lederbauer, Filip Jozefov, Yinkai Wang, Soha Hassoun, Josef Sivic, James Taylor, Runzhong Wang, David Healey, Tomáš Pluskal, Connor W. Coley

发表机构 * Massachusetts Institute of Technology(麻省理工学院) Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University in Prague(捷克信息学、机器人学与自动化捷克技术大学) Enveda Biosciences(Enveda 生物科技) Tufts University(塔夫茨大学)

专题命中 AI制药 :审查AI驱动分子发现中的评估陷阱,以MassSpecGym为例。

AI总结 本文系统审查了基于串联质谱的分子发现中机器学习模型的评估问题,以MassSpecGym基准为例,发现26篇论文中至少17篇存在数据泄露、捷径学习和实现错误三类问题,并通过实验量化影响,提出改进建议并发布MassSpecGym v1.5。

详情
AI中文摘要

可靠的基准测试对于开发基于串联质谱(MS/MS)分子发现的机器学习模型至关重要。实验设计和模型评估过程中的细微问题会降低此类基准的可信度,并导致错误结论。我们以标准MassSpecGym基准套件为例,对近期MS/MS机器学习文献中的模型评估问题进行了全面审查,以说明这些问题的影响。在采用MassSpecGym基准的第一年内,我们发现在26篇报告MassSpecGym基准结果的论文中,至少有17篇存在评估问题。我们将失败原因归纳为三类:(i) 数据泄露,(ii) 捷径学习,以及(iii) 实现错误和指标分歧。通过大量实验和代码复现,我们量化了这些问题的影响,并展示了它们如何破坏MassSpecGym旨在强制执行的评估标准。我们将研究结果提炼为适用于MS/MS挑战、基准和自定义评估设置的建议。我们还发布了MassSpecGym v1.5,这是我们在MassSpecGym基准套件中实施建议的版本,解决了本次审计中发现的失败模式。MassSpecGym v1.5可从此https URL公开获取。

英文摘要

Reliable benchmarking is critical for developing machine learning models for tandem mass spectrometry (MS/MS) based molecule discovery. Subtle issues in experimental design and model evaluation procedures can degrade the trustworthiness of such benchmarks and lead to erroneous conclusions. We conduct a thorough review of model evaluation issues in the recent MS/MS machine learning literature, using the standard MassSpecGym benchmark suite as a case study to illustrate the impact of these issues. We find evaluation issues in at least 17 of 26 papers reporting MassSpecGym benchmark results in the first year of its adoption. We isolate three classes of failures: (i) data leakage, (ii) shortcut learning, and (iii) implementation bugs and metric divergence. Through extensive experimentation and code replication, we quantify the impact of these issues and show how they corrupt the evaluation standards MassSpecGym was designed to enforce. We distill our findings into recommendations generalizable to MS/MS challenges, benchmarks, and custom evaluation setups. We also release MassSpecGym v1.5, an implementation of our recommendations in the MassSpecGym benchmarking suite which addresses the failure modes identified in this audit. MassSpecGym v1.5 is publicly available at https://github.com/pluskal-lab/MassSpecGym.

2606.19496 2026-06-19 cs.LG 新提交 80%

Calibrating Generative Models to Feature Distributions with MMD Finetuning

使用MMD微调将生成模型校准到特征分布

Nathaniel L. Diamant, Brian L. Trippe

发表机构 * Stanford University(斯坦福大学)

专题命中 AI制药 :校准生成模型特征分布以匹配抗生素分子

AI总结 提出kCGM方法,通过最小化生成与目标特征分布的最大均值差异(MMD)并加入KL正则化,在不牺牲有效性的前提下校准生成模型的特征分布,适用于多种生成模型。

详情
AI中文摘要

生成模型可以产生个体上合理的样本,但在关键特征分布上与目标集存在显著偏差。例如,在广泛的药物类化学空间上预训练的模型可能生成分子,其分子特征与感兴趣的治疗类别(如已知抗生素)不同。纠正这种分布校准错误具有挑战性:在目标集上直接微调可能导致过拟合,并且无法控制匹配哪些特征。为了填补这一空白,我们引入了核校准生成模型(kCGM)。kCGM使用无偏得分函数估计器最小化生成特征分布与目标特征分布之间的最大均值差异(MMD),并通过KL正则化保持与预训练模型的接近。在一个包含174种抗生素的目标集上,直接微调牺牲了化学有效性以匹配特征分布,而kCGM在提高有效性的同时改善了目标特征匹配。我们还在蛋白质和DNA生成任务中展示了kCGM,表明它可以使用仅特征级别的监督来适应自回归、连续空间扩散和离散扩散模型。代码可在https://this URL获取。

英文摘要

Generative models can produce individually plausible samples while deviating substantially from a target set in the distribution of key features. For example, a model pretrained on broad drug-like chemical space may generate molecules whose molecular features differ from those of a therapeutic class of interest, such as known antibiotics. Correcting such distributional miscalibration is challenging: direct finetuning on the target set can overfit and does not control which features are matched. To fill this gap, we introduce kernel Calibrating Generative Models (kCGM). kCGM minimizes a maximum mean discrepancy (MMD) between generated and target feature distributions using an unbiased score-function estimator, with KL regularization to remain close to the pretrained model. On a target set of 174 antibiotics, direct finetuning sacrifices chemical validity for feature-distribution matching, whereas kCGM improves target feature matching while increasing validity. We further demonstrate kCGM in protein and DNA generation tasks, showing it can adapt autoregressive, continuous-space diffusion, and discrete diffusion models using only feature-level supervision. Code is available at https://github.com/smithhenryd/cgm.