arXivDaily arXiv每日学术速递 周一至周五更新

科学与医疗

AI for Science

科学智能、蛋白质、分子、药物、材料、气象、物理和数学 AI。

今日/当前日期收录 310 信号源:cs.LG, q-bio, physics, cond-mat, math, stat.ML

1. 其他科学智能 7 篇

2606.19560 2026-06-19 cs.LG 新提交 80%

Understanding Key Features of Time Series Foundation Models from Epidemic Forecasting

从流行病预测理解时间序列基础模型的关键特征

Alireza Jafari, Judy Fox, Geoffrey C. Fox, Madhav Marathe, Aniruddha Adiga

发表机构 * Department of Computer Science, School of Engineering and Applied Science, University of Virginia(弗吉尼亚大学工程与应用科学学院计算机科学系) School of Data Science, University of Virginia(弗吉尼亚大学数据科学学院) Biocomplexity Institute, University of Virginia(弗吉尼亚大学生物复杂性研究所) Department of Electrical and Computer Engineering, School of Engineering and Applied Science, University of Virginia(弗吉尼亚大学工程与应用科学学院电气与计算机工程系)

专题命中 其他科学智能 :评估时间序列模型用于流行病预测,属于科学智能

AI总结 系统评估多种时间序列模型在流感预测中的表现,发现混合专家模型性能最优,预训练在长时域提升显著,而LLM方法效果较差。

Comments 15 pages, 2 figures, 9 tables

详情
AI中文摘要

季节性流感每年感染数百万人,并在美国造成大量发病和死亡,因此准确的短期预测成为核心公共卫生需求。可靠的流行病时间序列预测可以为疫苗接种时机、医院人员配备和资源分配提供信息,然而现代预测架构在传染病监测数据上的比较行为仍未得到充分表征。我们通过系统评估区域流感预测来填补这一空白,使用流感样疾病监测和流感相关住院时间序列,在时间泛化和空间泛化设置下进行1-4周提前预测。我们比较了经典神经网络架构、基于数值的Transformer模型、预训练时间序列基础模型和基于LLM的预测方法。在各项任务中,我们证明融合多个预训练预测器的混合专家模型实现了最强的整体性能,表明异质预训练表示提供了互补的预测信息。我们的结果进一步表明,基于数值的Transformer模型产生可靠的预测,而预训练在更长时域上提供最大增益,特别是当预训练领域与流感动力学机制一致时。相比之下,基于LLM的时间序列方法在此设置下表现不如数值预测器。最后,我们研究了住院信息作为辅助协变量和预训练源的作用。住院信号在特定设置中提供了互补的改进,并阐明了额外的监测流如何增强多时域预测的鲁棒性。这些发现为流感防范的模型选择、预训练策略和辅助信号使用提供了可操作的指导。

英文摘要

Seasonal influenza infects millions of people and causes substantial morbidity and mortality in the United States each year, making accurate short-term forecasting a core public-health need. Reliable forecasts of epidemic time series can inform vaccination timing, hospital staffing, and resource allocation, yet the comparative behavior of modern forecasting architectures on infectious-disease surveillance data remains insufficiently characterized. We address this gap through a systematic evaluation of regional influenza forecasting using influenza-like illness surveillance and influenza-associated hospitalization time series under both temporal and spatial generalization settings for 1-4-week-ahead prediction. We compare classical neural network architectures, numerical transformer-based models, pretrained time series foundation models, and LLM-based forecasting approaches. Across tasks, we demonstrate that a mixture-of-experts model that fuses multiple pretrained forecasters achieves the strongest overall performance, indicating that heterogeneous pretrained representations provide complementary predictive information. Our results further show that numerical transformer-based models produce reliable forecasts, while pretraining provides the largest gains at longer horizons, particularly when the pretraining domain is mechanistically aligned with influenza dynamics. In contrast, LLM-based time series methods underperform relative to numerical forecasters in this setting. Finally, we examine hospitalization information as both an auxiliary covariate and a pretraining source. Hospitalization signals provide complementary improvements in selected settings and clarify when additional surveillance streams enhance the robustness of multi-horizon forecasting. These findings provide actionable guidance on model selection, pretraining strategy, and auxiliary-signal use for influenza preparedness.

2606.19761 2026-06-19 cs.LO math.LO 新提交 80%

Finishing Oltean's Completeness Proof in Lean 4 for Hybrid Logic $L(\forall)$

在 Lean 4 中完成 Oltean 关于混合逻辑 $L(\forall)$ 的完备性证明

Lars Warren Ericson

专题命中 其他科学智能 :在Lean4中完成混合逻辑完备性证明

AI总结 本文在 Lean 4 中完成了混合逻辑 $L(\forall)$ 的机器检查完备性证明,通过结构新鲜性和存在引理 Henkin 构造两种工具解决了新鲜名称的生成问题。

Comments 147 pages, 5 figures

详情
AI中文摘要

我们给出了一个在 Lean 4 中机器检查的完备性定理,针对混合逻辑 $L(\forall)$:带有名义词、满足风格绑定器 $\forall$ 和盒子模态的命题模态逻辑。(基本混合逻辑(无绑定器)的机器检查完备性由 Asta Halkjær From 在 Isabelle/HOL 中开创。)我们基于 Alex Oltean 2023 年的 Lean 4 形式化工作,该工作机械化了语法、语义、希尔伯特风格证明系统和可靠性(遵循 Blackburn 的混合完备性(1998)),但留下了不完备的部分。完成它需要在两个结构不同的点上制造新鲜名称,我们的核心发现是它们需要两种不同的工具。(1)通过扩展的 Lindenbaum 构造构建的根可证最大一致集,每一步都需要一个对整个集合新鲜的名义词;正确的工具是结构新鲜性:扩展语言,使得通过构造保留无限的名义词供应。我们调查了设计空间(Oltean 在 $\mathbb{N}$ 内的奇偶编码、Bud Mishra 建议的不交和 $N \oplus \mathbb{N}$ 参数化,以及 From 的合成完备性框架)并解释了我们采用的编码。(2)一个最大一致集的可证 $\Diamond$-后继不能通过这种方式获得:其典范盒子归约可证地提及每个名义词,因此没有保留的名称是新鲜的。这里正确的工具是 Oltean 选择但未完成的:一个存在引理 Henkin 构造,通过一个新鲜状态变量从前驱的可证性中抽取每个见证;我们通过一个携带数据的见证累加器和一个紧致性论证完成了它。定理 $\Gamma \models \varphi \to \Gamma \vdash \varphi$ 被完全形式化:该开发是无 sorry 的,且 #print axioms 仅报告 propext、this http URL 和 this http URL。我们将开发移植到 Lean v4.30.0 / mathlib v4.30.0。

英文摘要

We present a machine-checked completeness theorem, in Lean 4, for the hybrid logic $L(\forall)$: propositional modal logic with nominals, the satisfaction-style binder $\forall$, and the box modality. (Machine-checked completeness for basic hybrid logic, without binders, was pioneered by Asta Halkjær From in Isabelle/HOL.) We build on Alex Oltean's 2023 Lean 4 formalization, which mechanized the syntax, semantics, Hilbert-style proof system, and soundness following Blackburn's Hybrid Completeness (1998), but left completeness unfinished. Finishing it requires manufacturing fresh names at two structurally different points, and our central finding is that they call for two different tools. (1) The root witnessed maximal consistent set, built by an extended Lindenbaum construction, needs at each step a nominal fresh for the whole set; the right tool is structural freshness: extend the language so an infinite supply of nominals is reserved by construction. We survey the design space (Oltean's odd/even encoding inside $\mathbb{N}$, the disjoint-sum $N \oplus \mathbb{N}$ parameterization suggested by Bud Mishra, and From's synthetic-completeness frameworks) and explain the encoding we adopt. (2) The witnessed $\Diamond$-successor of a maximal consistent set cannot be obtained this way: its canonical box-reduct provably mentions every nominal, so no reserved name is fresh. Here the right tool is one Oltean chose but left incomplete: an existence-lemma Henkin construction drawing each witness from the predecessor's witnessedness through a fresh state variable; we complete it with a data-carrying witness accumulator and a compactness argument. The theorem $Γ\models φ\to Γ\vdash φ$ is fully formalized: the development is sorry-free, and #print axioms reports only propext, Classical.choice, and Quot.sound. We port the development to Lean v4.30.0 / mathlib v4.30.0.

2606.19405 2026-06-19 q-bio.QM math.DS q-bio.PE 新提交 80%

Multi-type branching inference on contact trees with application to COVID-19

接触树上的多类型分支推断及其在COVID-19中的应用

Augustine Okolie, Johannes Müller, Eno Akarawakc, Isaac Ajiboye

专题命中 其他科学智能 :提出接触树上的多类型分支推断方法

AI总结 提出一种直接作用于接触树上传播树的似然框架,通过多类型分支过程考虑接触度异质性,从部分解析的传播树中推断流行病学参数,并在COVID-19接触追踪数据中验证。

Comments 26 pages, 8 Figures

详情
AI中文摘要

从传播树推断流行病学参数对于理解传染病动态至关重要。现有的基于树的似然方法,包括最初应用于系统动力学环境中的多类型出生-死亡模型,提供了强大的工具,但大多数假设均匀混合,很少捕捉当个体感染更多接触者时传播潜力的变化。在这项工作中,我们开发了一个直接作用于传播树的似然框架,其中节点是个体,边是报告的传播事件,不涉及序列数据。我们推导了一个在有根接触树上的随机SIR过程的似然,其中每个感染个体由有效接触总数和已感染的下游接触数来刻画。我们得到了一个分支完全未被观察到的概率以及它产生一个处于给定状态的观察(采样)末端的概率密度的闭式常微分方程。对于已知末端状态的有根接触树,可以评估得到的似然,并且我们通过将内部分支时间视为潜在变量,将其扩展到部分解析的树。在模拟爆发上的验证确认了准确的参数恢复和良好校准的不确定性。应用于印度卡纳塔克邦的经验COVID-19接触追踪数据,展示了该框架在实际流行病学环境中的实用性。通过在多类型分支似然中纳入接触度异质性,我们的工作为从完全或部分解析的传播树推断传播动态和接触结构提供了一个原则性的基线,补充而非依赖于基于序列的系统动力学推断。

英文摘要

Inferring epidemiological parameters from transmission trees is essential for understanding infectious disease dynamics. Existing tree-based likelihood methods, including the multi-type birth-death models originally applied in phylodynamic settings, provide powerful tools, but most assume homogeneous mixing and rarely capture how transmission potential changes as an individual infects more of their contacts. In this work, we develop a likelihood framework that operates directly on transmission trees, in which nodes are individuals and edges are reported transmission events, with no sequence data involved. We derive a likelihood for a stochastic SIR process on a rooted contact tree in which each infected individual is characterised by the total number of effective contacts, and the number of already infected downstream contacts. We obtain closed-form ordinary differential equations for the probability that a clade goes entirely unobserved and for the probability density that it produces an observed (sampled) tip in a given state. The resulting likelihood can be evaluated for a rooted contact tree with known tip states, and we extend it to partially resolved trees by treating internal branching times as latent variables. Validation on simulated outbreaks confirms accurate parameter recovery and well calibrated uncertainty. Application to empirical COVID-19 contact-tracing data from Karnataka, India, demonstrates the framework's utility for real epidemiological settings. By incorporating contact-degree heterogeneity in a multi-type branching likelihood, our work provides a principled baseline for inferring both transmission dynamics and contact structure from fully or partially resolved transmission trees, complementing rather than relying on sequence-based phylodynamic inference

2606.20534 2026-06-19 math.OC 新提交 80%

On Second-Order Methods for Bilevel Optimization

关于双层优化的二阶方法

Jiawen Bi, Jiaxiang Li, Mingyi Hong, Shuzhong Zhang

专题命中 其他科学智能 :提出双层优化二阶方法,达最优复杂度

AI总结 本文针对双层优化问题,提出了一种单循环三次正则牛顿算法,在非凸上层和强凸下层设置下,实现了最优的O(ε^{-1.5})总预言复杂度,首次达到二阶驻点的最优收敛率。

详情
AI中文摘要

双层优化是现代机器学习和工程设计不可或缺的建模工具。然而,在双层优化中寻找二阶驻点的理论和实践仍然很大程度上未解决。即使对于具有强凸下层问题的双层优化,其诱导的超函数通常是非凸的。尽管三次正则牛顿方法(CRN)在单层优化中实现了最优的$\mathcal{O}(\varepsilon^{-1.5})$ SOSP(二阶驻点)率,但如何控制将二阶方法应用于双层问题时超梯度和超Hessian计算的精度,以使整个过程高效,仍不清楚。在本文中,我们着手回答这个问题。特别地,我们首先制定了一个双循环CRN基线,该基线实现了最优的外层率,但需要重复的下层求解。接下来,我们提出了一种单循环三次正则牛顿算法,该算法将一个下层梯度步与一个用于超梯度的牛顿步相结合,并证明了总体确定性的$\mathcal{O}(\varepsilon^{-1.5})$总预言复杂度,这是最优的。此外,我们说明了一些直观简单的修改可能无法维持收敛结果。据我们所知,这是第一个用于无约束NCSC(非凸上层和强凸下层)双层优化设置的确定性单循环方法,该方法实现了寻找超函数$\varepsilon$-SOSP的$\mathcal{O}(\varepsilon^{-1.5})$最优收敛率。

英文摘要

Bilevel optimization is an indispensable modeling tool for modern machine learning and engineering design. However, the theory and practice for finding second order stationary points in the context of bilevel optimization still remain largely unsettled. Even for bilevel optimization with strongly convex lower-level problem, the hyperfunction it induces is in general nonconvex. Although the Cubic Regularized Newton methods (CRN) famously achieve the optimal $\mathcal{O}(\varepsilon^{-1.5})$ SOSP (second-order stationary point) rate in single-level optimization, it is unclear how to control the accuracy of the hypergradient and hyper-Hessian computations in the context of applying the second-order methods to bilevel problems in order for the overall process to be efficient. In this paper, we set out to answer this question. In particular, we first formulate a double loop CRN baseline that achieves the optimal outer rate but requires repeated lower level solves. Next, we propose a single loop cubic regularized Newton algorithm that combines one lower-level gradient step with one Newton step for the hypergradient, and prove an overall deterministic $\mathcal{O}(\varepsilon^{-1.5})$ total oracle complexity, which is optimal. In addition, we illustrate that some intuitively simple modifications of our method may fail to hold up the convergence result. To the best of our knowledge, this is the first deterministic single loop method for unconstrained NCSC (non-convex upper-level and strongly convex lower-level) bilevel optimization setting that achieves the $\mathcal{O}(\varepsilon^{-1.5})$ optimal convergence rate for finding an $\varepsilon$-SOSP of the hyperfunction.

2606.20329 2026-06-19 cs.LG physics.geo-ph 新提交 80%

Constrained hybrid modelling to predict microbial dynamics and organic matter turnover in soil systems

约束混合建模预测土壤系统中微生物动态与有机质周转

Paul Collart, Juergen Gall, Andrea Schnepf, Holger Pagel, Lars Doorenbos

发表机构 * Agrosphere (IBG-3), Forschungszentrum Jülich GmbH(农业圈(IBG-3),于利希研究中心) Institute of Crop Science and Resource Conservation, University of Bonn(波恩大学作物科学与资源保护研究所) Institute of Computer Science, University of Bonn(波恩大学计算机科学研究所) Lamarr Institute for Machine Learning and Artificial Intelligence(拉马尔机器学习和人工智能研究所)

专题命中 其他科学智能 :土壤微生物建模,环境科学机器学习

AI总结 提出首个混合建模框架,利用神经网络从宏基因组推断功能性状预测过程模型参数,并整合生态理论约束,有效预测微生物动态和有机质周转。

Comments Accepted at ICML '26

详情
AI中文摘要

土壤微生物控制有机质循环,并在很大程度上决定土壤系统如何应对和缓解气候变化及环境威胁。因此,在基于过程的土壤模型中表示微生物动态对于预测土壤碳循环至关重要,尽管从数据中获取信息极具挑战性。改进参数化的一个有前景的方法是整合基因组数据,然而建模基因组与微生物驱动过程之间复杂且未知的关系是一个未解决的问题。在这项工作中,我们提出了第一个混合建模框架,用于从基于DNA测序数据的宏基因组推断功能性状中推导基于过程的土壤有机质周转模型的生物动力学参数值。我们的模型通过神经网络从基因组性状数据预测过程模型的生物动力学参数,并整合来自生态理论和文献的约束,以确保即使是非观测状态变量也能实现逼真的行为。我们在不同复杂度的合成基因组性状数据集和真实数据上评估了我们的方法,结果表明,我们的方法在多个基线上提高了性能,并有效学习了过程模型中不可测量组分的动态,即使是在小训练数据集上也是如此。

英文摘要

Soil microorganisms control organic matter cycling and largely determine how soil systems can cope with and mitigate climate change and environmental threats. Representing microbial dynamics in process-based soil models is therefore critical to predict carbon cycling in soils, albeit highly challenging to inform from data. One promising approach to improve their parametrisation is the integration of genomic data, yet modelling the complex and unknown relationship between genomes and the processes the microbes are driving is an unsolved problem. In this work, we present the first hybrid modeling framework for deriving biokinetic parameter values of a process-based soil organic matter turnover model from metagenome-inferred functional traits based on DNA sequencing data. Our model predicts biokinetic parameters of the process-based model from genomic trait data with a neural network and integrates constraints from ecological theory and literature to ensure realistic behavior, even of non-observed state variables. We evaluate our method on synthetic genomic trait datasets of varying complexity and on real data, showing that our approach improves performance over multiple baselines and learns the dynamics of unmeasurable components of the process-based model effectively, even for small training datasets.

2606.20145 2026-06-19 q-fin.ST cond-mat.stat-mech physics.data-an q-fin.MF q-fin.RM 新提交 80%

Trends, Volatility, Correlations, and Critical Phenomena in Financial Markets

金融市场中的趋势、波动率、相关性和临界现象

Sara A. Safari, Christoph Schmidhuber

专题命中 其他科学智能 :金融市场趋势与波动率预测,属于经济物理

AI总结 基于当前市场趋势预测未来波动率和相关性,发现趋势强度与波动率、相关性呈二次关系,改进风险预测并支持临界点晶格气体模型。

Comments 31 pages, 9 figures

详情
AI中文摘要

我们基于金融市场的当前趋势预测未来的波动率和相关性。这补充了先前的工作,该工作通过当前趋势强度的三次多项式来建模未来预期收益。经验上,我们观察到在强烈上升或下降趋势期间,波动率和相关性往往逐日增加。这种效应在下降趋势中尤为显著。它可以通过当前趋势强度的二次多项式精确量化,这细化了波动率和相关性的常见均值回归模型。我们的结果通过考虑市场趋势改进了市场风险的预测。它们也支持最近一项将金融市场建模为接近其临界点的晶格气体的提议。

英文摘要

We forecast future volatilities and correlations of financial markets based on the current trends in these markets. This complements previous work that models future expected returns by a cubic polynomial of the current trend strength. Empirically, we observe that volatilities and correlations tend to increase day after day in times of strong up- or down-trends. This effect is particularly pronounced in down-trends. It can be accurately quantified by quadratic polynomials of today's trend strengths, which refine common mean-reversion models of volatilities and correlations. Our results improve the prediction of market risk by accounting for market trends. They also support a recent proposal to model financial markets by a lattice gas near its critical point.

2606.19860 2026-06-19 physics.comp-ph cond-mat.stat-mech physics.soc-ph 新提交 80%

The Heat Kernel Expansion: Curvature for Shock Detection in Higher-Order Financial Networks

热核展开:高阶金融网络中的曲率用于冲击检测

Mohammad Elsayed, Sara Najem

专题命中 其他科学智能 :热核展开检测金融网络冲击,属于经济物理

AI总结 本文通过热核展开系数定义曲率,用于检测高阶金融网络中的冲击,发现曲率比欧拉示性数和挠率更敏感地捕捉法律变化的影响。

详情
AI中文摘要

本研究跟踪了挪威金融网络在九年期间每月的变化。数据包括董事会成员及其与公司的关联,我们将其建模为单纯复形。在此框架中,董事表示为节点,公司表示为复形的面。为了表征后者,我们关注三个拓扑度量:通过贝蒂数计算的欧拉示性数、通过高阶拉普拉斯矩阵的简化行列式计算的挠率,以及高阶聚类系数。前两者未能捕捉到法律对代表权的影响,而我们的曲率概念则不同,它是通过热核在时间幂次上的级数展开系数计算的几何度量,这是本工作的主要贡献。特别地,欧拉示性数积分了曲率,因此局部信息丢失。随后,并非所有拓扑度量都能可靠地捕捉网络中的冲击。此外,生成树的数量可能在最低阶发生显著变化,但这些变化不一定反映在挠率中。相反,曲率的变化揭示了因立法导致的董事会连锁变化,并作为检测网络中冲击的敏感度量。曲率的拐点与外部强迫相关,最小值与冲击到达时间相关。在挠率的分量中也观察到尖锐转变,而在高阶聚类中观察到平滑变化。

英文摘要

This work follows the evolution of financial networks in Norway over a period of nine years at a monthly rate. The data consist of board directors and their affiliations to companies, which we model as simplicial complexes. In this framework, directors are represented as nodes and companies as faces of the complex. To characterize the latter, we focus on three topological measures: the Euler characteristic, computed through the Betti numbers, torsion computed through the reduced determinant of the higher-order Laplacians, and higher-order clustering coefficients. The first two fail to capture the effect of imposed law on representation, unlike our notion of curvature which is a geometrical measure computed from the coefficients of the series expansion of the heat kernel in powers of time, which is our major contribution in this work. In particular, the Euler characteristic integrates curvature, and thus local information is lost. Subsequently, not every topological measure can reliably capture shocks in networks. Further, the number of spanning trees may undergo significant changes at the lowest order, yet these changes need not be reflected in the torsion. Conversely, the change in the curvature revealed variation in the board interlock due to legislation, and serves as a sensitive measure for detecting shocks in networks. Inflection points in curvature are associated with external forcing, and minima with shock arrival times. Sharp transitions are also observed in the components of torsion, while smooth changes are observed in higher-order clustering.

2. AI制药 1 篇

2606.19496 2026-06-19 cs.LG 新提交 80%

Calibrating Generative Models to Feature Distributions with MMD Finetuning

使用MMD微调将生成模型校准到特征分布

Nathaniel L. Diamant, Brian L. Trippe

发表机构 * Stanford University(斯坦福大学)

专题命中 AI制药 :校准生成模型特征分布以匹配抗生素分子

AI总结 提出kCGM方法,通过最小化生成与目标特征分布的最大均值差异(MMD)并加入KL正则化,在不牺牲有效性的前提下校准生成模型的特征分布,适用于多种生成模型。

详情
AI中文摘要

生成模型可以产生个体上合理的样本,但在关键特征分布上与目标集存在显著偏差。例如,在广泛的药物类化学空间上预训练的模型可能生成分子,其分子特征与感兴趣的治疗类别(如已知抗生素)不同。纠正这种分布校准错误具有挑战性:在目标集上直接微调可能导致过拟合,并且无法控制匹配哪些特征。为了填补这一空白,我们引入了核校准生成模型(kCGM)。kCGM使用无偏得分函数估计器最小化生成特征分布与目标特征分布之间的最大均值差异(MMD),并通过KL正则化保持与预训练模型的接近。在一个包含174种抗生素的目标集上,直接微调牺牲了化学有效性以匹配特征分布,而kCGM在提高有效性的同时改善了目标特征匹配。我们还在蛋白质和DNA生成任务中展示了kCGM,表明它可以使用仅特征级别的监督来适应自回归、连续空间扩散和离散扩散模型。代码可在https://this URL获取。

英文摘要

Generative models can produce individually plausible samples while deviating substantially from a target set in the distribution of key features. For example, a model pretrained on broad drug-like chemical space may generate molecules whose molecular features differ from those of a therapeutic class of interest, such as known antibiotics. Correcting such distributional miscalibration is challenging: direct finetuning on the target set can overfit and does not control which features are matched. To fill this gap, we introduce kernel Calibrating Generative Models (kCGM). kCGM minimizes a maximum mean discrepancy (MMD) between generated and target feature distributions using an unbiased score-function estimator, with KL regularization to remain close to the pretrained model. On a target set of 174 antibiotics, direct finetuning sacrifices chemical validity for feature-distribution matching, whereas kCGM improves target feature matching while increasing validity. We further demonstrate kCGM in protein and DNA generation tasks, showing it can adapt autoregressive, continuous-space diffusion, and discrete diffusion models using only feature-level supervision. Code is available at https://github.com/smithhenryd/cgm.

3. 气象气候 3 篇

2606.19363 2026-06-19 cs.LG 新提交 80%

When to Trust, How to Distill: Multi-Foundation Model Guidance for Lightweight, Robust Scientific Time Series Forecasting

何时信任,如何蒸馏:面向轻量级鲁棒科学时间序列预测的多基础模型指导

Rupasree Dey, Abdul Matin, Nathan Orwick, Yao Zhang, Shrideep Pallickara, Sangmi Lee Pallickara

发表机构 * Colorado State University(科罗拉多州立大学)

专题命中 气象气候 :时间序列基础模型用于气象等领域预测。

AI总结 提出Guard框架,通过上下文路由器和不确定性门控温度机制,从多个分布偏移的基础模型中蒸馏知识,训练轻量级预测器,在气象、碳通量等四个领域降低RMSE。

Comments KDD 2026, paper decision: Accepted, track: AI for Science. total 12 pages including references and appendix

详情
AI中文摘要

时间序列基础模型(TSFMs)在物理科学中的部署受到一个关键权衡的阻碍:虽然这些模型编码了丰富、通用的时间动态,但当零样本应用于特定科学领域时,它们会遭受严重的分布错位,并且其计算成本阻碍了在边缘计算传感器网络中的部署。我们解决了一个基本挑战:如何从错位的基础模型(FM)中提取潜在的结构知识,以训练轻量级、专门的预测器?我们提出了用于蒸馏的门控不确定性感知路由(Guard),这是一个新颖的框架,将多教师蒸馏重新定义为实例级决策过程,具有两种自适应机制:(1)上下文路由器,根据局部输入统计动态选择最相关的教师,利用不同基础模型之间的互补性;(2)不确定性门控温度机制,充当“断路器”,当教师置信度与领域现实偏离时自动减弱蒸馏强度。我们在四个气候关键领域评估了我们提出的轻量级框架:气象学、生态系统碳通量、土壤湿度和能源电网。我们的方法相对于固定权重的多教师蒸馏基线显著降低了RMSE,成功地从预训练的FM(教师)中蒸馏知识,即使由于原始和目标数据域之间的分布偏移,它们表现出次优的零样本准确性。我们证明,这些领域错位的教师仍然可以作为关键的纠正者,在28.5%的最难实例上优于全局优越的FM。最终,这使得适用于资源受限边缘部署的高精度科学预测成为可能。代码可在https://this URL获取。

英文摘要

The deployment of Time-Series Foundation Models (TSFMs) in physical sciences is hindered by a critical trade-off: while these models encode rich, universal temporal dynamics, they suffer from severe distributional misalignment when applied zero-shot to specific scientific domains, and their computational cost prohibits deployment in edge-computing sensor networks. We address a fundamental challenge: How can we extract latent structural knowledge from misaligned foundation models (FM) to train lightweight, specialized forecasters? We propose Gated Uncertainty-Aware Routing for Distillation (Guard), a novel framework that reframes multiteacher distillation as an instance-wise decision process with two adaptive mechanisms: (1) a Contextual Router that dynamically selects the most relevant teacher based on local input statistics, exploiting complementarity across diverse foundation models; and (2) an Uncertainty-Gated Temperature mechanism that acts as a "circuit-breaker," automatically attenuating distillation strength when teacher confidence diverges from domain reality. We evaluate our proposed lightweight framework on four climate-critical domains: meteorology, ecosystem carbon flux, soil moisture, and energy grids. Our method significantly reduces RMSE relative to a fixed-weight multi-teacher distillation baseline, successfully distilling knowledge from pretrained FMs (teachers) even when they exhibit suboptimal zero-shot accuracy due to distribution shift between the original and target data domains. We demonstrate that these domain-misaligned teachers can still serve as critical correctives, outperforming the globally superior FMs on 28.5% of the hardest instances. Ultimately, this enables high-precision scientific forecasting suitable for resource-constrained edge deployment. Code is available at https://github.com/RupasreeDey/GUARD-KDD2026.

2606.20050 2026-06-19 physics.ao-ph physics.flu-dyn 新提交 80%

Enhanced Gulf Stream Path Variability Under Intensified Stratification

增强的层结下墨西哥湾流路径变率增强

Lennard Miller, Antoine Venaille, Stephane Popinet, Bruno Deremble

专题命中 气象气候 :海洋层结影响湾流路径,气候物理

AI总结 通过高分辨率海洋模型,发现上层海洋层结增强导致墨西哥湾流延伸体失去稳定性,从稳定东向路径转变为剧烈混沌弯曲,且这一转变独立于大西洋经向翻转环流和风强迫变化。

Comments 30 pages, 9 figures (including supplementary material)

详情
AI中文摘要

上层海洋层结增强是全球变暖不可避免的后果,并将强烈影响洋流结构。利用高分辨率海洋模型,我们表明层结增强导致墨西哥湾流延伸体失去相干性,其稳定的东向路径被剧烈、混沌的弯曲所取代。这种状态转变独立于大西洋经向翻转环流和表面风强迫的变化。在分辨中尺度涡的理想化和现实海洋模型中,层结增强下弯曲增强也被证明是一个稳健的特征,但在参数化涡的粗分辨率模型中未能捕捉。因此,所呈现的结果强调了在气候预测中改进海洋湍流表征的必要性。

英文摘要

Increased upper-ocean stratification is an unavoidable consequence of global warming and will strongly impact the structure of ocean currents. Using a high-resolution ocean model, we show that intensification of stratification leads to the loss of coherence of the Gulf Stream Extension, replacing its steady eastward path with vigorous, chaotic meanders. This regime shift persists independently of changes in the Atlantic Meridional Overturning Circulation and surface wind forcing. Enhanced meandering under intensified stratification also proves to be a robust feature across both idealized and realistic ocean models that resolve mesoscale eddies, but is not captured by coarse-resolution models that parameterize eddies. The presented findings therefore highlight the need for improved representations of oceanic turbulence in climate projections.

2606.19581 2026-06-19 physics.ao-ph 新提交 80%

A Land-Sea Contrast Pattern in Surface Temperature and Atmospheric Circulation Trends in Recent Decades

近几十年地表温度和大气环流趋势中的海陆对比模式

Benjamin O. Johnson, Maria Rugenstein

专题命中 气象气候 :海陆增温对比与环流趋势,属于气候科学

AI总结 研究发现陆地相对海洋的增暖主导了观测到的地表温度和大气环流趋势,包括太平洋盆地的负类IPO倾向,而气候模式低估了海陆增温比。

详情
AI中文摘要

观测到的气候趋势的空间模式仍然难以理解。本文认为,陆地相对于海洋的增暖塑造了观测到的地表温度和大气环流趋势,包括太平洋盆地的负类年代际太平洋振荡(IPO)倾向。观测和模拟的趋势显示,在相对于海洋增暖更快的陆地上,海平面气压总体下降,其空间模式类似于季节循环和理想化气候模式实验中对陆地加热的响应。使用历史强迫的耦合气候模式模拟低估了海陆增温比。只有在突然CO2四倍增的气候模式模拟的早期响应中,气候模式才能重现观测到的海陆增温比,在这种情况下,可以看到海洋表面高压增强和太平洋上负类IPO地表增暖模式与观测趋势相当。我们提出,许多气候变量中模拟与观测趋势之间的差异可能由气候模式低估海陆增温比来解释。确定这一差异的原因有可能约束未来气候变化的预测,因为导致气候模式低估海陆增温比差异的潜在机制将决定这个问题的持续性。

英文摘要

Spatial patterns in observed climate trends remain poorly understood. Here we argue that a warming of land relative to ocean has shaped observed surface temperature and atmospheric circulation trends, including the negative Inter-Decadal Pacific Oscillation (IPO)-like tendency across the Pacific basin. Observed and modeled trends display an overall decline in sea level pressure over the faster-warming land relative to ocean, with a spatial pattern that resembles the seasonal cycle and the response to land heating in idealized climate model experiments. Coupled climate model simulations with historical forcing underestimate the land-sea warming ratio. It is only in the early response of abrupt CO2 quadrupling climate model simulations that climate models are able to recreate the observed land-sea warming ratio, in which case a strengthening of oceanic surface highs and a negative IPO-like surface warming pattern over the Pacific comparable to observed trends are seen. We propose that discrepancies between modeled and observed trends in many climate variables may be explained by the underestimation of the land-sea warming ratio by climate models. Determining the cause of this discrepancy has the potential to constrain projections of future climate change as the underlying mechanism causing climate models to underestimate the land-sea warming ratio discrepancy will set the persistence of this problem.

4. 物理仿真 15 篇

2606.20496 2026-06-19 math.NA cs.DC cs.MS cs.NA 新提交 80%

CoarseSolvers for Exascale Solution of Poisson Problems

用于泊松问题百亿亿次求解的粗网格求解器

Thilina Ratnayaka, Paul Fischer, Luke Olson

专题命中 物理仿真 :提出泊松问题百亿亿次求解的粗网格求解器

AI总结 提出一种两层Schwarz方法替代代数多重网格(AMG)作为p-多重网格预条件子的粗网格求解器,通过结构化非嵌套粗空间实现无通信插值,在Summit/Frontier超算上验证了优于BoomerAMG的性能。

详情
AI中文摘要

我们提出一种两层Schwarz方法,作为代数多重网格(AMG)的替代方案,用于求解由不可压缩Navier-Stokes方程的谱/有限元离散产生的压力泊松方程的p-多重网格(pMG)预条件子的最后一层(粗网格)求解器。所提出的Schwarz方法包括原始pMG粗空间中的一个局部问题和一个全局粗问题。本文的主要贡献是为全局粗问题提出了一种新颖的、结构化的非嵌套粗空间。所提出的全局粗空间的结构化特性使得原始p-多重网格粗空间与全局粗问题之间的插值无需通信。通过在橡树岭领导计算设施的Summit/Frontier超算上使用高度可扩展的不可压缩Navier-Stokes求解器套件Nek5000/RS进行的一系列实验,我们展示了所提方法相比最先进的AMG求解器BoomerAMG的有效性。

英文摘要

WepresentatwolevelSchwarzmethodasanalternativetoAlgebraicMultigridmethod(AMG) used as the last level (coarse) solver of the p-multigrid pMG preconditioner for pressure Poission equation resulting from Spectral/Finite element descretization of incompressible Navier-Stokes eqaution. Proposed Schwarz method consits of a local problem in the original pMG coarse space and a global coarse problem. Main contribution of the paper is a novel, structured and a non-nested coarse space for the global coarse problem. Structured nature of the proposed global coarse space enable communication-free interpolation between the original p-multgrid coarse space and the global coarse problem. We demonstrate the effectiveness of the proposed method compared to the state of the art AMG solver BoomerAMG by a series of experiments performed using Nek5000/RS, a suite of highly scalable incompressible Navier-Stokes solvers, on Summit/Frontier supercomputers at Oak Ridge Leadership Computing Facility.

2606.20513 2026-06-19 quant-ph cs.IT math.IT 新提交 80%

Approximating optimal decoding of quantum LDPC codes with narrow frontiers

用窄前沿近似最优解码量子LDPC码

Anthony Leverrier, Rüdiger Urbanke

专题命中 物理仿真 :量子LDPC码解码器,属于量子信息科学

AI总结 提出Frontier解码器,一种剪枝动态规划解码器,通过保留窄评分前沿近似逻辑陪集后验质量,在表面码和颜色码上达到接近最优的阈值,并在电路级噪声下以极小的平均列表大小实现最先进性能。

Comments 15 pages, 9 figures Implementation available at https://github.com/aleverrier/frontier

详情
AI中文摘要

我们引入了Frontier解码器,一种用于稀疏量子解码问题的剪枝动态规划解码器。Frontier按选定顺序处理错误变量,合并具有相同残留综合征和逻辑标签的前缀,并通过仅保留窄评分前沿来近似逻辑陪集后验质量。如果没有剪枝,递归是精确的顺序推理,具有指数复杂度。在码容量设置中,解码器对于表面码和颜色码达到了接近最优的阈值。在电路级噪声模型中,它以非常小的平均保留列表大小实现了最先进的性能:对于粗码$[[144,12,12]]$,在物理错误率为$0.001$时,平均列表大小小于100。当列表大小恒定时,解码器具有线性复杂度,这表明了低延迟实现的可能性。

英文摘要

We introduce the Frontier decoder, a pruned dynamic-programming decoder for sparse quantum decoding problems. Frontier processes error variables in a chosen order, merges prefixes with the same residual syndrome and logical label, and approximates logical-coset posterior masses by retaining only a narrow scored frontier. Without pruning, the recursion is exact ordered inference with exponential complexity. In the code-capacity setting, the decoder reaches thresholds close to optimal for the surface code and the color code. In the circuit-level noise model, it achieves state-of-the-art performance with a very small average retained list size: less than 100 for the gross code $[[144,12,12]]$ at a physical error rate of $0.001$. When the list size is constant, the decoder has linear complexity, suggesting the possibility of low-latency implementations.

2606.20385 2026-06-19 quant-ph cs.NA math.NA 新提交 80%

Sparse Configuration Interaction for the Electronic Schrödinger Equation Revisited: Complete Basis Set Limit Complexity and Quantum-Encoding Impact

电子薛定谔方程的稀疏组态相互作用再探:完备基组极限复杂度与量子编码影响

Michael Griebel, Jan Hamaekers

专题命中 物理仿真 :电子薛定谔方程求解,量子化学计算

AI总结 本文重新审视电子薛定谔方程离散谱中本征函数的正则性结果,并研究其对逼近复杂度的影响,发现稀疏网格构造下收敛速率的主项与电子数无关,为经典和量子计算提供新编码优势。

详情
AI中文摘要

在本文中,我们重新审视了电子薛定谔方程离散谱中本征函数的正则性结果,并研究了它们对逼近复杂度的影响。特别地,对于完备基组极限的收敛性,可以证明主导代数指数中的维度灾难可以得到缓解。也就是说,对于一般的稀疏网格构造,关于自由度数目的收敛速率的主项与电子数无关。这些见解表明,对于电子薛定谔方程的经典数值求解器以及通过新的量子比特高效波函数编码的量子计算方法,都具有潜在的好处。

英文摘要

In this article we revisit regularity results for eigenfunctions in the discrete spectrum of the electronic Schrödinger equation and study their consequences for approximation complexity. In particular, for the convergence to the complete basis set limit, it can be shown that the curse of dimensionality in the leading algebraic exponent can be mitigated. That is, for general sparse grid constructions, the main term of the convergence rate with respect to the number of degrees of freedom is independent of the number of electrons. These insights indicate potential benefits for classical numerical solvers of the electronic Schrödinger equation and also for quantum-computing approaches through new qubit-efficient wavefunction encodings.

2606.19947 2026-06-19 quant-ph cs.LG 新提交 80%

QMaxCal: Path-Space Regularization for Open Quantum Control via Girsanov's Theorem

QMaxCal: 基于 Girsanov 定理的开环量子控制路径空间正则化

Merijn Moody, Zier Mensch, Miranda C. N. Cheng, Peter G. Bolhuis, Max Welling

发表机构 * Institute of Physics, University of Amsterdam, Netherlands(阿姆斯特丹大学物理研究所) Van 't Hoff Institute for Molecular Sciences, University of Amsterdam, Netherlands(阿姆斯特丹大学范·霍夫分子科学研究所) Dutch Institute for Emergent Phenomena, University of Amsterdam, Netherlands(阿姆斯特丹大学新兴现象研究所) Institute for Mathematics, Academia Sinica, Taiwan(台湾“中华学术院”数学研究所) Korteweg-de Vries Institute for Mathematics, University of Amsterdam, Netherlands(阿姆斯特丹大学柯特韦斯数学研究所) Amsterdam Machine Learning Lab, University of Amsterdam, Netherlands(阿姆斯特丹大学机器学习实验室) Department of Physics, National Taiwan University, Taiwan(台湾国立台湾大学物理系)

专题命中 物理仿真 :开放量子系统控制,量子信息与机器学习

AI总结 针对开放量子系统退相干问题,利用 Girsanov 定理推导 KL 散度的可微估计器,提出两种正则化项以最小化退相干影响,在多种量子系统中优于未正则化的梯度方法和强化学习基线。

Comments 26 pages, 6 figures. ICML 2026 AI4Physics Workshop

详情
AI中文摘要

在存在退相干的条件下,可靠的量子控制需要能够对抗环境噪声对受控动力学影响的策略。连续监测下的开放量子系统产生经典测量记录,其漂移依赖于系统所经历的噪声;共享相同退相干通道的两个演化的记录仅在此漂移上有所不同,因此 Girsanov 定理给出了它们轨迹分布之间 KL 散度的闭式、可微估计器。我们用两个物理动机的参考度量实例化该估计器,得到两个正则化项,它们都将系统驱动到退相干效应最小的状态:Wiener KL (KL_W),在噪声模型的某些条件下经验上更有效;以及漂移方差正则化项 (R_DV),适用于所有噪声模型。两者在性质上不同于现有的控制通量或平滑性惩罚:它们惩罚控制对退相干通道的可观测后果,而非控制幅度本身。这些正则化项在一系列开放量子系统中优于未正则化的基于梯度和强化学习的基线——包括单量子比特和多量子比特基准测试,以及一个校准到已发表的 IBM Kingston 处理器快照的多量子比特链——在多个评估维度上:最终态保真度、对假设噪声模型失配的鲁棒性(在训练噪声下增益从 +17 个百分点增长到 2.5 倍噪声失配下的 +27 个百分点),以及禁止态的占据。正则化项将不保真度降低高达 50%,在校准的 IBM Kingston 链上获得约 16% 的增益。

英文摘要

Reliable quantum control in the presence of decoherence requires policies that combat the effect of environmental noise on the controlled dynamics. Open quantum systems under continuous monitoring generate classical measurement records whose drift depends on the noise experienced by the system; the records of two evolutions sharing the same decoherence channels differ only in this drift, so Girsanov's theorem yields a closed-form, differentiable estimator of the KL divergence between their trajectory distributions. We instantiate this estimator with two physically motivated reference measures, yielding two regularizers that both drive the system toward states where the effects of decoherence are minimal: the Wiener KL (KL_W), which is empirically more effective under certain conditions on the noise model, and the drift-variance regularizer (R_DV), which works for all noise models. Both are qualitatively distinct from existing penalties on control fluence or smoothness: they penalize the observable consequences of control on the decoherence channels rather than the control amplitude itself. The regularizers outperform unregularized gradient-based and reinforcement-learning baselines across a range of open quantum systems -- including single- and multi-qubit benchmarks and a multi-qubit chain calibrated to a published snapshot of the IBM Kingston processor -- along several axes of evaluation: final-state fidelity, robustness to mismatch in the assumed noise model (gains grow from +17 pp at training noise to +27 pp under 2.5x noise mismatch), and occupation of forbidden states. The regularizers reduce infidelity by up to 50%, with ~16% gains on the calibrated IBM Kingston chain.

2606.20467 2026-06-19 cs.LG cs.NA math.NA physics.comp-ph 新提交 80%

Agentic Symbolic Search: Characterizing PDEs Beyond Hand-crafted Expressions, Meshes, and Neural Networks

智能符号搜索:超越手工表达式、网格和神经网络的PDE特征化

Zongmin Yu, Liu Yang

发表机构 * National University of Singapore(新加坡国立大学)

专题命中 物理仿真 :PDE符号搜索,科学机器学习

AI总结 提出ASYS框架,通过智能体将PDE理论转化为可微分符号程序,结合进化搜索和梯度优化自动发现解析形式或近似,在多个问题中生成可解释表示。

详情
AI中文摘要

数学家通过数学结构而非计算值表来理解PDE解。历史上,这需要针对每个问题单独进行数学分析。数值模拟和神经网络都不能直接产生这些结构。我们提出智能符号搜索(ASYS),一种先验引导框架,其中智能体将PDE理论、公共问题约束和累积搜索经验转化为可测试的可微分符号程序。数学形式在进化搜索下被精炼,而其连续参数通过基于梯度的优化拟合。这使得搜索成为归纳偏置注入的自动化形式,而非盲目的符号回归。对于已知解析形式的问题,ASYS自然恢复这些形式;对于其他问题,ASYS构建解析近似,可引导数学家进行进一步分析。在我们的实验中,跨越五个问题,包括有界动力学、有限时间爆破和自由边界聚焦,ASYS产生了可解释表示,包括Allen-Cahn 2D动力学的几何界面公式和Keller-Segel趋化爆破的九参数收缩律,这些场景中先前没有闭式描述。ASYS展示了表征PDE解的新范式的可能性,超越了手工解析解、基于网格的数值解和神经网络近似。

英文摘要

Mathematicians understand a PDE solution through mathematical structures rather than tables of computed values. Historically, this has been the product of mathematical analysis, carried out by hand for each problem individually. Neither numerical simulation nor neural networks produce those structures directly. We propose Agentic Symbolic Search (ASYS), a prior-guided framework in which an agent translates PDE theory, public problem constraints, and accumulated search experience into testable differentiable symbolic programs. The mathematical forms are refined under evolutionary search, while their continuous parameters are fit by gradient-based optimization. This makes the search an automated form of inductive-bias injection rather than blind symbolic regression. For problems with known analytical forms, ASYS recovers these forms naturally; for other problems, ASYS constructs analytical approximations which can guide mathematicians toward further analysis. In our experiments, across five problems spanning bounded dynamics, finite-time blow-up, and free-boundary focusing, ASYS produces interpretable representations, including a geometric interface formula for Allen-Cahn 2D dynamics and a nine-parameter contraction law for Keller-Segel chemotactic blow-up, in settings where no closed-form description was previously available. ASYS shows the possibility of a new paradigm for characterizing PDE solutions, beyond handcrafted analytical solutions, mesh-based numerical solutions, and neural network approximations.

2606.20313 2026-06-19 quant-ph physics.chem-ph 新提交 80%

Entanglement structure of the dynamical phases in the sub-Ohmic spin-boson model

亚欧姆自旋-玻色子模型中动力学相的纠缠结构

Cunxi Gong, Zirui Sheng, Weitang Li

专题命中 物理仿真 :自旋-玻色子模型纠缠结构,量子多体物理

AI总结 利用树张量网络态方法研究亚欧姆自旋-玻色子模型的纠缠结构,发现自旋纠缠熵的稳定平台可构建标量熵景观,其脊线在参数空间中与基于布居的相边界部分一致但未再现双分支结构。

详情
AI中文摘要

亚欧姆自旋-玻色子模型在其自旋布居动力学中表现出三种不同的动力学区域,分别归类为相干、非相干和伪相干。这些区域是否对应不同的自旋-浴纠缠结构仍是一个开放问题。本文利用带有投影分裂时间演化的树张量网络态(TTN-TDVP-PS),在亚欧姆$(s, \alpha)$平面上扫描一个广泛的网格。我们发现自旋纠缠熵$S_\mathrm{spin}(t)$在比极化弛豫更短的时间尺度上达到一个稳定平台,从而能够根据稳定值$S_\mathrm{stable}$构建一个稳定的熵景观。在这个标量熵景观中,熵脊在小$s$处大致遵循基于布居的相边界,但在大$s$处没有再现双分支结构。脊线在非相干区域内保持单值,而不是分别追踪两个基于布居的转变。布洛赫球表示为这种行为提供了几何解释。熵平台对应于轨迹稳定在恒定半径的壳层上,而脊线标志着最小稳定布洛赫半径的参数。模式分辨的浴纠缠表明,低频模式主导了环境熵的尺度,并且相干动力学增强了超出直接自旋-模式关联的浴模式关联。这些结果确立了稳定自旋纠缠熵作为一个物理上有信息的可观测量,补充了基于布居的耗散量子动力学分类。

英文摘要

The sub-Ohmic spin-boson model exhibits three distinct dynamical regimes in its spin population dynamics, classified as coherent, incoherent, and pseudo-coherent. Whether these regimes correspond to distinct spin-bath entanglement structures remains an open question. Here we address this using tree tensor network states with projector-splitting time evolution (TTN-TDVP-PS), scanning a broad grid in the sub-Ohmic $(s, α)$ plane. We find that the spin entanglement entropy $S_\mathrm{spin}(t)$ reaches a stationary plateau on a timescale shorter than the polarization relaxation, enabling construction of a stationary entropy landscape from the stationary value $S_\mathrm{stable}$. Within this scalar entropy landscape, the entropy ridge broadly follows the population-based phase boundary at small $s$, but does not reproduce the two-branch structure at large $s$. The ridge remains single-valued within the incoherent region rather than separately tracking both population-based transitions. The Bloch-sphere representation provides a geometric interpretation of this behavior. The entropy plateau corresponds to trajectories settling onto constant-radius shells, with the ridge marking the parameters of smallest stationary Bloch radius. Mode-resolved bath entanglement shows that low-frequency modes dominate the environmental entropy scale and that coherent dynamics enhance bath-mode correlations beyond direct spin--mode correlations. These results establish the stationary spin entanglement entropy as a physically informative observable that complements population-based classifications of dissipative quantum dynamics.

2606.20160 2026-06-19 quant-ph physics.comp-ph 新提交 80%

Multi-objective design of photon blockade for bright single-photon sources

用于明亮单光子源的光子阻塞多目标设计

Sunkyu Yu, Xianji Piao, Namkyoo Park

专题命中 物理仿真 :单光子源优化设计,量子光学

AI总结 提出一种基于Liouville空间伴随公式和雅可比更新的计算框架,结合模拟退火,实现光子阻塞单光子源的多目标优化,在宽参数空间内以近60%成功率达到g2(0)<0.1和理论亮度上限。

详情
AI中文摘要

高质量单光子源,通过可饱和发射体、光子阻塞或预示对生成实现,是光子量子平台不可或缺的构建模块。尽管这些机制通过通常由分析模型捕获的不同原理抑制多光子发射,但其实际实现受到纯度、亮度和不可区分性等相互冲突要求的限制,这些要求必须在高维设计空间中平衡。在这里,我们提出了一个用于优化单光子源竞争指标的计算框架。基于Liouville空间伴随公式,该公式有效评估马尔可夫开放量子系统中的多个目标,我们开发了基于雅可比矩阵的更新,确保多目标成本的一阶单调减少。通过结合模拟退火以逃离梯度消失平台,我们的框架在没有任何分析指导的情况下,在宽参数空间内实现了近60%的光子阻塞设计成功率,其中g2(0)小于0.1且亮度达到理论界限。该框架为开放量子系统的多目标设计提供了通用方案。

英文摘要

High-quality single-photon sources, realized through saturable emitters, photon blockade, or heralded pair generation, are indispensable building blocks for photonic quantum platforms. Although these mechanisms suppress multiphoton emission through distinct principles typically captured by analytical models, their practical implementation is constrained by conflicting requirements for purity, brightness, and indistinguishability, which must be balanced within high-dimensional design landscapes. Here, we propose a computational framework for optimizing competing metrics of single-photon sources. Building on a Liouville-space adjoint formulation that efficiently evaluates multiple objectives in Markovian open quantum systems, we develop a Jacobian-based update, which ensures first-order monotonic reduction of multi-objective costs. By incorporating simulated annealing to escape gradient-vanishing plateaus, our framework achieves a design success rate of nearly 60 % for photon blockade with g2(0) smaller than 0.1 and theoretically bounded brightness across a broad parameter space, without any analytical guidance. This framework provides a general recipe for multi-objective design of open quantum systems.

2606.19853 2026-06-19 cs.LG physics.comp-ph 新提交 80%

Physics-Informed Neural Network with Squeeze-Excitation-like Attention

带有挤压-激励式注意力的物理信息神经网络

Yun-Fei Song, Long-Gang Pang, Fu-Peng Li, Jun-Jie Zhang

发表机构 * Key Laboratory of Quark and Lepton Physics (MOE) & Institute of Particle Physics, Central China Normal University(华中师范大学夸克与轻子物理教育部重点实验室及粒子物理研究所) Artificial Intelligence and Computational Physics Research Center, Central China Normal University(华中师范大学人工智能与计算物理研究中心) Key Laboratory of Nuclear Physics and Ion-beam Application (MOE) & Institute of Modern Physics, Fudan University(复旦大学核物理与离子束应用教育部重点实验室及现代物理研究所) Shanghai Research Center for Theoretical Nuclear Physics, NSFC and Fudan University(国家自然科学基金委员会-复旦大学上海理论核物理研究中心) Northwest Institute of Nuclear Technology(西北核技术研究所)

专题命中 物理仿真 :物理信息神经网络改进,科学机器学习

AI总结 提出SEA-PINN架构,通过挤压-激励式注意力机制动态调整神经元重要性,实现稳定初始化,在20个基准问题中17个方差极小,无需傅里叶嵌入或周期激活即可达到与TSA-PINN相当的精度,并可作为轻量插件提升其他PINN性能。

Comments 15 pages, 6 figures

详情
AI中文摘要

我们引入了SEA-PINN,一种新颖的架构,它将类似挤压-激励的注意力机制融入物理信息神经网络,以动态重新校准各层神经元的重要性。SEA-PINN的一个关键特性是其高度稳定的初始化。在20个基准问题中的17个上,SEA-PINN表现出几乎可忽略的方差和显著降低的初始损失,为优化建立了一个准确定且有利的起点。值得注意的是,在没有采用傅里叶特征嵌入或周期激活函数的情况下,SEA-PINN与TSA-PINN(一种通过正弦激活中的可学习频率专门为高频问题设计的模型)相比,达到了具有竞争力的精度(在高频案例7上,相对于FNN-PINN的改进分别为83%和90%)。此外,将SEA-PINN集成到TSA-PINN中使性能提升了42.49%。这些结果强调了SEA-PINN作为一种轻量级插件模块,能够增强非线性表示能力,促进更稳健和高效的收敛,并提高物理信息学习的整体可靠性。

英文摘要

We introduce SEA-PINN, a novel architecture that incorporates a Squeeze-Excitation-like attention mechanism into physics-informed neural networks to dynamically recalibrate the importance of neurons across layers. A key feature of SEA-PINN is its highly stable initialization. On 17 out of 20 benchmark problems, SEA-PINN exhibit nearly negligible variance and significantly reduced initial loss, establishing a quasi-deterministic and favorable starting point for optimization. Notably, without employing Fourier feature embeddings or periodic activation functions, SEA-PINN attained competitive accuracy (83\% vs. 90\% improvement relative to FNN-PINN on the high-frequency case 7) as compared with TSA-PINN-a model specifically engineered for high-frequency problems via learnable frequencies in sinusoidal activations. Furthermore, integrating SEA-PINN into TSA-PINN boosted performance by 42.49\%. These results underscore SEA-PINN as a lightweight plug-in module that enhances nonlinear representation power, promotes more robust and efficient convergence, and strengthens the overall reliability of physics-informed learning.

2606.19649 2026-06-19 quant-ph physics.ins-det 新提交 80%

Optimized Quantum States for Sensing in the Presence of Loss and Phase Noise

用于存在损耗和相位噪声的传感的优化量子态

Shruti Maliakal, Zachary Mann, Christopher Wipf, Rana X Adhikari, Su Direkci, Yanbei Chen

专题命中 物理仿真 :量子传感优化,量子信息科学

AI总结 通过数值优化量子Fisher信息,在损耗和相位噪声下发现非高斯态(如Fock态、立方相位态和离散旋转对称态)优于任何高斯态,在平均光子数5、损耗5%、相位噪声200 mrad时非高斯优势达2.2 dB。

Comments The build is 8 pages, 5 figures (3 in the body, 2 in the End Matter)

详情
AI中文摘要

压缩真空使引力波探测器和其他量子传感器能够超越标准量子极限,并且在仅存在损耗的体制中是最优的;相位噪声破坏了这种最优性。通过数值优化跨损耗和相位噪声景观的量子Fisher信息,我们识别出优于任何高斯态的非高斯态。这些态分为三类:Fock类、立方相位类以及具有离散旋转对称性的态。将输入态的平均光子数限制为$\bar{n}=5$,在$1-\eta = 5\\%$的光子损耗和200 mrad的相位噪声下,非高斯优势达到2.2 dB。此外,我们观察到即使测量策略是零差探测,非高斯优势仍然可以保持。

英文摘要

Squeezed vacuum lets gravitational-wave detectors and other quantum sensors surpass the standard quantum limit, and is optimal in the loss-limited regime; phase noise breaks this optimality. Numerically optimizing the quantum Fisher information across the loss and phase-noise landscape, we identify non-Gaussian states that outperform any Gaussian state. These fall into three classes: Fock-like, cubic-phase-like, and states with discrete rotational symmetry. Limiting the average number of photons in the input state to $\bar{n}=5$, with $1-η= 5\%$ photon loss and 200 mrad phase noise, the non-Gaussian advantage reaches up to 2.2 dB. Furthermore, we observe that the non-Gaussian advantage can persist even when the measurement strategy is homodyne detection.

2606.20511 2026-06-19 physics.flu-dyn 新提交 80%

State estimation of Rayleigh-Bénard convection with reduced-order models

基于降阶模型的瑞利-贝纳德对流状态估计

Enrique Flores-Montoya, André F. C. da Silva, André V. G. Cavalieri

专题命中 物理仿真 :流体对流状态估计,物理仿真应用

AI总结 结合稳定Galerkin降阶模型与扩展卡尔曼滤波,实现二维RB对流状态估计,在周期、准周期和混沌状态下速度与温度重建误差分别低于14%和9%,并开发了贪心传感器布置策略。

详情
AI中文摘要

在本工作中,我们开发了一个用于二维瑞利-贝纳德(RB)对流的状态估计框架,该框架将稳定的Galerkin降阶模型(ROM)与扩展卡尔曼滤波(EKF)相结合。ROM由线性化Boussinesq方程的可控性模态构建,为滤波预测步骤提供非线性动力学模型。直接数值模拟(DNS)用于生成用于数据同化的合成测量值。我们评估了滤波器在周期、准周期和混沌状态下的性能,表明滤波器能够高保真地跟踪最能量模态,并实现速度时间平均重建误差低于$14\%$,温度低于$9\%$。我们将基于ROM的EKF应用于混合模拟场景,其中系统状态从粗粒度的PIV类速度测量中同化。结果表明,仅速度观测就足以重建状态,包括温度场。最后,我们利用卡尔曼增益矩阵开发了一种贪心传感器布置策略,该策略逐步移除信息量最少的传感器。该算法揭示了传感器类型之间的清晰层次结构,可用于推导骨架观测配置。它还为哪些测量变量和空间位置对状态校正最具信息量提供了指导。本框架具有通用性,可应用于其他二次Galerkin ROM进行状态估计。

英文摘要

In this work, we develop a state estimation framework for two-dimensional Rayleigh-Bénard (RB) convection that combines a stable Galerkin reduced-order model (ROM) with an extended Kalman filter (EKF). The ROM, constructed from controllability modes of the linearised Boussinesq equations, provides the nonlinear dynamical model for the filter prediction step. Direct numerical simulations (DNS) are used to generate synthetic measurements for data assimilation. We assess filter performance across periodic, quasiperiodic, and chaotic regimes, demonstrating that the filter tracks the most energetic modes with high fidelity and achieves time-averaged reconstruction errors below $14\%$ for velocity and $9\%$ for temperature. We apply the ROM-based EKF to a hybrid simulation scenario where the system state is assimilated from coarse PIV-like velocity measurements. It is shown that velocity observations alone suffice to reconstruct the state, including the temperature field. Finally, we exploit the Kalman gain matrix to develop a greedy sensor placement strategy that progressively removes the least informative sensors. The algorithm reveals a clear hierarchy among sensor types and can be used to derive skeletal observation configurations. It also provides guidance on which measurement variables and spatial locations are most informative for state correction. The present framework is general, and may be applied to other quadratic Galerkin ROMs for state estimation.

2606.20352 2026-06-19 physics.flu-dyn 新提交 80%

Planar Lagrangian transport and scalar-gradient organization in a turbulent reacting shear layer

湍流反应剪切层中的平面拉格朗日输运与标量梯度组织

Sriram P. Kalathoor, Joseph C. Oefelein

专题命中 物理仿真 :湍流反应剪切层输运,物理仿真

AI总结 通过三维直接数值模拟的时均中平面数据,结合有限时间李雅普诺夫指数场、柯西-格林变形测度及双曲测地线拉格朗日相干结构提取,分析了超音速反应氢气-空气混合层中的平面拉格朗日输运与标量梯度组织,揭示了有限时间拉伸对反应剪切层结构的组织作用。

Comments 20 pages, 23 figures, 19 tables, to be submitted to Chaos

详情
AI中文摘要

我们利用三维直接数值模拟的时均中平面数据,分析了超音速反应氢气-空气时间混合层中的平面拉格朗日输运与标量梯度组织。分析结合了前向/后向有限时间李雅普诺夫指数(FTLE)场、操作性FTLE脊骨架、柯西-格林变形测度、剪切LCS度量以及平面双曲测地线LCS提取,以研究有限时间拉伸如何结构化反应剪切层。时均FTLE脊识别了受限二维切片中的排斥和吸引有限时间输运骨架,并量化了脊几何、交叉占用、持久性和标量条件输运。双曲测地线LCS从平面流图重建的柯西-格林张量中提取,作为在高λ_max法向极大值处播种的应变线,提供了操作性FTLE脊骨架的变分对应物。然后,我们将输运骨架与温度、混合分数和反应中间体联系起来。结果显示:局部前向/后向脊重叠、强标量梯度富集、占据相同高应变输运骨架的有限时间测地线LCS、相对于时间和横流分层零模型的残余方向依赖性分离,以及相对于去相关和FTLE积分尺度保持紧凑的标量响应滞后。这些结果共同提供了可压缩反应剪切流中相干结构及其在中平面混合中作用的输运导向表征。

英文摘要

We analyze planar Lagrangian transport and scalar-gradient organization in a supersonic, reacting hydrogen-air temporal mixing layer using time-resolved mid-plane data from a three-dimensional direct numerical simulation. The analysis combines forward/backward finite-time Lyapunov exponent (FTLE) fields, operational FTLE-ridge skeletons, Cauchy-Green deformation measures, shear-LCS metrics, and planar hyperbolic geodesic-LCS extraction to examine how finite-time stretching structures the reacting shear layer. The time-resolved FTLE ridges identify repelling and attracting finite-time transport skeletons in the constrained two-dimensional slice, from which ridge geometry, intersection occupancy, persistence, and scalar-conditioned transport are quantified. Hyperbolic geodesic LCS are extracted from Cauchy-Green tensors reconstructed from planar flow maps as strainlines seeded at high-$λ_{\max}$ normal maxima, providing a variational counterpart to the operational FTLE-ridge skeleton. We then relate the transport skeleton to temperature, mixture fraction, and a reaction intermediate. The results show localized forward/backward ridge overlap, strong scalar-gradient enrichment, finite-time geodesic LCS that occupy the same high-strain transport skeleton, residual direction-dependent separation from a time- and cross-stream-stratified null model, and scalar-response lags that remain compact relative to decorrelation and FTLE-integration scales. Together, these results provide a transport-oriented characterization of coherent structures and their role in mid-plane mixing within a compressible reacting shear flow.

2606.20298 2026-06-19 physics.plasm-ph physics.acc-ph physics.optics 新提交 80%

Dephasingless laser wakefield acceleration in a plasma waveguide

等离子体波导中的无退相激光尾场加速

J. P. Palastro, K. G. Miller, C. D. Arrowsmith, R. Almeida, M. R. Edwards, A. L. Elliott, A. Kiewel, A. Konzel, L. S. Mack, D. Ramsey, D. Singh, A. G. R. Thomas, J. Vieira

专题命中 物理仿真 :激光尾场加速,等离子体物理仿真

AI总结 提出利用等离子体波导中时空结构激光脉冲驱动真空光速尾场,消除电子退相,保持恒定光斑尺寸和超短脉宽,单级能量增益随模式数线性增加。

详情
AI中文摘要

激光尾场加速器(LWFA)为紧凑型电子加速器和光子源提供了极大的加速梯度,但受限于退相,即被捕获的电子会超出尾场的加速相位。飞行聚焦脉冲可以通过以真空光速驱动尾场来消除退相,但这些脉冲涉及权衡,如变化的光斑尺寸、长持续时间或大的等离子体体积。在这里,我们展示了在等离子体波导中传播的时空结构激光脉冲可以以真空光速驱动尾场,同时保持恒定的光斑尺寸和超短脉宽。该脉冲是通过叠加具有适当选择的频率的等离子体波导模式形成的。与飞行聚焦方法相比,波导显著减少了所需的等离子体体积。标度律和准三维粒子模拟表明,单级能量增益随用于构建脉冲的模式数线性增加,从而实现了比标准LWFA更大的能量增益或更短的加速级。

英文摘要

Laser wakefield accelerators (LWFAs) provide extremely large accelerating gradients for compact electron accelerators and photon sources but are limited by dephasing, where trapped electrons outrun the accelerating phase of the wakefield. Flying-focus pulses can eliminate dephasing by driving a wake at the vacuum speed of light, but these pulses involve tradeoffs such as varying spot size, long duration, or large plasma volume. Here we show that a spatiotemporally structured laser pulse propagating in a plasma waveguide can drive a wakefield at the vacuum speed of light while maintaining a constant spot size and ultrashort duration. The pulse is formed by superposing plasma-waveguide modes with appropriately selected frequencies. Compared with flying-focus approaches, the waveguide substantially reduces the required plasma volume. Scaling laws and quasi-3D particle-in-cell simulations show that the single-stage energy gain increases linearly with the number of modes used to construct the pulse, enabling larger energy gains or shorter stages than standard LWFA.

2606.20139 2026-06-19 physics.flu-dyn 新提交 80%

A high-fidelity numerical database for free-stream transition

自由流转换的高保真数值数据库

Louenas Zemmour, Xavier Gloerfelt, Paola Cinnella

专题命中 物理仿真 :湍流转换高保真数据库,流体仿真

AI总结 通过壁面解析隐式大涡模拟生成高保真数值数据库,模拟ERCOFTAC T3平板实验,评估RANS转换模型缺陷,为机器学习转换模型提供基准。

Comments The high-fidelity numerical database associated with this work is publicly available on Zenodo: https://doi.org/10.5281/zenodo.17166216

详情
AI中文摘要

层流到湍流转换的准确预测对于空气动力学和涡轮机械系统的设计至关重要,然而广泛使用的实验基准(如ERCOFTAC T3系列)缺乏现代模型开发所需的全场、三维和时间分辨数据。为了解决这些限制,本研究提出了边界层旁路转换的高保真数值数据库,通过壁面解析隐式大涡模拟(iLES)严格模拟ERCOFTAC T3平板实验。计算使用高阶可压缩Navier-Stokes求解器在多种配置下进行,涵盖一系列自由流湍流强度以及零和变化压力梯度。数值结果在壁面摩擦、平均速度和脉动剖面方面与遗留实验数据表现出令人满意的一致性。最后,利用所得数据库评估标准RANS转换模型(SA-BCM和$k-\omega-\gamma$)的预测能力,揭示了预测转换起始和长度方面的系统性缺陷。这突显了该数据集作为校准、评估和开发下一代物理信息机器学习转换模型的基础资源的价值。

英文摘要

The accurate prediction of laminar-to-turbulent transition is critical for the design of aerodynamic and turbomachinery systems, yet widely used experimental benchmarks, such as the ERCOFTAC T3 series, lack the full-field, three-dimensional, and time-resolved data required for modern model development. To address these limitations, this study presents a high-fidelity numerical database of bypass transition in boundary layers, generated using wall-resolved implicit Large Eddy Simulations (iLES) to rigorously mimic the ERCOFTAC T3 flat-plate experiments. Computations are performed using a high-order compressible Navier-Stokes solver across multiple configurations, encompassing a range of freestream turbulence intensities and both zero and varying pressure gradients. The numerical results demonstrate satisfactory agreement with legacy experimental data for skin friction, mean velocity, and fluctuation profiles. Finally, the resulting database is utilized to evaluate the predictive capabilities of standard Reynolds-Averaged Navier-Stokes (RANS) transition models (SA-BCM and $k-ω-γ$), revealing systemic flaws in predicting transition onset and length. This highlights the dataset's value as a foundational resource for the calibration, assessment, and development of next-generation, physics-informed machine learning transition closures.

2606.20125 2026-06-19 physics.optics physics.plasm-ph 新提交 80%

Caustic-Driven Fluidic Microlenses for Enhanced Nonlinear and High-Energy-Density Physics

用于增强非线性与高能量密度物理的焦散驱动流体微透镜

Sourabh Singh, S. Sree Harsha, Tamanna, Prashant Kumar Singh

专题命中 物理仿真 :焦散微透镜高能量密度物理,物理仿真

AI总结 本文展示液体射流中的焦散微透镜效应可高效驱动线性、非线性和高能量密度现象,通过微焦耳飞秒脉冲产生吉帕冲击,并支持高达0.2 MHz重复率。

Comments Submitted to Physical Review Applied; under review

详情
AI中文摘要

我们证明了液体射流中发生的焦散微透镜效应能高效驱动线性、非线性和高能量密度现象。在线性区域,焦散提供局域聚焦,区别于外部高数值孔径光学元件。在非线性区域,它们增强液体-空气界面的输入场并提升表面敏感过程。在高能量密度领域,焦散驱动的局域激光吸收利用微焦耳飞秒脉冲产生吉帕冲击,且可扩展至0.2 MHz的重复率。焦散驱动流体微透镜为表面非线性光学、超快科学和高能量密度物理提供了机遇。

英文摘要

We demonstrate that caustic microlensing occurring in a liquid jet efficiently drives linear, nonlinear, and high-energy-density phenomena. In the linear regime, caustics provide localized focusing, distinct from external high-NA optics. In the nonlinear regime, they enhance the input field at the liquid-air interface and boost surface-sensitive processes. In the high-energy-density domain, caustic-driven localized laser absorption generates gigapascal shocks using microjoule femtosecond pulses, with scalability up to repetition rates of 0.2 MHz. Caustic-driven fluidic microlensing offers opportunities for surface nonlinear optics, ultrafast science, and high-energy-density physics.

2606.19523 2026-06-19 physics.plasm-ph 新提交 80%

Bayesian optimization of stellarator alpha-particle confinement using data-informed parameter spaces and dimensionality reduction

利用数据驱动参数空间和降维的仿星器α粒子约束贝叶斯优化

Matt Landreman, Michael Czekanski, Andrew Giuliani, Byoungchan Jang, Rory Conlin

专题命中 物理仿真 :仿星器α粒子约束贝叶斯优化,属于物理仿真

AI总结 提出两种基于数据的新参数空间(分位数变换和PCA+分位数变换)解决仿星器优化中傅里叶参数边界设置难题,结合贝叶斯优化与引导中心追踪实现快速粒子约束优化,得到非准对称或准等动态的优异约束位形。

详情
AI中文摘要

现代仿星器通常通过优化等离子体边界表面的形状来设计,参数取为傅里叶振幅。许多有前景的优化算法(如贝叶斯方法)需要对参数施加边界约束,并且当每个参数的尺度相似时效率最高。对于典型的傅里叶参数化,如何设置这些边界尚不明确:宽约束会导致边界自相交和MHD平衡计算频繁失败,而紧约束则限制了表达能力。为了解决这些问题,本文提出了两种新的仿星器优化参数空间。两者都从现有仿星器边界数据集开始。第一种方法对每个傅里叶自由度应用分位数变换,将数据分布映射到单位区间上的均匀分布。第二种方法对边界上的点应用主成分分析(PCA),然后进行分位数变换。对于两种方法,变换后的变量成为自由度,自然有界于[0, 1]。PCA方法还具有降维的额外优势,用少量参数即可获得高表达能力。通过贝叶斯优化,在优化循环内使用引导中心追踪进行异步并行化,展示了这些方法在良好α粒子约束方面的效果。这些优化得到了在远离准对称或准等动态的磁场中具有优异快粒子约束的仿星器位形。

英文摘要

Modern stellarators are typically designed by optimizing the shape of the plasma boundary surface, with the parameters taken to be Fourier amplitudes. Many promising optimization algorithms such as Bayesian methods require bound constraints on the parameters and are most efficient when each parameter is scaled similarly to the others. With the typical Fourier parameterization, it is unclear how to set these bounds: wide constraints lead to self-intersecting boundaries and frequent failures of the MHD equilibrium calculation, while tight bound constraints limit expressiveness. To address these issues, here we propose two new parameter spaces for stellarator optimization. Both begin with a dataset of existing stellarator boundaries. In the first approach, a quantile transformation is applied to each Fourier degree of freedom, mapping the data distribution to a uniform distribution on the unit interval. In the second approach, principal component analysis (PCA) is applied to points on the boundaries, followed by a quantile transformation. For both approaches, the transformed variables become the degrees of freedom, naturally bounded to [0, 1]. The PCA method has the additional benefit of dimensionality reduction, with high expressiveness for a small number of parameters. The methods are demonstrated via Bayesian optimization for good alpha-particle confinement with guiding-center tracing inside the optimization loop, using asynchronous parallelization. These optimizations yield stellarator configurations with excellent fast-particle confinement in fields that can be far from quasisymmetric or quasi-isodynamic.

5. 材料化学 4 篇

2606.20105 2026-06-19 physics.chem-ph physics.comp-ph 新提交 80%

Can DFT-trained neural network potentials reproduce structure, solvation, and water-exchange properties in aqueous magnesium solutions?

DFT训练的神经网络势能否重现镁水溶液中的结构、溶剂化和水交换性质?

Sebastian Falkner, Pablo Montero de Hijes, Christoph Dellago, Nadine Schwierz

专题命中 材料化学 :神经网络势模拟镁溶液,材料化学

AI总结 开发并系统评估基于revPBE-D3/zd和revPBE0-D3/zd数据的MACE神经网络势,发现其能准确再现水合结构、扩散和交换动力学,但溶剂化自由能显著低估实验值,表明需显式长程静电处理。

详情
AI中文摘要

镁离子在许多生物过程中起着至关重要的作用,但在生物分子模拟中仍然难以建模。尽管付出了大量的科学努力,经典力场未能同时再现关键的结构、热力学和动力学溶液性质,这很可能是因为它们无法显式考虑量子多体效应。在这里,我们开发并系统评估了用于水MgCl$_2$溶液的MACE神经网络势(NNPs),这些势基于revPBE-D3/zd和revPBE0-D3/zd密度泛函理论参考数据训练,并评估它们再现广泛实验溶液性质的能力,包括第一水合壳的结构、扩散系数、活性导数、水交换速率和机制以及溶剂化自由能。两种NNP都准确地再现了第一水合壳的八面体结构、离子配对性质和扩散系数。将NNP与过渡路径采样和其他增强采样技术相结合,使我们能够捕获Mg$^{2+}$第一水合壳中水交换的罕见事件,揭示了解离交换机制。过渡界面采样得到的交换速率在实验值的一个数量级内,相比经典解离力场有显著改进。相比之下,NNP导出的溶剂化自由能显著低估了实验值,揭示了当前局部NNP架构在描述离子溶剂化热力学方面的局限性。我们的结果表明,DFT训练的NNP可以准确描述Mg$^{2+}$的水合结构、扩散、离子配对和交换动力学,同时强调需要显式长程静电处理以实现与实验离子溶剂化自由能的定量一致。

英文摘要

Magnesium ions play an essential role in many biological processes but remain challenging to model in biomolecular simulations. Despite considerable scientific effort, classical force fields fail to simultaneously reproduce key structural, thermodynamic and kinetic solution properties, likely due to their inability to explicitly account for quantum many-body effects. Here, we develop and systematically benchmark MACE neural network potentials (NNPs) for aqueous MgCl$_2$ solutions trained on revPBE-D3/zd and revPBE0-D3/zd density functional theory reference data and assess their ability to reproduce a broad range of experimental solution properties including the structure of the first hydration shell, diffusion coefficient, activity derivative, water-exchange rate and mechanism as well as solvation free energy. Both NNPs accurately reproduce the octahedral structure of the first hydration shell, ion pairing properties and diffusion coefficients. Combining the NNPs with transition path sampling and other enhanced sampling techniques allows us to capture the rare event of water exchange in the first hydration shell of Mg$^{2+}$ revealing a dissociative exchange mechanism. Transition interface sampling yields exchange rates within one order of magnitude of experiment, representing a substantial improvement over classical dissociative force fields. In contrast, the NNP-derived solvation free energy significantly underestimates the experimental value, revealing a limitation of the present local NNP architectures for describing ion solvation thermodynamics. Our results demonstrate that DFT-trained NNPs can accurately describe Mg$^{2+}$ hydration structure, diffusion, ion pairing, and exchange kinetics, while highlighting the need for explicit long-range electrostatic treatments to achieve quantitative agreement with experimental ion solvation free energies.

2606.20462 2026-06-19 cond-mat.soft cond-mat.mtrl-sci cond-mat.stat-mech 新提交 80%

Polymer-polymer interdiffusion: effects of entanglements and a polymeric source

聚合物-聚合物相互扩散:缠结和聚合物源的影响

Avraham Moriel, Howard A. Stone

专题命中 材料化学 :聚合物相互扩散研究,属于软物质物理

AI总结 利用双流体模型研究缠结和非缠结聚合物在有无源条件下的相互扩散,推导标度关系和自相似解,并通过数值模拟验证,揭示源项对扩散前沿特征的影响。

Comments 11 pages, 7 figures

详情
AI中文摘要

许多工业应用和生物场景涉及两种聚合物物种的相互扩散。受生物亚细胞源驱动过程的启发,我们研究了在无或有聚合物源的情况下,非缠结和缠结场景中的聚合物-聚合物相互扩散问题。利用双流体形式,我们得到了标度关系、自相似约化和解析解,并通过一维和二维数值模拟进行了验证。源项的引入打破了自相似结构,改变了边界条件和积分域。然而,我们表明,扩散液滴的前沿特征表现出与无源情况下相似的空间结构。我们的结果有助于更深入地理解聚合物-聚合物相互扩散和非线性输运,尤其是在存在源的情况下。

英文摘要

Many industrial applications and biological scenarios involve the interdiffusion of two polymeric species. Motivated by biological subcellular source-driven processes, we study polymer-polymer interdiffusion problems in the absence or the presence of a polymeric source, for both unentangled and entangled scenarios. Utilizing a two-fluid formalism, we arrive at scaling relations, self-similar reductions, and analytical solutions, which are confirmed with one- and two-dimensional numerical simulations. The introduction of a source term breaks the self-similar structure, modifying the boundary conditions and the domain of integration. Nevertheless, we show that the front characteristics of the diffusing droplet exhibit similar spatial structures as in the absence of a source. Our results allow deeper understanding of polymer-polymer interdiffusion and nonlinear transport, especially in the presence of a source.

2606.20159 2026-06-19 cond-mat.mes-hall cond-mat.mtrl-sci 新提交 80%

Electric-field-driven magnetic switching and tightly bound interlayer excitons in bilayer CrSBr

电场驱动的磁开关与双层CrSBr中的强束缚层间激子

Xiubin Li, Yu Sun, Xuanji Wang, Chunyan Wang, Kenji Watanabe, Takashi Taniguchi, Sheng Liu, Ting Yu, Liang Li, Tao Zhang, Jing Li

专题命中 材料化学 :研究电场驱动磁开关和层间激子,材料物理

AI总结 本研究在双层CrSBr中实现了电场驱动的反铁磁-铁磁可逆开关,并发现强束缚层间激子,揭示了线性磁电效应和电场调制的层间交换耦合两种共存机制。

详情
AI中文摘要

在二维范德华磁体中实现电场控制磁序是低功耗自旋技术的一个核心目标。在环境稳定的反铁磁体CrSBr中,强磁各向异性和稳健的激子-自旋耦合提供了有利平台,但其磁相的电学确定性控制尚未实现。这里我们展示了在无有意载流子掺杂的双栅极双层CrSBr中,电场驱动的反铁磁与铁磁态之间的可逆开关。同时,光致发光测量解析出一个强束缚的层间激子,其本征偶极矩仅为约1 e埃。磁相变的电场依赖性揭示了两种共存机制:反铁磁态中的线性磁电效应和电场调制的层间交换耦合。它们的相互作用解释了临界磁场的非对称演化。我们的结果确立了双层CrSBr作为电控自旋光电子功能器件的有前途的二维材料。

英文摘要

Electric field control of magnetic order in two-dimensional (2D) van der Waals magnets is a central goal for low-power spin-based technologies. In the ambient-stable antiferromagnet CrSBr, strong magnetic anisotropy and robust exciton-spin coupling provide a favorable platform, yet deterministic electric field control of its magnetic phases has not been achieved. Here we demonstrate electric-field-driven reversible switching between antiferromagnetic and ferromagnetic states in dual-gated bilayer CrSBr without intentional carrier doping. In parallel, photoluminescence measurements resolve a tightly bound interlayer exciton with an intrinsic dipole moment of only ~1 e angstrom. The electric field dependence of the magnetic phase transition reveals two coexisting mechanisms: a linear magnetoelectric effect in the antiferromagnetic state and an electric-field-modulated interlayer exchange coupling. Their interplay accounts for the asymmetric evolution of the critical magnetic field. Our results establish bilayer CrSBr as a promising 2D material for electrically controlled spin-optoelectronic functionalities.

2606.20116 2026-06-19 cond-mat.mtrl-sci physics.chem-ph 新提交 80%

Hartree-Fock Limit for Energies in Solids

固体能量的Hartree-Fock极限

Jānis Užulis, Andris Gulans

专题命中 材料化学 :建立固体Hartree-Fock极限方法,材料计算

AI总结 在线性化增强平面波框架内建立达到Hartree-Fock极限的方法,通过一致构造径向基函数和核心轨道,实现分子和固体总能量精度达几μHa,并为14种半导体和绝缘体提供参考数据。

详情
AI中文摘要

本研究在线性化增强平面波(LAPW)框架内建立了达到分子和固体Hartree-Fock(HF)极限的途径。我们通过一致地构造与HF哈密顿量匹配的径向基函数和核心轨道,消除了标准LAPW方法处理非局域交换的当前限制。所提出的方法能够以几μHa的精度计算分子和固体的总能量,并用于为14种半导体和绝缘体提供参考数据。对于本研究中考虑的系统,基于(半)局域势构造径向基函数和核心轨道的标准方法在实际相对能量(包括分子和固态形成能以及Si自间隙缺陷形成能)方面仍然高度精确。更广泛地说,这些结果为基组和赝势评估提供了严格的全电子基准,改进了LAPW内杂化泛函计算的误差控制,并开辟了直接基于杂化泛函核心轨道在LAPW内进行X射线光谱模拟的途径。

英文摘要

This study establishes a route to the Hartree--Fock (HF) limit for molecules and solids within the linearized augmented plane wave (LAPW) framework. We remove current limitations of the standard LAPW approach to nonlocal exchange by constructing radial basis functions and core orbitals consistently with the HF Hamiltonian. The presented method yields total energies of molecules and solids with a precision of a few $μ$Ha, and we use it to provide reference data for 14 semiconductors and insulators. For the systems considered in this study, the standard approach based on (semi)local potentials for constructing radial basis functions and core orbitals remains highly precise for practical relative energies, including molecular and solid-state formation energies and Si self-interstitial defect formation energies. More broadly, the results provide stringent all-electron benchmarks for basis-set and pseudopotential assessment, improve error control in hybrid-functional calculations within LAPW, and open the way to X-ray spectroscopy simulations within LAPW based directly on hybrid-functional core orbitals.