arXivDaily arXiv每日学术速递 周一至周五更新

科学与医疗

AI for Science

科学智能、蛋白质、分子、药物、材料、气象、物理和数学 AI。

今日/当前日期收录 8 信号源:cs.LG, q-bio, physics, cond-mat, math, stat.ML
2606.19825 2026-06-19 cs.LG 新提交 90%

Enhancing Graph Neural Networks Using Proximity Graphs for Dust Source Emission Forecasting

利用邻近图增强图神经网络用于沙尘源排放预测

Maryam Sanisales, Zahed Rahmati, Ali Darvishi Boloorani, Ali Vefghi

发表机构 * Amirkabir University of Technology(阿米尔卡比尔理工大学) University of Tehran(德黑兰大学)

专题命中 气象气候 :利用图神经网络预测沙尘源排放,属于气象应用。

AI总结 提出使用Delaunay三角剖分等邻近图作为图神经网络输入,通过消息传递捕捉沙尘源排放的时空动态,相比随机图和LSTM模型显著提升预测精度。

详情
AI中文摘要

准确预测沙尘源排放对于减轻沙尘暴带来的重大环境和健康危害至关重要。传统预测方法通常难以捕捉这些现象的复杂时空动态。在本文中,我们证明邻近图使图神经网络(GNN)能够有效建模数据点之间复杂的空间和时间关系。具体来说,我们使用邻近图——如Delaunay三角剖分、Gabriel图、k-最近邻图和Yao图——作为GNN(包括GraphSAGE、图卷积网络和图注意力网络)的输入来执行消息传递。我们的方法强调了将邻近图与GNN集成用于稳健准确的沙尘源预测的有效性。为了强调邻近图表示的重要性,我们将我们的方法与使用随机图进行消息传递的GNN进行了比较。结果表明,使用邻近图的GNN显著优于使用随机图的GNN,并且在沙尘源排放预测中也远优于长短期记忆(LSTM)模型。

英文摘要

Accurate prediction of dust source emissions is critical for mitigating the significant environmental and health hazards posed by dust storms. Traditional forecasting methods often struggle to capture the complex spatiotemporal dynamics of these phenomena. In this paper, we demonstrate that proximity graphs enable Graph Neural Networks (GNNs) to effectively model the intricate spatial and temporal relationships between data points. Specifically, we use proximity graphs--such as Delaunay triangulation, Gabriel graph, k-Nearest Neighbor graph, and Yao graph--as the input for GNNs (including GraphSAGE, Graph Convolutional Networks, and Graph Attention Networks) to perform message passing. Our approach highlights the effectiveness of integrating proximity graphs with GNNs for robust and accurate dust source forecasting. To emphasize the importance of proximity graph representations, we compare our method against GNNs using random graphs for message passing. The results show that GNNs with proximity graphs significantly outperform those with random graphs and are also far superior to Long Short-Term Memory (LSTM) model in dust source emission forecasting.

2606.19642 2026-06-19 physics.ao-ph stat.AP stat.ML 新提交 90%

Rigorous uncertainty quantification of probabilistic AI weather forecasts with conformal prediction

基于保形预测的概率AI天气预报的严格不确定性量化

Anna Asch, Raphael Rossellini, Pedram Hassanzadeh, Rebecca Willett

专题命中 气象气候 :AI天气预报不确定性量化,属于气象科学智能

AI总结 针对AI概率天气预报校准不足(尤其是极端事件),提出使用保形预测方法,无需分布假设即可数学保证覆盖,应用于三个全球模型(GenCast、NeuralGCM、AIFS-ENS)的温度和降水预报,实现校准不确定性而不牺牲其他概率指标。

详情
AI中文摘要

概率天气预报正随着人工智能(AI)经历快速变革。在传统数值天气预报中,计算能力可能限制集合预报对未知未来状态统计分布的近似程度。AI模型便于生成更大的集合,并经过概率考量训练,理论上能带来更好的不确定性量化。这些最先进模型的预报通常被认为是良好校准的。然而,我们在此表明,此类模型的统计覆盖(校准的最终度量)可能存在问题,尤其是在极端事件上。为解决这一缺陷,我们采用保形预测,这是一类统计方法,与以往的后处理技术不同,它在无分布假设下数学上保证覆盖。我们将在线保形预测应用于三个领先全球天气模型(GenCast、NeuralGCM和AIFS-ENS)的温度和降水预报(包括极端情况),确保校准不确定性而不牺牲其他概率指标。这种后处理方法可应用于任何预报模型。

英文摘要

Probabilistic weather forecasting is undergoing rapid transformation with artificial intelligence (AI). In traditional numerical weather prediction, computing power can limit how well ensemble forecasts approximate the unknown statistical distribution of future states. AI models facilitate larger ensembles and are trained with probabilistic considerations, ideally leading to better uncertainty quantification. Forecasts from these state-of-the-art models are often considered well-calibrated. However, here we show that the statistical coverage of such models, the ultimate measure of calibration, can struggle, especially on extreme events. To address this shortcoming, we employ conformal prediction, a class of statistical methods that mathematically guarantees coverage under no distributional assumptions, unlike previous post-processing techniques. We apply online conformal prediction to temperature and precipitation forecasts (including extremes) of three leading global weather models, GenCast, NeuralGCM, and AIFS-ENS, ensuring calibrated uncertainty at no expense to other probabilistic metrics. This post-processing method can be applied to any forecasting model.

2606.20165 2026-06-19 physics.ao-ph 新提交 85%

PRecover 1.0: Process Rate Recovery with Machine Learning

PRecover 1.0:基于机器学习的过程速率恢复

Miriam Simm, Tom Beucler, Corinna Hoose

专题命中 气象气候 :机器学习恢复云微物理速率,气象应用

AI总结 提出PRecover数据驱动后处理方法,利用随机森林、梯度提升和神经网络从ICON模型标准输出中恢复未存储的云微物理过程速率,采用两步分类-回归方法,成功恢复短时间累积速率并提供校准预测区间。

Comments Prepared for submission to Geoscientific Model Development (GMD)

详情
AI中文摘要

来自数值模拟的云微物理过程速率的全面信息有助于更好地理解降水形成路径和气溶胶-云相互作用。然而,资源限制通常使得将所有微物理过程速率包含在模型输出中不切实际,限制了深入分析。为了解决这一不足,我们引入了PRecover,一种数据驱动的后处理方法,用于从数值天气预报模型的标准输出中恢复运行时未存储的微物理过程速率。具体来说,我们训练随机森林、梯度提升模型和前馈神经网络,从ICosahedral非静力(ICON)模型中的双矩体微物理方案恢复微物理过程速率。我们使用云变量作为输入,这些变量来自欧洲有限区域设置下的高分辨率模拟。暖雨和冰微物理过程速率通过两步分类-回归方法恢复,包括瞬时和累积过程速率。作为基于物理的基线,我们评估是否可以直接从存储的ICON输出变量重新计算过程速率。对于增长和自收集等过程速率,可以准确重新计算,但对于自动转换、雨融化或异质冰核化速率则不行。使用PRecover,我们成功恢复了大多数在10分钟或更短输出时间步长内累积的过程速率,但对于更长累积间隔累积的速率,恢复难度增加。为了量化预测不确定性,我们通过共形分位数回归提供校准的预测区间。我们通过两个在不同区域域和训练中未见过的模拟设置下的案例研究,展示了模型的空间可迁移性。

英文摘要

Comprehensive information on cloud microphysical process rates from numerical simulations allows for better understanding of precipitation formation pathways and aerosol-cloud interactions. However, resource limitations often make it impractical to include all microphysical process rates in the model output, limiting in-depth analyses. To address this shortcoming, we introduce PRecover, a data-driven post-processing approach to recover microphysical process rates that are not stored during runtime from standard output of a numerical weather prediction model. In particular, we train random forests, gradient boosting models, and feed-forward neural networks to recover microphysical process rates from a two-moment bulk microphysics scheme in the ICOsahedral Nonhydrostatic (ICON) model. We use cloud variables as input, obtained from high-resolution simulations in a limited-area setup over Europe. Warm-rain and ice microphysical process rates are recovered with a two-step classification-regression approach for both instantaneous and accumulated process rates. As a physics-based baseline, we assess whether process rates can be directly recalculated from stored ICON output variables. Accurate recalculation is possible for process rates such as accretion and self-collection but not for the autoconversion, rain melting or heterogeneous ice nucleation rate. Using PRecover, we successfully recover most of the process rates that are accumulated over output time steps of 10 minutes or less, but the values are increasingly difficult to recover for rates accumulated over longer accumulation intervals. To quantify predictive uncertainty, we provide calibrated prediction intervals through conformalized quantile regression. We demonstrate spatial transferability of the models with two case studies over different regional domains and simulation settings unseen during training.

2606.19778 2026-06-19 physics.ao-ph 新提交 85%

A Stochastic-Thermodynamic Constraint on the Seasonal Phase Locking of the El Niño-Southern Oscillation

厄尔尼诺-南方涛动季节锁相的一个随机热力学约束

Yuki Yasuda, Tsubasa Kohyama

专题命中 气象气候 :ENSO季节锁相机制,属于气候科学智能

AI总结 通过线性随机充放电振子模型,利用热力学不确定关系量化熵产生率对SST异常方差季节变化的约束,解释ENSO冬季锁相机制。

详情
AI中文摘要

我们在线性随机充放电振子(SRO)中研究了厄尔尼诺-南方涛动(ENSO)的季节锁相,该振子是一个具有加性噪声和时变增长率的阻尼振子。锁相反映在海表温度异常(SSTA)方差的季节性上。通常,能量驱动这种变化,而熵则控制其是否发生;因此锁相同时受到基于能量和基于熵的约束。我们使用热力学不确定关系(TUR)量化了这种基于熵的约束,TUR是随机热力学中的一个基本不等式。TUR通过部分熵产生率约束SSTA方差的变化趋势,该熵产生率由正向和反向转移概率之比主导,并量化了SSTA转移的不可逆性。增长率控制这种不可逆性:其极值出现在北半球秋季和冬末,熵产生率在这两个时期达到峰值。这些峰值放松了TUR对SSTA方差趋势的约束,使得方差本身可以在北半球冬季达到峰值,这与观测到的ENSO锁相一致。相反,当不可逆性不足时,ENSO无法增长或衰减。如果这种不可逆性被解释为耗散能量,那么对ENSO增长和衰减的约束将要求这种耗散从赤道太平洋输出。需要更现实的模型来检验这一假设,并进一步探索熵与耗散能量之间的物理联系。

英文摘要

We investigate the seasonal phase locking of the El Niño-Southern Oscillation (ENSO) in a linear stochastic recharge oscillator (SRO), a damped oscillator with additive noise and a time-dependent growth rate. Phase locking is reflected in the seasonality of the variance of the sea surface temperature anomaly (SSTA). In general, energy drives such a change, whereas entropy governs whether it occurs; phase locking is thus subject to both an energy- and an entropy-based constraint. We quantify this entropy-based constraint using a thermodynamic uncertainty relation (TUR), a fundamental inequality in stochastic thermodynamics. The TUR constrains the tendency of the SSTA variance by the partial entropy production rate, which is dominated by the ratio of forward and backward transition probabilities and quantifies the irreversibility of SSTA transitions. The growth rate governs this irreversibility: its extrema occur in boreal autumn and late winter, and the entropy production rate peaks at both times. These peaks relax the TUR constraint on the tendency of the SSTA variance, so that the variance itself can peak in boreal winter, consistent with observed ENSO phase locking. Conversely, when irreversibility is insufficient, ENSO cannot grow or decay. If this irreversibility were interpreted as dissipated energy, the constraint on ENSO growth and decay would require this dissipation to be exported from the equatorial Pacific. A more realistic model is needed to test this hypothesis and to further explore the physical connection between entropy and dissipated energy.

2601.18182 2026-06-19 physics.ao-ph physics.data-an 85%

A strictly geostrophic product of sea-surface velocities from the SWOT fast-sampling phase

从SWOT快速采样阶段严格地转流产物的海面速度

Takaya Uchida, Badarvada Yadidya, Vadim Bertrand, Jia-Xian Chang, Brian Arbic, Jay Shriver, Julien Le Sommer

专题命中 气象气候 :利用动态模式分解从SWOT卫星数据提取地转流,属于海洋气象研究。

AI总结 本文提出利用动态模式分解方法从SWOT轨道中提取地转成分,提供涡度和应变的联合概率密度函数及SSHa谱,以解决地转平衡在测高观测中的应用问题。

Comments 25 pages with double spacing, 4 figures

详情
AI中文摘要

尽管地转平衡仍是提取海面高度异常(SSHa)速度信息的最简单和最实用的平衡方法,但海洋学界仍存在疑问,即这种平衡在SWOT卫星测高观测中的应用程度如何。鉴于SWOT的有限时间分辨率,许多研究倾向于声称空间滤波后的SSHa场对应地转成分,这引入了选择空间尺度的模糊性。本文基于最近的内部潮(IT)校正发展(Yadidya等,2025)和Lapo等(2025)引入的动力学模式分解(DMD)方法,从SWOT一天重复轨道中稳健地提取与次惯性频率相关的地转成分;我们将全球数据集作为公共产品分发。我们提供了涡度和应变的联合概率密度函数(PDF)以及几个交叉区域的SSHa谱。

英文摘要

While geostrophy remains the simplest and most practical balance to extract velocity information from sea-surface height anomaly (SSHa), confusions remain within the oceanographic community to what extent this balance can be applied to altimetric observations with the launch of the Surface Water and Ocean Topography (SWOT) satellite. Given the limited temporal resolution of SWOT, many studies have resorted to claiming that the spatially filtered SSHa fields correspond to the geostrophic component. This introduces the ambiguity of which spatial scale to choose. Here, we build upon the recent developments in internal tide (IT) corrections (Yadidya et al., 2025) and apply a dynamic mode decomposition (DMD)-based method introduced by Lapo et al. (2025) to robustly extract the geostrophic component associated with sub-inertial frequencies from the SWOT one-day-repeat orbit; we distribute the global dataset as a public good. We provide the joint probability density function (PDF) of vorticity and strain, and spectra of SSHa at a few cross-over regions.

2606.19363 2026-06-19 cs.LG 新提交 80%

When to Trust, How to Distill: Multi-Foundation Model Guidance for Lightweight, Robust Scientific Time Series Forecasting

何时信任,如何蒸馏:面向轻量级鲁棒科学时间序列预测的多基础模型指导

Rupasree Dey, Abdul Matin, Nathan Orwick, Yao Zhang, Shrideep Pallickara, Sangmi Lee Pallickara

发表机构 * Colorado State University(科罗拉多州立大学)

专题命中 气象气候 :时间序列基础模型用于气象等领域预测。

AI总结 提出Guard框架,通过上下文路由器和不确定性门控温度机制,从多个分布偏移的基础模型中蒸馏知识,训练轻量级预测器,在气象、碳通量等四个领域降低RMSE。

Comments KDD 2026, paper decision: Accepted, track: AI for Science. total 12 pages including references and appendix

详情
AI中文摘要

时间序列基础模型(TSFMs)在物理科学中的部署受到一个关键权衡的阻碍:虽然这些模型编码了丰富、通用的时间动态,但当零样本应用于特定科学领域时,它们会遭受严重的分布错位,并且其计算成本阻碍了在边缘计算传感器网络中的部署。我们解决了一个基本挑战:如何从错位的基础模型(FM)中提取潜在的结构知识,以训练轻量级、专门的预测器?我们提出了用于蒸馏的门控不确定性感知路由(Guard),这是一个新颖的框架,将多教师蒸馏重新定义为实例级决策过程,具有两种自适应机制:(1)上下文路由器,根据局部输入统计动态选择最相关的教师,利用不同基础模型之间的互补性;(2)不确定性门控温度机制,充当“断路器”,当教师置信度与领域现实偏离时自动减弱蒸馏强度。我们在四个气候关键领域评估了我们提出的轻量级框架:气象学、生态系统碳通量、土壤湿度和能源电网。我们的方法相对于固定权重的多教师蒸馏基线显著降低了RMSE,成功地从预训练的FM(教师)中蒸馏知识,即使由于原始和目标数据域之间的分布偏移,它们表现出次优的零样本准确性。我们证明,这些领域错位的教师仍然可以作为关键的纠正者,在28.5%的最难实例上优于全局优越的FM。最终,这使得适用于资源受限边缘部署的高精度科学预测成为可能。代码可在https://this URL获取。

英文摘要

The deployment of Time-Series Foundation Models (TSFMs) in physical sciences is hindered by a critical trade-off: while these models encode rich, universal temporal dynamics, they suffer from severe distributional misalignment when applied zero-shot to specific scientific domains, and their computational cost prohibits deployment in edge-computing sensor networks. We address a fundamental challenge: How can we extract latent structural knowledge from misaligned foundation models (FM) to train lightweight, specialized forecasters? We propose Gated Uncertainty-Aware Routing for Distillation (Guard), a novel framework that reframes multiteacher distillation as an instance-wise decision process with two adaptive mechanisms: (1) a Contextual Router that dynamically selects the most relevant teacher based on local input statistics, exploiting complementarity across diverse foundation models; and (2) an Uncertainty-Gated Temperature mechanism that acts as a "circuit-breaker," automatically attenuating distillation strength when teacher confidence diverges from domain reality. We evaluate our proposed lightweight framework on four climate-critical domains: meteorology, ecosystem carbon flux, soil moisture, and energy grids. Our method significantly reduces RMSE relative to a fixed-weight multi-teacher distillation baseline, successfully distilling knowledge from pretrained FMs (teachers) even when they exhibit suboptimal zero-shot accuracy due to distribution shift between the original and target data domains. We demonstrate that these domain-misaligned teachers can still serve as critical correctives, outperforming the globally superior FMs on 28.5% of the hardest instances. Ultimately, this enables high-precision scientific forecasting suitable for resource-constrained edge deployment. Code is available at https://github.com/RupasreeDey/GUARD-KDD2026.

2606.20050 2026-06-19 physics.ao-ph physics.flu-dyn 新提交 80%

Enhanced Gulf Stream Path Variability Under Intensified Stratification

增强的层结下墨西哥湾流路径变率增强

Lennard Miller, Antoine Venaille, Stephane Popinet, Bruno Deremble

专题命中 气象气候 :海洋层结影响湾流路径,气候物理

AI总结 通过高分辨率海洋模型,发现上层海洋层结增强导致墨西哥湾流延伸体失去稳定性,从稳定东向路径转变为剧烈混沌弯曲,且这一转变独立于大西洋经向翻转环流和风强迫变化。

Comments 30 pages, 9 figures (including supplementary material)

详情
AI中文摘要

上层海洋层结增强是全球变暖不可避免的后果,并将强烈影响洋流结构。利用高分辨率海洋模型,我们表明层结增强导致墨西哥湾流延伸体失去相干性,其稳定的东向路径被剧烈、混沌的弯曲所取代。这种状态转变独立于大西洋经向翻转环流和表面风强迫的变化。在分辨中尺度涡的理想化和现实海洋模型中,层结增强下弯曲增强也被证明是一个稳健的特征,但在参数化涡的粗分辨率模型中未能捕捉。因此,所呈现的结果强调了在气候预测中改进海洋湍流表征的必要性。

英文摘要

Increased upper-ocean stratification is an unavoidable consequence of global warming and will strongly impact the structure of ocean currents. Using a high-resolution ocean model, we show that intensification of stratification leads to the loss of coherence of the Gulf Stream Extension, replacing its steady eastward path with vigorous, chaotic meanders. This regime shift persists independently of changes in the Atlantic Meridional Overturning Circulation and surface wind forcing. Enhanced meandering under intensified stratification also proves to be a robust feature across both idealized and realistic ocean models that resolve mesoscale eddies, but is not captured by coarse-resolution models that parameterize eddies. The presented findings therefore highlight the need for improved representations of oceanic turbulence in climate projections.

2606.19581 2026-06-19 physics.ao-ph 新提交 80%

A Land-Sea Contrast Pattern in Surface Temperature and Atmospheric Circulation Trends in Recent Decades

近几十年地表温度和大气环流趋势中的海陆对比模式

Benjamin O. Johnson, Maria Rugenstein

专题命中 气象气候 :海陆增温对比与环流趋势,属于气候科学

AI总结 研究发现陆地相对海洋的增暖主导了观测到的地表温度和大气环流趋势,包括太平洋盆地的负类IPO倾向,而气候模式低估了海陆增温比。

详情
AI中文摘要

观测到的气候趋势的空间模式仍然难以理解。本文认为,陆地相对于海洋的增暖塑造了观测到的地表温度和大气环流趋势,包括太平洋盆地的负类年代际太平洋振荡(IPO)倾向。观测和模拟的趋势显示,在相对于海洋增暖更快的陆地上,海平面气压总体下降,其空间模式类似于季节循环和理想化气候模式实验中对陆地加热的响应。使用历史强迫的耦合气候模式模拟低估了海陆增温比。只有在突然CO2四倍增的气候模式模拟的早期响应中,气候模式才能重现观测到的海陆增温比,在这种情况下,可以看到海洋表面高压增强和太平洋上负类IPO地表增暖模式与观测趋势相当。我们提出,许多气候变量中模拟与观测趋势之间的差异可能由气候模式低估海陆增温比来解释。确定这一差异的原因有可能约束未来气候变化的预测,因为导致气候模式低估海陆增温比差异的潜在机制将决定这个问题的持续性。

英文摘要

Spatial patterns in observed climate trends remain poorly understood. Here we argue that a warming of land relative to ocean has shaped observed surface temperature and atmospheric circulation trends, including the negative Inter-Decadal Pacific Oscillation (IPO)-like tendency across the Pacific basin. Observed and modeled trends display an overall decline in sea level pressure over the faster-warming land relative to ocean, with a spatial pattern that resembles the seasonal cycle and the response to land heating in idealized climate model experiments. Coupled climate model simulations with historical forcing underestimate the land-sea warming ratio. It is only in the early response of abrupt CO2 quadrupling climate model simulations that climate models are able to recreate the observed land-sea warming ratio, in which case a strengthening of oceanic surface highs and a negative IPO-like surface warming pattern over the Pacific comparable to observed trends are seen. We propose that discrepancies between modeled and observed trends in many climate variables may be explained by the underestimation of the land-sea warming ratio by climate models. Determining the cause of this discrepancy has the potential to constrain projections of future climate change as the underlying mechanism causing climate models to underestimate the land-sea warming ratio discrepancy will set the persistence of this problem.