Beyond the Training Data: Confidence-Guided Mixing of Parameterizations in a Hybrid AI-Climate Model
超越训练数据:在混合AI-气候模型中基于置信度的参数化混合
Helge Heuer, Tom Beucler, Mierk Schwabe, Julien Savre, Manuel Schlund, Veronika Eyring
AI总结 本文提出通过基于置信度的混合方法,将训练数据中的参数化与传统对流方案结合,以提升混合AI-气候模型的稳定性与可解释性。
详情
地球系统模型(ESMs)中持续性的系统性误差源于对亚网格、多尺度大气对流和湍流的表示困难。机器学习(ML)参数化在短时高分辨率模拟中表现出减少这些误差的潜力。然而,混合(物理+ML)ESMs的稳定长期大气模拟仍然困难,因为离线训练的神经网络(NNs)在在线运行中常不稳定。直接在粗粒数据上训练对流参数化具有挑战性,特别是因为尺度无法清晰分离。这个问题通过超参数化模拟的数据得到缓解,这些数据提供了更清晰的尺度分离。然而,将参数化从一个ESM转移到另一个仍然困难,因为分布偏移会导致大的推理误差。本文提出了一种概念验证,其中训练于ClimSim的物理引导NN对流参数化成功转移到ICON-A。该方案(a)在调整后的ClimSim数据上训练,减去了辐射倾向,(b)被整合到ICON-A中。NN参数化预测其自身的误差,当置信度较低时,可以与传统对流方案混合,从而使混合AI-物理模型能够通过混合参数与观测和再分析数据进行调整。这通过约束水汽柱、低空稳定性和地理条件的对流趋势,提高了过程理解,产生了可解释的区域行为。在AMIP式设置中,几种混合配置优于默认的对流方案(例如改进的降水统计)。在训练期间添加输入噪声,混合和纯ML方案都能产生稳定的模拟,并且在至少20年内保持物理一致性。
Persistent systematic errors in Earth system models (ESMs) arise from difficulties in representing the full diversity of subgrid, multiscale atmospheric convection and turbulence. Machine learning (ML) parameterizations trained on short high-resolution simulations show strong potential to reduce these errors. However, stable long-term atmospheric simulations with hybrid (physics + ML) ESMs remain difficult, as neural networks (NNs) trained offline often destabilize online runs. Training convection parameterizations directly on coarse-grained data is challenging, notably because scales cannot be cleanly separated. This issue is mitigated using data from superparameterized simulations, which provide clearer scale separation. Yet, transferring a parameterization from one ESM to another remains difficult due to distribution shifts that induce large inference errors. Here, we present a proof-of-concept where a ClimSim-trained, physics-informed NN convection parameterization is successfully transferred to ICON-A. The scheme is (a) trained on adjusted ClimSim data with subtracted radiative tendencies, and (b) integrated into ICON-A. The NN parameterization predicts its own error, enabling mixing with a conventional convection scheme when confidence is low, thus making the hybrid AI-physics model tunable with respect to observations and reanalysis through mixing parameters. This improves process understanding by constraining convective tendencies across column water vapor, lower-tropospheric stability, and geographical conditions, yielding interpretable regime behavior. In AMIP-style setups, several hybrid configurations outperform the default convection scheme (e.g., improved precipitation statistics). With additive input noise during training, both hybrid and pure-ML schemes lead to stable simulations and remain physically consistent for at least 20 years.