PRecover 1.0: Process Rate Recovery with Machine Learning
PRecover 1.0:基于机器学习的过程速率恢复
Miriam Simm, Tom Beucler, Corinna Hoose
AI总结 提出PRecover数据驱动后处理方法,利用随机森林、梯度提升和神经网络从ICON模型标准输出中恢复未存储的云微物理过程速率,采用两步分类-回归方法,成功恢复短时间累积速率并提供校准预测区间。
Comments Prepared for submission to Geoscientific Model Development (GMD)
详情
来自数值模拟的云微物理过程速率的全面信息有助于更好地理解降水形成路径和气溶胶-云相互作用。然而,资源限制通常使得将所有微物理过程速率包含在模型输出中不切实际,限制了深入分析。为了解决这一不足,我们引入了PRecover,一种数据驱动的后处理方法,用于从数值天气预报模型的标准输出中恢复运行时未存储的微物理过程速率。具体来说,我们训练随机森林、梯度提升模型和前馈神经网络,从ICosahedral非静力(ICON)模型中的双矩体微物理方案恢复微物理过程速率。我们使用云变量作为输入,这些变量来自欧洲有限区域设置下的高分辨率模拟。暖雨和冰微物理过程速率通过两步分类-回归方法恢复,包括瞬时和累积过程速率。作为基于物理的基线,我们评估是否可以直接从存储的ICON输出变量重新计算过程速率。对于增长和自收集等过程速率,可以准确重新计算,但对于自动转换、雨融化或异质冰核化速率则不行。使用PRecover,我们成功恢复了大多数在10分钟或更短输出时间步长内累积的过程速率,但对于更长累积间隔累积的速率,恢复难度增加。为了量化预测不确定性,我们通过共形分位数回归提供校准的预测区间。我们通过两个在不同区域域和训练中未见过的模拟设置下的案例研究,展示了模型的空间可迁移性。
Comprehensive information on cloud microphysical process rates from numerical simulations allows for better understanding of precipitation formation pathways and aerosol-cloud interactions. However, resource limitations often make it impractical to include all microphysical process rates in the model output, limiting in-depth analyses. To address this shortcoming, we introduce PRecover, a data-driven post-processing approach to recover microphysical process rates that are not stored during runtime from standard output of a numerical weather prediction model. In particular, we train random forests, gradient boosting models, and feed-forward neural networks to recover microphysical process rates from a two-moment bulk microphysics scheme in the ICOsahedral Nonhydrostatic (ICON) model. We use cloud variables as input, obtained from high-resolution simulations in a limited-area setup over Europe. Warm-rain and ice microphysical process rates are recovered with a two-step classification-regression approach for both instantaneous and accumulated process rates. As a physics-based baseline, we assess whether process rates can be directly recalculated from stored ICON output variables. Accurate recalculation is possible for process rates such as accretion and self-collection but not for the autoconversion, rain melting or heterogeneous ice nucleation rate. Using PRecover, we successfully recover most of the process rates that are accumulated over output time steps of 10 minutes or less, but the values are increasingly difficult to recover for rates accumulated over longer accumulation intervals. To quantify predictive uncertainty, we provide calibrated prediction intervals through conformalized quantile regression. We demonstrate spatial transferability of the models with two case studies over different regional domains and simulation settings unseen during training.