Assessing global drivers of forest transpiration using clustered machine learning models
利用聚类机器学习模型评估全球森林蒸腾作用的驱动因素
Morgan Thornwell, David Yang, Cheng-Wei Huang, Peyman Abbaszadeh, Samantha Hartzell
AI总结 本文通过聚类机器学习模型分析全球森林蒸腾速率的驱动因素,发现不同生物群落和植物功能类型对环境变量的响应存在显著差异,揭示了蒸腾作用在不同气候条件下的调控机制。
详情
理解森林蒸腾作用的环境驱动因素对于改进全球水分可用性和生态系统健康预测至关重要。然而,由于植物水分胁迫和生态系统蒸腾作用受到多种控制因素的影响,这些驱动因素可能在适应当地气候条件的树种之间差异很大。在这里,使用SAPFLUXNET数据库,通过按生物群落和植物功能类型进行聚类的两种策略,分析了全球森林蒸腾速率的驱动因素。使用随机森林算法和神经网络算法对每个聚类进行预测。分析了每种模型的性能和特征重要性,并将其与评估每个聚类性能的环境变量进行比较。通过定义站点聚类,这些模型能够预测广泛地理区域和树种的蒸腾作用及其环境驱动因素。与训练整个数据集的模型相比,高性能的聚类模型在测量数据上的R²值在0.74至0.90之间,其中在最多36个站点的中等大小聚类中达到最高性能。不同聚类之间特征重要性存在显著差异,表明蒸腾作用的关键预测因子在植物功能类型和生物群落之间变化强烈。总体而言,水分受限的气候更受土壤湿度控制,而高年均温度的气候则更受太阳辐射控制,对空气温度的依赖性较低。这些发现提供了关于森林蒸腾作用如何响应环境因素的见解,范围涵盖了广泛的气候类型和树种。
Understanding the environmental drivers of forest transpiration is critical for improving global predictions of water availability and ecosystem health. Due to many competing controls on plant water stress and ecosystem transpiration, however, these drivers may vary widely across tree species which have adapted hydraulically to local climate conditions. Here, clustered machine learning models were used to analyze global drivers of forest transpiration rates using the SAPFLUXNET database. Sap flux data from a total of ninety-five sites spanning seven biomes were grouped using two clustering strategies: by biome and by plant functional type. Two supervised machine learning algorithms, a random forest algorithm and a neural network algorithm, were used to predict rates of sap flux for each cluster. The performance and feature importance in each model were analyzed and compared to evaluate the environmental variables that control each cluster's performance. By defining site clusters, these models are able to predict transpiration and its environmental drivers across a wide variety of geographical sites and tree species. Unlike models trained on the entire dataset, high-performing clustered models achieved R$^2$ values to measurement data in the range of 0.74 to 0.90, with the highest performance being achieved in mid-sized clusters of up to thirty-six sites. There was high variance in feature importance between clusters, indicating that key predictors of transpiration varied strongly across both plant functional type and biome. Overall, water-limited climates tended to be more controlled by soil moisture, whereas climates with high mean annual temperature tended to be more controlled by solar radiation and less dependent on air temperature. These findings provide insights into how forest transpiration responds to environmental factors across a wide range of climate types and tree species.