赵越, 徐大伟, 范凯凯, 李淑贞, 沈贝贝, 邵长亮, 王旭, 辛晓平. Landsat 8和机器学习估算蒙古高原草地地上生物量[J]. 农业工程学报, 2022, 38(24): 138-144. DOI: 10.11975/j.issn.1002-6819.2022.24.015
    引用本文: 赵越, 徐大伟, 范凯凯, 李淑贞, 沈贝贝, 邵长亮, 王旭, 辛晓平. Landsat 8和机器学习估算蒙古高原草地地上生物量[J]. 农业工程学报, 2022, 38(24): 138-144. DOI: 10.11975/j.issn.1002-6819.2022.24.015
    Zhao Yue, Xu Dawei, Fan Kaikai, Li Shuzhen, Shen Beibei, Shao Changliang, Wang Xu, Xin Xiaoping. Estimating above-ground biomass in grassland using Landsat 8 and machine learning in Mongolian plateau[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2022, 38(24): 138-144. DOI: 10.11975/j.issn.1002-6819.2022.24.015
    Citation: Zhao Yue, Xu Dawei, Fan Kaikai, Li Shuzhen, Shen Beibei, Shao Changliang, Wang Xu, Xin Xiaoping. Estimating above-ground biomass in grassland using Landsat 8 and machine learning in Mongolian plateau[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2022, 38(24): 138-144. DOI: 10.11975/j.issn.1002-6819.2022.24.015

    Landsat 8和机器学习估算蒙古高原草地地上生物量

    Estimating above-ground biomass in grassland using Landsat 8 and machine learning in Mongolian plateau

    • 摘要: 草地地上生物量(Above-Ground Biomass,AGB)是反映草地植被利用状况的重要参数,其精准监测对于草地科学管理与合理利用具有重要意义。近年来,遥感技术因其能快速、准确获取大尺度草地光谱信息,已经被广泛应用于草地地上生物量的估算中。该研究以中国内蒙古呼伦贝尔市与其毗邻的蒙古国东方省草原区为研究区,利用Landsat 8数据计算的9种植被指数、气象数据和地面调查数据,比较分析6种机器学习算法构建的回归模型性能,重新构建优化的随机森林回归模型。结果表明,以光谱、降水量、气温为特征的优化后的随机森林回归模型性能更稳定,预测值与实测值之间决定系数为0.801,均方根误差为43.709 g/m2,相对均方根误差为23.077%。研究区域地上生物量呈中部较低,东西两侧较高的空间分布特征,最高可达357.2 g/m2,最低为33.01 g/m2,与该区域降水量与草地利用方式的空间异质性密切相关。该研究表明,基于Landsat 8数据结合气象数据构建的机器学习模型在草地生物量遥感反演中有较大潜力,地上生物量反演结果可以为草地资源合理利用与评价提供参考。

       

      Abstract: Abstract: Above-Ground Biomass (AGB) is one of the most important indicators to reflect the status of grassland use. Accurate and rapid monitoring is of great significance to scientific management and rational use. Alternatively, remote sensing technology has been widely used to estimate the AGB in recent years. However, the estimation errors can often be caused by the common phenomenon of "same spectrum, different species" in remote sensing. One of the potential solutions can be to use the spectral and meteorological data to invert the AGB grassland. In this study, a machine learning model was developed to characterize the spectral indices and meteorological data using Landsat 8 remote sensing and ground survey as data sources. A systematic investigation was implemented to explore the performance of regression models constructed by five machine learning algorithms. Specifically, the AGB of grassland was estimated to obtain the high accuracy inversion of remote sensing for the grassland biomass. Nine vegetation indices were selected to calculate in Hulunbuir of Inner Mongolia and Dornod of Mongolia in China. An optimal Random Forest (RF) regression model was then reconstructed by feature selection. The regression validation revealed that a similar overall performance was achieved in the six machine learning models. But the lower performance was found in the spectral data as the input only (Root Mean Square Error (RMSE): 63.852-87.944 g/m2, relative Root Mean Square Error (rRMSE): 33.712%-46.432%, coefficient of determination (R2): 0.388-0.647). Furthermore, the error of all regression decreased gradually, as the number of features increased in the data combination. The model fitting ability increased gradually as well, indicating that the increasing number of features in the different regression models was effectively handled through the fusion of multiple data inputs. The best evaluation was obtained from each regression model in the data combination of spectra + precipitation + temperature. The RF also obtained the best performance (RMSE=51.702 g/m2, rRMSE=27.297%, and R2=0.749). The weights of the multiple source data in the model were determined to assess the relative importance of the input data. The results showed that the precipitation was the most important input feature of the model, with a maximum weight of more than 0.1, much higher than the other spectral data. Three vegetation indices of VARI, MSAVI, and GEMI in the spectral data were weighted more than 0.09 as the features, which was higher than the rest. The more stable performance was achieved in the optimized RF regression model, with a correlation coefficient (R2) of 0.801 between predicted and measured values, an RMSE of 43.709 g/m2, and an rRMSE of 23.077%. The AGB spatial distribution in the study area was lower in the central area, but higher on the east and west sides, with a maximum of 357.2 g/m2 and a minimum of 33.01 g/m2. It was closely related to the spatial heterogeneity of climate and grassland use patterns.

       

    /

    返回文章
    返回