土地覆盖类别面积混淆矩阵校正与回归遥感估算方法对比

    Comparison analysis on land cover area estimators: confusion matrix calibration and regression

    • 摘要: 混淆矩阵校正和回归估算是2类常用的,以遥感数据作为辅助的区域面积总量估算方法,但它们的相对效果及适用条件并不明确。该文根据遥感影像分类误差的分布规律模拟生成不同总体分类精度的图像,并在此数据的基础上,采用统一的抽样方案,以简单抽样反推方法为对比基准,探讨在不同的总体分类精度和不同的样本量下区域面积总量遥感估算方法的精度及稳定性。试验得到以下结论:1)相比于简单抽样反推方法,混淆矩阵校正和回归估算具有更好的准确性和稳定性;2)在各总体分类精度和样本量下,简单回归估计、比估计均具有良好的准确性和稳定性,相比于混淆矩阵校正,回归类方法略具优势;3)当遥感数据分类精度较低(≤60%)时,混淆矩阵逆校正的准确性和稳定性与简单抽样反推方法无明显区别,因此在此种情况下应尽量避免使用混淆矩阵逆校正方法。

       

      Abstract: Abstract: Remote sensing can provide timely, economic and objective data with a large scope, and thus has become the main data source for land cover/use area estimation. However, the classification result cannot be directly used to calculate the area of a given land cover type because of the commission and omission errors. Confusion matrix calibration (direct estimator and inverse estimator) and regression (ratio estimator and simple regression estimator) are two commonly used approaches by which one can estimate the accurate area of a given land cover type from a classification map. However, their applied conditions and relative efficiency are unclear.  The two approaches (including four specific estimators) were compared with a simple expansion estimator, which is regarded as the basic estimator in a comparison process, by using the same simulated images under a unified sampling scheme. The main purpose is to explore the accuracy and stability of each estimator in different overall classification accuracy and sampling ratios. The simulated images were produced by adding different amounts of errors into the classification map of real remote sensing data. These errors fall into two categories. One is the errors within patches, which result from the similarity of spectral information. The other is the errors located around the boundary of different land cover types, because of the misclassification of mixed pixels. According to the distribution rule of remote sensing classification error and heterogeneity, we calculated the error possibility for each pixel, and then altered the class label of error pixels. There were four different images we simulated by a procedure of IDL 7.0 to represent various overall accuracy levels of classification in practical cases. Then, two evaluation criteria were adopted, with average absolute relative bias, indicating the accuracy of estimation, and coefficient of variance, representing the stability of estimation.  The results suggest that: (1) Confusion matrix calibration and regression can provide more accurate and stable estimates than simple expansion estimator does, which means that the simulated images, also the remote sensing classification map in practical cases, play an positive role in estimation, especially in improving the stability degree. (2) In the experiments with different overall classification accuracy and sampling ratio, the two evaluation criteria value of simple regression estimator and ratio estimator are extremely close, and they all show higher accuracy and stability compared with confusion matrix calibration methods, which indicates that regression methods have more advantages to some degree. In addition, confusion matrix calibration methods are more sensitive to classification accuracy than that of regression methods. The reason is that confusion matrix calibration methods are interested in not only the amount of pixels which are classified correctly, but also the land cover type which one error pixel falls into. Regression methods use the information from sampling units more comprehensively, and the relationship between samples and auxiliary data stands for the general connection between ground truth and remote sensing classification, and also reduces the influence of an individual sample. (3) When the overall accuracy of classification is low (≤60%), estimates from the inverse estimator are similar to or even worse than the simple expansion estimator. Therefore, an inverse estimator should not be used under this condition.

       

    /

    返回文章
    返回