CNN-ISS遥感影像分类的瓦片边缘效应及消除方案

    Tile edge effect and elimination scheme of image classification using CNN-ISS remote sensing

    • 摘要: 应用卷积神经网络语义分割模型(Image Semantic Segmentation based on Convolutional Neural Network,CNN-ISS)进行遥感影像分类时,需将大幅影像分解为特定大小瓦片影像,并将其作为CNN-ISS处理对象,这一过程破坏了位于瓦片边缘处地物的完整几何及纹理特征,从而影响瓦片边缘处地物的识别效果,即瓦片边缘效应。该研究以DeepLab V3为CNN-ISS核心模型,对唐山农村地物进行语义分割,定量分析了分类结果的瓦片边缘效应,并提出了5个消除此效应的后处理方案。结果表明:像素分类精度与像素到瓦片边缘距离正相关,瓦片边缘处错误率最高达6.93%,中央处错误率最低为3.52%,存在瓦片边缘效应;采用该研究提出的瓦片边缘效应消除方案后,整幅影像的总精度(Pixel Accuracy,PA)、均交并比(Mean Intersection over Union,mIoU)和Kappa系数均有提升,最高分别提升0.40、1.97个百分点和0.012 2。在不改变CNN-ISS核心模型条件下,通过该研究的瓦片边缘效应消除后处理方案,可有效提升遥感影像分类精度,尤其针对复杂异构体和线状地物精度提升效果更好。

       

      Abstract: Abstract: Semantic segmentation of an image has become a key interdisciplinary application in image processing, computer vision, pattern recognition, and artificial intelligence. In deep learning architectures, the Convolutional Neural Network for Interferometric Semantic Segmentation (CNN-ISS) is widely used in digital image processing and machine vision. The CNN-ISS can be utilized to effectively extract further features, such as texture and geometric features, indicating stronger transfer learning and generalization, compared with traditional image classifications of remote sensing. As such, the CNN-ISS is suitable for the interpretation of high-resolution remote sensing images, identification of complicated features, and crop mapping. In classification, large remote sensing images need to be segmented into specific tiled images, thereby serving as the object of Convolutional Neural Network (CNN) processing. However, artificial image tiling can generate fragments on the edge of a tile, leading to the low classification accuracy of pixels near the edge of the tile. Here, the phenomenon was defined as the edge effect of tiled images, where the classification accuracy of pixels near the edge of the tile was lower than that of the central area. In this work, two indicators were designed, including the error rate with a distance to tile edges (ERD), and the error rate of the whole image (ERW), to quantify the edge effect of CNN-ISS processed tiled images. Meanwhile, the offset positions (i, k) were set for the starting point of the shift window to ensure that any pixel on the whole image must be in the central area of the tile generated under a certain offset setting. Then, five technical solutions were obtained to test the minimized edge effect of tiled images using the scores in multiple groups of categories. Taking the Tangshan as the segmented typical rural surface, a DeepLab V3 was selected as the core model of CNN-ISS to analyze the edge effect of the classification. The results showed that the pixel classification accuracy was positively correlated with the distance from the pixel to the edge of a tiled image. The highest error rate was 6.93% occurred along the edge of the tiled image, and the lowest error rate was 3.52% in the center of the tile, indicating the accuracy of the central area was higher than that of the edge. It showed an obvious edge effect of tiled images. In edge effect elimination scheme for the tiled images, the total classification accuracy improved significantly, where the Kappa coefficient and Mean Intersection over Union (mIoU) of the entire image increased 0.012 2 and 1.97 percent point, respectively. Taking the Kappa coefficient, one of the classic accuracy indices for the remote sensing image interpretation, as an example, the order of accuracy including the control group was: solution 2 (0.881 0)> solution 5 (0.878 9) > solution 3 (0.878 8) > solution 4 (0.877 7) > solution 1 (0.875 9) > the control group (0.868 8). Besides, the solutions of edge effects depended mainly on the types of features in the tiled images. The general law was that the tile edge effects of linear features and complex isomers (pit ponds, rural residential areas) were more obviously improved the accuracy, as the solutions were more significantly accurate, compared with that of the base land, or another agricultural land. Compared with the control group, the improvement order of IoU in the solution 2 was: roads (4.13 percent point) > pit ponds (2.97 percent point) > rivers and ditches (1.61 percent point) > rural residential areas (0.65 percent point) > another agricultural land (0.46 percent point). Without changing the core model of CNN semantic segmentation, the elimination scheme for the edge effect of a tile can be used to effectively improve the accuracy of remote sensing image classification, especially for the linear features and complex isomers.

       

    /

    返回文章
    返回