LI Xuqing, WU Dongxue, WANG Yubo, et al. Improved HRNet semantic segmentation model for remote sensing images of winter wheat[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2024, 40(3): 193-200. DOI: 10.11975/j.issn.1002-6819.202308134
    Citation: LI Xuqing, WU Dongxue, WANG Yubo, et al. Improved HRNet semantic segmentation model for remote sensing images of winter wheat[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2024, 40(3): 193-200. DOI: 10.11975/j.issn.1002-6819.202308134

    Improved HRNet semantic segmentation model for remote sensing images of winter wheat

    • Timely and accurate acquisition of winter wheat distribution is beneficial to the estimation of grain yield and optimization of planting structure. However, the previous means of acquisition cannot fully meet the needs of large-scale applications in recent years. Deep learning can be expected to introduce into agricultural remote sensing, in order to obtain the spatial distribution of agricultural resources, and further realize the development of agricultural modernization. Taking the winter wheat as the target object, the high-resolution Gaofen-2 remote sensing images were utilized to extract the winter wheat over a large area. Winter wheat was scattered distribution with the different field shapes in the remote sensing images. At the same time, the complex features were contained in the remote sensing images, leading to the interference in the recognition of winter wheat. In this study, a CAHRNet semantic segmentation network model was proposed to enhance the recognition accuracy and fuzzy details of field edges in the spatial distribution of winter wheat. The improved model first replaced the backbone feature network from ResNet with a high-resolution network HRNet. A unique parallel structure was connected to the multiple layers of sub-networks. The multiple sub-networks were continuously interacted with each other to repeatedly fuse the feature generated by multi-scale sub-networks. The spatial feature was enriched to rapidly locate the target object. The network was then maintained to extract the high-resolution feature. Secondly, the CAHRNet model was used to extract the spatial distribution of the winter wheat. The OCR module was introduced to aggregate the contextual feature of the target object. The extracted feature was classified by the backbone network. The aggregation of all the object regions was then evaluated to divide the regions into different categories. The correlation between the regions and the pixel points was enhanced to clarify the approximate location of the target object. Lastly, coordinate attention was introduced to capture the remote dependencies and retain the precise location of the target object, according to the dual spatial orientations. The correlation between multiple pixel points was enhanced to more accurately locate the exact position of the target object. The coordinate attention enhanced the spatial features of the target object and then reduced the detail loss caused by the model during feature extraction. Much more details were obtained than before in the prediction. The CAHRNet network model enhanced the supervision of the spatial features of the target object. The spatial relationship between the winter wheat and background pixels was established to enhance the region-pixel correlation. At the same time, the contextual feature of individual pixel points was utilized to enhance the pixel-point to pixel-point attention. The edge details of the target object region were preserved to highlight the overall features of winter wheat. The segmentation accuracy was improved to realize the fine extraction of winter wheat spatial distribution. The experimental results show that the optimal evaluation was achieved in each index on the high-resolution remote sensing dataset, with the MIoU and PA reaching 81.72% and 97.08%, respectively. The MIoU performance was improved by 9.90 and 2.44 percentage points, respectively, compared with the typical U-Net and DeepLabv3+, while the PA model was improved by 6.80 and 1.59 percentage points, respectively. The finding can provide technical support for the further acquisition of winter wheat crop distribution.
    • loading

    Catalog

      /

      DownLoad:  Full-Size Img  PowerPoint
      Return
      Return