基于弱监督语义分割的砀山梨表面缺陷识别方法

侯文慧; 郭丹丹; 周传起; 毛博; 饶元; 刘路; 王玉伟

doi:10.11975/j.issn.1002-6819.202410167

基于弱监督语义分割的砀山梨表面缺陷识别方法

Recognizing the surface defects of dangshan pears using weakly supervised semantic segmentation network

摘要

摘要: 为识别砀山梨表面缺陷，语义分割网络需要依赖大量精细的像素级标签，样本标注成本较高，导致其实际应用受到限制。针对上述问题，该研究提出两种像素级伪标签生成方法，基于包围框和点标注，借助人工经验的全局信息与局部统计信息生成精细的像素级标签；构建基于U-Net的轻量化语义分割网络，记为MCF-Unet，在骨干特征网络底层融合特征金字塔模块，以增强网络的边缘感知能力，在跳跃连接处增加CBAM（convolutional block attention module）注意力机制，以提高网络对目标信息的关注；采用自生成标签参与网络训练，实现砀山梨缺陷分割。试验结果表明，相较于其他深度学习网络模型，所构建的 MCF-UNet网络经两种弱监督数据集训练后具有较高的分割准确率及较强的鲁棒性，预测平均交并比分别达70.80%和72.94%。同时，可视化效果表明MCF-UNet模型能够在较低成本的弱监督训练后快速准确地识别砀山梨缺陷。该研究探索了弱监督深度学习在砀山梨缺陷识别中的应用，为水果无损检测领域的弱监督学习提供了参考。

Abstract: Surface defect of pears has been one of the most significant influencing factors on the fruit quality. The appearance of the fruit with surface defects can allow for the bacterial growth. Consequently, the defect recognition can be expected to serve as the quality grading of fruits. However, the surface defects visually resemble the numerous spots on the surface of pears. It is quite challenging in the task of defect recognition. Fortunately, deep semantic segmentation networks have been widely applied in the field of non-destructive testing in recent years. Nevertheless, the supervised semantic segmentation is typically required for a large number of finely pixel-level labels as the training samples. A considerable challenge is still remained on the defect recognition task of pears. Taking the Dangshan pears as the object, this study aims to recognize the surface defects using weakly supervised semantic segmentation network. Pixel-level pseudo-labels were generated using global information. The local statistical insights were derived from the human experience. The fine pixel-level labels were obtained to reduce the high cost of annotation. The defect samples of pears were firstly captured using mobile phones and industrial cameras. Image enhancement techniques were applied to increase the diversity of the samples. In the bounding box weak labels using global experience, the images were converted to the HSV color space using transformation. Histogram statistics were performed on each channel of this space. The thresholds were determined for the preliminary segmentation. The morphological processing was used to refine the pixel-level labels. In the point-level weak labels using global experience, a seed region growing algorithm was employed to segment the defect areas. The pixel-level pseudo-labels generated from the weak labels were used to create a weakly supervised semantic segmentation dataset. Subsequently, the rapid and accurate identification of surface defects were achieved in the Dangshan pears. A lightweight semantic segmentation network was constructed using U-Net, referred to as MCF-Unet. Feature Pyramid Network (FPN) was integrated at the bottom layer of the backbone feature network, in order to enhance the edge perception. Additionally, a Convolutional Block Attention Module (CBAM) was incorporated at the skip connection points, in order to improve the relevant target information. Finally, the segmentation of the network was validated using a self-built weakly supervised dataset. A comparison was made with the current models, such as DeepLabv3+, PSPNet, ResNet-U-Net, and VGG-U-Net. Experimental results indicated that the MCF-UNet network was achieved in the high segmentation accuracy and speed, after training on the two self-generated weakly supervised datasets. The mean Intersection over Union (IoU) of the predicted segmentation reached 70.80% and 72.94%, respectively. The training time of the MCF-UNet model was significantly reduced, compared with the more accurate VGG-U-Net network, with a prediction time of 0.055 s per frame. Visualization data demonstrated that the MCF-UNet model was rapidly and accurately identified the surface defects in Dangshan pears after low-cost weakly supervised training. Additionally, the two weak labels were suitable for the defect segmentation of Dangshan pears, compared with the graffiti annotation. The pixel-level pseudo-label generation was also combined with deep semantic learning. The weakly supervised deep learning was applied to recognize the surface defects of Dangshan pears. The finding can also provide the valuable insights in the non-destructive detection of fruits using weakly supervised learning.

HTML全文

参考文献(32)

施引文献

资源附件(0)