Abstract:
Pak choi (Chinese cabbage) is one of the most popular vegetables in China, with a wide planting area. Rapid detection and accurate identification of Pak choi insect pests are of great significance for ensuring a safe vegetable supply. However, in real vegetable field environments, insect pests often appear in specific areas of leaves, and strong illumination and background interference pose a great challenge to detection efficiency and accuracy. In this study, an improved model, YOLOPC, was proposed to identify Pak choi pests based on the YOLOv5s network framework. Firstly, a CBAM (Convolutional Block Attention Module) was placed at the input end of each CBS block (Convolution layer + Batch Normalization layer + SiLU activation layer). The resulting CBAM-CBS structure dynamically adjusts the weight of each channel and spatial position in the feature map. Upsampling and 1×1 convolution operations were used to adjust the size and channel number of the feature maps, in order to fuse features at different levels and, at the same time, enhance the feature representation of the model. The loss function was improved to better suit the accuracy requirements of bounding box regression. Atrous (dilated) convolution was used to enlarge the receptive field of the network, in order to better learn the contextual information of the image. Specifically, the improvement strategy included the following three aspects: 1) Spatial and channel attention mechanisms were added to feature extraction, so that the network could better learn the diamondback moth and leafminer pests of Pak choi; 2) The alpha-IoU loss function was used to replace the CIoU loss in YOLOv5s.
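The abstract does not give the paper's exact loss formulation or alpha value, so the following is only a minimal sketch of the core alpha-IoU idea: raising the IoU term to a power α > 1 (α = 3 is the commonly cited default) to up-weight well-localized boxes. The box format and the simple `L = 1 − IoU^α` form are assumptions for illustration, not the authors' full implementation.

```python
def iou(box_a, box_b):
    """Intersection over Union of two axis-aligned boxes (x1, y1, x2, y2)."""
    x1 = max(box_a[0], box_b[0])
    y1 = max(box_a[1], box_b[1])
    x2 = min(box_a[2], box_b[2])
    y2 = min(box_a[3], box_b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)

def alpha_iou_loss(box_a, box_b, alpha=3.0):
    # Basic alpha-IoU form: L = 1 - IoU**alpha. With alpha > 1 the gradient
    # is relatively larger for high-IoU boxes, sharpening regression accuracy
    # for targets at varied scales and aspect ratios.
    return 1.0 - iou(box_a, box_b) ** alpha
```

A perfectly overlapping prediction gives a loss of 0; as overlap shrinks, the loss approaches 1 faster than the plain IoU loss would.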
Different levels of bounding box regression accuracy were thus achieved for the diamondback moth and leafminer targets at different scales and aspect ratios; 3) Atrous Spatial Pyramid Pooling (ASPP) was introduced to enlarge the receptive field of the network, in order to better learn the contextual information of the image. The test results showed that the mean average precision (mAP) of the YOLOPC model reached 91.4%, an increase of 12.9 percentage points over the original YOLOv5s model. The frame rate was 58.82 frames/s, an increase of 11.2 frames/s, or 23.53%. The number of parameters was only 14.4 M, a decrease of 5 M, or 25.78%. Furthermore, compared with the SSD, Faster R-CNN, YOLOv3, YOLOv7, and YOLOv8 detection models, the mean average precision of the YOLOPC model was 20.1, 24.6, 14, 13.4, and 13.3 percentage points higher, respectively. Significant advantages were achieved in precision, recall, frame rate, and number of parameters. This improved model can provide technical support for the rapid and accurate detection of Pak choi pests under complex backgrounds. In short, both the accuracy and the real-time performance of the improved model were significantly optimized for Pak choi pest detection, and the YOLOPC model showed significantly improved parameter count, detection speed, and accuracy compared with current mainstream models. Therefore, camera equipment can be deployed to identify Pak choi pests and issue warnings in field planting. The improved model can also be applied on plant protection drones to realize precise variable-rate spraying and targeted pest control. The findings provide a strong reference for saving pesticides and effectively reducing environmental pollution.
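The receptive-field enlargement that atrous (dilated) convolution and ASPP provide can be illustrated with a toy 1-D sketch: a kernel of size k with dilation d spans (k − 1)·d + 1 inputs while keeping the same number of weights. ASPP runs several such branches with different dilation rates in parallel and fuses them; the function below is only an assumed, simplified illustration of one branch, not the paper's network code.

```python
def dilated_conv1d(x, kernel, dilation=1):
    """'Valid' 1-D convolution whose kernel taps are spaced `dilation` apart.

    With kernel size k and dilation d, each output sees (k - 1) * d + 1
    inputs, so the receptive field grows without adding parameters.
    """
    span = (len(kernel) - 1) * dilation  # receptive field minus one
    out = []
    for i in range(len(x) - span):
        out.append(sum(kernel[k] * x[i + k * dilation] for k in range(len(kernel))))
    return out
```

With a size-3 kernel, dilation 1 covers 3 neighboring inputs, while dilation 3 covers a 7-input window, which is the mechanism ASPP exploits at multiple rates to capture image context.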