改进YOLOv5s和迁移学习对番茄穴盘苗的分级检测

    Grading detection of tomato hole-pan seedlings using improved YOLOv5s and transfer learning

    • 摘要: 为了提高番茄穴盘苗分级检测精度,该研究提出了改进YOLOv5s目标检测模型,并通过迁移学习对番茄穴盘病苗识别精度进行优化。采用轻量级网络EfficientNetv2的Backbone部分作为特征提取网络,保留YOLOv5s中的SPPF空间金字塔池化模块,压缩模型参数数量以减少计算量;更改模型Neck部分原始上采样模块为CARAFE轻量级上采样模块,在引入很少参数量的情况下提高模型精度;同时将PANet替换为BiFPN,引入特征权重信息,增强不同尺度特征融合能力;引入有效多尺度注意力机制(efficient multi-scale attention,EMA),提高对番茄苗的关注,减少背景干扰;替换CIoU损失函数为SIoU损失函数,考虑真实框与预测框之间的方向匹配,提高模型收敛效果。试验结果表明,改进的YOLOv5s目标检测模型经过迁移学习训练后,平均精度均值达到95.6%,较迁移学习前提高了0.7个百分点;与原YOLOv5s模型相比,改进YOLOv5s模型平均精度均值提升2.6个百分点;改进YOLOv5s模型的参数量、计算量和权重大小分别为原YOLOv5s模型的53.1%、20.0%和53.6%,便于后期将模型部署到边缘设备中;与Faster-RCNN、CenterNet及YOLO系列目标检测模型相比,改进YOLOv5s模型在检测精度和检测速度方面均有明显优势,该研究成果可以为穴盘苗的分级检测提供依据。

       

      Abstract: Grading is one of the most important steps in the tomato hole-pan seedlings. In this study, an improved YOLOv5s target model was proposed to optimize the recognition accuracy of tomato hole-pan diseased seedlings using transfer learning. The backbone part of the lightweight network (EfficientNetv2) was used as the feature extraction. The spatial pyramid pooling fusion (SPPF) module in YOLOv5s was retained to compress the number of model parameters, in order to reduce the amount of computation; The lightweight up-sampling module of CARAFE was used to introduce a small number of parameters in the Neck part of the model; And the PANet was replaced with BiFPN to optimize the accuracy of tomato hole-pan seedling identification using transfer learning. The feature weight information was introduced to enhance the fusion of features at different scales; The efficient multi-scale attention mechanism (EMA) was introduced to increase the attention for the tomato hole-pan seedlings, and reduce the background interference; The CIoU loss function was replaced by the SIoU loss function to improve the model accuracy. Direction matching between the real and predicted frame was considered to improve the convergence of the model. The mean average accuracy of the improved YOLOv5s target model reached 95.6% after training by transfer learning, which was 0.7 percentage points higher than before; The improved YOLOv5s model was 53.1% of the number of parameters in the original model, and the computation was 20.0% of the original with only 3.20G, while the weights were 53.6% of the original and the weight size was only 7.35MB, and the mean average accuracy was improved by 2.6 percentage points, compared with the original; The visualization show that the stronger feature extraction was achieved in the improved model. The feature weights were centrally distributed in the center of the burrow and the edge of the burrow holes. Meanwhile, the heat map drawn with the GradCam showed that the improved YOLOv5s model was focused mainly on the burrow seedling itself and the edge of the burrow hole, which reduced the interference of the substrate; The ablation test verified that the improved YOLOv5s model was performed the best in the detection accuracy and the frame rate, compared with the Faster-RCNN, CenterNet, and YOLO series. The findings can provide a strong reference for the grading detection and subsequent deployment of hole tray seedlings.

       

    /

    返回文章
    返回