基于超分辨率增强与改进YOLOv8的番茄叶片病害检测

    Tomato leaf disease detection method based on super-resolution enhancement and improved YOLOv8

    • 摘要: 针对番茄叶片病害检测面临的小目标病害易漏检、病害种类易混淆和足量病害样本难收集等问题,该研究提出一种基于超分辨率增强和改进YOLOv8的番茄叶片病害检测模型。针对小目标病害分辨率低、检测困难的问题,采用超分辨率生成对抗网络(super-resolution generative adversarial network, SRGAN)对番茄叶片图像进行增强,通过超分辨率重建增强小目标病害的细微特征,以提升小目标病害检测精度;针对病害特征相似,小目标病害特征不明显的问题,构建了一种检测模型G-YOLOv8,通过融合小目标检测层Upsample-Concat-C2f与三重注意力机制Triplet Attention,提高模型对细微特征的关注能力;针对实际检测中面临的病害样本不足问题,利用公用数据集进行模型预训练,获取最优初参数,借助迁移学习提高检测模型在自然环境背景中的番茄病害检测精度。在早疫病、晚疫病和灰霉病三种病害数据集上进行检测试验,结果表明,YOLOv8网络在超分辨率增强后的早疫病害检测的平均精度均值达95.4%,相较于未处理的早疫病害公共数据集,提升了4个百分点;所提出的G-YOLOv8网络在迁移学习后平均精度均值达97.8%,相比原YOLOv8网络提升了3.2个百分点。该研究为番茄叶片视觉检测,尤其是多类病害区分及小目标病害识别提供了一种可行性的解决方案和技术支持。

       

      Abstract: Visual features of different disease types can often exhibit high similarity in the detection tasks of the tomato leaf diseases. Particularly, some challenges still remained for the small-target diseases. Spatial areas are limited in the indistinct features to accurately identify. Deep learning models can be a great potential for plant disease detection with their powerful feature extraction. However, it is typically required for a substantial amount of the high-quality training samples. It is still lacking in the large-scale and well-annotated disease samples in practice. In this study, an improved version of the YOLO v8a model was proposed to detect tomato leaf diseases using super-resolution enhancement. A Super-Resolution Generative Adversarial Network (SRGAN) was employed to enhance the images of tomato leaves. The super-resolution reconstruction effectively amplified the subtle features of the small target diseases, thereby improving the detection accuracy. Furthermore, the detection model named G-YOLO v8 was developed to detect similar disease features and the indistinct characteristics of small target diseases. A small target detection layer (Upsample-Concat-C2f) was integrated with a Triplet Attention module. The small-target detection layer was incorporated with upsampling, feature concatenation, and integration operations. The representational capacity was then achieved for the small-target features. A triplet attention mechanism was incorporated to capture the cross-dimensional dependencies among three feature branches. The network was significantly improved to identify the subtle differences among various disease types. The public datasets were pre-trained to obtain the optimal initial parameters. The insufficient disease samples were addressed in the practical scenarios. Additionally, transfer learning was employed to enhance the accuracy of the detection. Tomato diseases were identified within natural environmental backgrounds. Comparative experiments were conducted on both public and self-constructed datasets. Three types of diseases included the early blight, late blight, and gray mold. Visualization results demonstrated that the disease samples from the public dataset were significantly enhanced after SRGAN network processing. Among them, the small-target diseases were effectively magnified from the public dataset. The YOLO v8 network successfully identified the previously undetectable features of the small-target disease. The quantitative results show that the average accuracy of the YOLOv8 network in the detection of early blight after super-resolution enhancement reaches 95.4%, which is 4 percentage points higher than that of the unprocessed public dataset of early blight. The proposed G-YOLOv8 network has an average accuracy of 97.8% after transfer learning, which is 3.2 percentage points higher than that of the original YOLOv8 network. This finding can offer a viable solution and technical support to the visual detection of the tomato leaves, particularly for the multiple types and small target diseases.

       

    /

    返回文章
    返回