基于改进YOLOv8的田间复杂环境下蓝莓成熟度检测

    Detecting blueberry maturity under complex field conditions using improved YOLOv8

    • 摘要: 为了快速精确识别田间复杂环境下的蓝莓果实的成熟度,该研究基于YOLOv8提出了一种蓝莓成熟度轻量化检测模型(MSC-YOLOv8)。首先,为了有效减少参数量,提高模型的运行速度,采用MobileNetV3为主干特征提取网络进行特征信息的提取,有利于田间复杂环境下的检测。其次,在主干特征提取网络中插入卷积注意力机制模块(convolutional block attention module,CBAM),以提高深度学习网络对蓝莓特征提取的能力。最后,引入SCYLLA-IoU(SIoU)作为YOLOv8的边界框回归损失函数,以解决真实框与预测框角度不匹配的问题,进一步提高蓝莓成熟度识别的准确率。通过试验得出改进的MSC-YOLOv8模型相较于YOLOv8平均精度均值(mean average precision,mAP)提升了3.9个百分点,单张图片平均检测时间比原YOLOv8减少了3.97 ms。改进的MSC-YOLOv8模型在蓝莓数据集上取得了较优的结果,与SSD和CenterNet模型对比,mAP分别提升了4.6和1.1个百分点,在检测速度和准确率方面均有优势。该研究可为田间复杂环境下蓝莓机器人采摘提供技术支持。

       

      Abstract: To rapidly and accurately identify blueberry fruit maturity under complex field conditions, a lightweight detection model for blueberry maturity (MSC-YOLOv8)was proposed based on YOLOv8n. YOLOv8n, a recent addition to the YOLO series, offers the fastest detection speed and the lightest architecture, albeit with slightly lower recognition accuracy. Considering these attributes, YOLOv8n was chosen as the foundational model for blueberry maturity recognition. Compared to traditional object detection algorithms, YOLOv8 boasts a significantly faster detection speed. By using a single forward propagation to obtain detection results for all targets simultaneously, YOLOv8 achieves low latency, making it suitable for real-time applications. Additionally, YOLOv8 demonstrates high detection accuracy, rivaling advanced object detection algorithms, and excels particularly in detecting small targets. These advantages make YOLOv8 a superior choice for this application. Before training the model, the original blueberry datasets need to be augmented using Deep Convolution Generative Adversarial Networks (DCGAN) to generate additional blueberry images.This aims to generate diverse blueberry images to enhance the datasets, thereby improving the model's capability for detecting and recognizing blueberries in complex field environments. Firstly, to reduce the number of parameters and improve the model's running speed, MobileNetV3 was employed as the backbone feature extraction network, which was conducive to detection in complex field environments. Secondly, to enhance the network's ability to extract features from blueberries, the Convolutional Block Attention Module (CBAM) was integrated into the backbone feature extraction network. Finally, by introducing SCYLLA-IoU (SIoU) as the bounding box regression loss function for YOLOv8n, the issue of angle mismatch between the ground truth and predicted boxes was addressed, further improving the accuracy of blueberry maturity identification. In terms of the backbone feature extraction network, compared with MobileNetV3, ShuffleNet, VanillaNet, and YOLOv8's own backbone CspdarkNet, the experimental results showed that MobileNetV3 as the backbone network of YOLOv8 had the highest mAP value. Additionally, CBAM demonstrated the best mAP performance in various attention mechanisms such as CBAM, SE, ShuffleAttention and MHSA. When assessing bounding box regression loss functions, SIoU outperformed GIoU, EIoU, and CIoU in terms of mAP. Through three key improvements, this study proposed a lightweight network model, MSC-YOLOv8, for blueberry fruit maturity detection,ensuring the real-time performance and accuracy. The results showed that the improved MSC-YOLOv8 model was 3.9 percentage points higher than that of YOLOv8. The average detection time was reduced by 3.97 ms compared with the original YOLOv8. Compared with SSD and CenterNet models, the improved model achieved better results on the blueberry dataset, and the average precision mean mAP is increased by 4.6 and 1.1 percentage points, respectively, which has advantages in detection speed and accuracy. This research provides technical support for blueberry picking robots to perform picking work under complex conditions in the fields.

       

    /

    返回文章
    返回