Recognizing citrus in complex environment using improved YOLOv8n
-
Graphical Abstract
-
Abstract
Automated harvesting holds the promising potential in the informatization and automation of smart agriculture. Manual harvesting can be replaced to improve the high efficiency and cost saving. However, the existing models are limited to the large number of overlapping citrus fruits, branch and leaf occlusion in the complex environments, particularly for the much more parameters and high computational complexity. In this study, an improved YOLOv8-MEIN model was proposed for the citrus recognition using YOLOv8n. Firstly, the more efficient (ME) convolution module was designed to improve the CSPDarknet53 into 2-Stage feature pyramid network (C2f) module of YOLOv8n. The high number of common convolutional parameters and computation were reduced to fully meet the requirements of the real-time detection, due to the conditions of mobile hardware performance. The overall computation was also reduced to decrease the number of parameters in the model. Secondly, the Inner- complete intersection over union (CIoU) loss function was used to accelerate the bounding box regression for the better performance of the model. The weak generalization and slow convergence of the CIoU loss function were also compensated in the detection task. The reason was that the CIoU loss function only considered the shape loss in original YOLOv8, thus degenerating to the IoU loss function, when the target and bounding box were at the same aspect ratio. Finally, the YOLOv8-MEIN model was validated for the citrus detection. The YOLOv5s, YOLOv7-tiny, Faster R-CNN, and YOLOv8n models were selected for the training and testing on the citrus dataset. A comparison was carried out on the self-built citrus dataset under the same number of iteration rounds, in order to verify the performance of the YOLOv8-MEIN model. The YOLOv8-MEIN was improved 5.85 percentage points in the recall, 14.3 percentage points in the mean average precision (mAP) mAP0.5 metrics, and 34.8 percentage points in mAP0.5~0.95 metrics, compared with the two-stage Faster R-CNN, where the model size was only 5.1% of Faster R-CNN. Compared with the lightweight networks of YOLO series, such as YOLOv5s, YOLOv7-tiny, and YOLOv8n, the number of parameters was reduced by 59.1%, 52.3%, and 4.4%, respectively, whereas, the mAP0.5~0.95 values were improved by 0.7, 4.0, and 0.6 percentage points, respectively, indicating the outstanding mAP0.5. The experimental results show that the YOLOv8-MEIN model was achieved in the higher detection accuracy and efficiency with the smallest model volume and the lower computational cost. More convenient model was found for the transplantation and application suitable for the real-time target detection of citrus. The size of model and the number of parameters were reduced by 3.3 and 4.3 percentage points, respectively, compared with the mainstream target detections. The improved model was fully met the requirements for the fast and accurate recognition of citrus fruits in the complex environments. The finding can provide the technical references for the automated harvesting of citrus fruits.
-
-