自然环境下多类水果采摘目标识别的通用改进SSD模型

彭红星; 黄博; 邵园园; 李泽森; 张朝武; 陈燕; 熊俊涛

doi:10.11975/j.issn.1002-6819.2018.16.020

摘要: 为解决当前自然环境下水果识别率不高、泛化性不强等问题，该文以苹果、荔枝、脐橙、皇帝柑4种水果为研究对象，提出了一种改进的SSD（single shot multi-boxdetector）深度学习水果检测模型：将经典SSD深度学习模型中的VGG16输入模型替换为ResNet-101模型，并运用迁移学习方法和随机梯度下降算法优化SSD深度学习模型。该文基于Caffe深度学习框架，对自然环境下采集的水果图像进行不同网络模型、不同数据集大小和不同遮挡比例等多组水果识别检测效果对比试验。试验表明：改进的SSD深度学习水果检测模型对4种水果在各种环境下的平均检测精度达到88.4%，高于经典SSD深度学习模型中的86.38%，经过数据增强后平均检测精度可提升至89.53%，在遮挡面积低于50%的情况下F1值能达到96.12%，有较好的泛化性和鲁棒性，可以很好地实现自然环境下多类水果的精准检测，可为农业自动化采摘中的水果识别检测问题提供新的方案。

Abstract: Abstract: China is the leading country in the world for the production of fruits, and the variety of fruits is very wide. But fruits harvesting heavily depends on manual hand picking, and it's time-consuming, low efficient and labor-intensive. Fruit picking robot can realize the automation of fruit picking operation and solve the problems of shortage of labor force and high cost. Fruit identification with machine vision is the primary task. But in the field environment, fruit images are easily affected by many external environmental factors such as light changes, fruit size difference, complicated background noise, which can reduce the identification accuracy of fruit with traditional fruit recognition algorithm. And without general feature extraction model, traditional fruit recognition algorithm can only focus on one specific fruit. Deep learning algorithm has the advantages of strong non-linear feature expression ability, and good generalization performance, and can avoid the subjectivity and limitation of human selection on feature selection. In order to solve the problems of low recognition rate and weak generalization for fruit recognition in the field environment, with the apple, litchi, navel orange, Huangdi gan as the research object, an improved single shot detector (SSD) deep learning model for fruit detection is put forward in this study. That is to use ResNet-101 model to replace the VGG16 network in the classic SSD detection framework. After the replacement, the framework still uses 6 feature extraction layers to predict the type and location of fruit objects at each layer; then the weight model under the large data sets is transplanted to multi-class fruit detection tasks with the method of transfer learning. The SSD deep learning model is optimized by using SGD (stochastic gradient descent) algorithm. The weight model of the pre-training on the ImageNet data set is used as the initial weight model of the SSD detection framework, and the training time and resources are further reduced by transferring the characteristics of the learning. At the same time, data enhancement method is used to improve the robustness of the algorithm without reducing the detection accuracy. Based on the Caffe deep learning framework, fruit detection results are compared for the multi-class fruit pictures collected in the field environment with different network models, different data set sizes and different occlusion ratios. Experimental results show that after a day of training, the residual error reference model takes about 0.14 s when detecting the image with a resolution of 500×500 pixels, only about 0.09 s slower than the VGG16 network model. And in various environments, the average detection accuracy of the 4 kinds of fruit based on the improved SSD deep learning fruit detection model can reach 88.4%, which is higher than that of the classic SSD deep learning model that is 86.38%. After data are enhanced, the average detection accuracy can be improved by 1.13 percentage points and reach 89.53%, and the F1-score can reach 96.12% when the occlusion area is lower than 50%. Therefore, compared with the traditional recognition algorithm, this method based on improved SSD model can realize multi-class fruit image detection simultaneously without artificial feature selection for different fruit images, and has better generalization and robustness. It can achieve accurate detection of multiple kinds of fruits in the field environment, and provides a new solution for the problem of fruit detection and recognition in agricultural automation.

自然环境下多类水果采摘目标识别的通用改进SSD模型

General improved SSD model for picking object recognition of multiple fruits in natural environment