基于轻量级SSD模型的夜间金蝉若虫检测

    Detecting nocturnal golden cicada nymphs using lightweight SSD model

    • 摘要: 为实现夜间树上金蝉若虫的快速准确检测,该研究以自然环境图像数据集为研究对象,结合近距离实际应用场景,考虑到嵌入式系统的模型小型化和计算过程轻量化,在保持精度指标基本不变的前提下,基于适度削减模型深度、宽度的思路对已有目标检测网络MobileNet-SSD提出改进。具体措施包括:删除骨干网络末端的小尺寸特征图卷积层,逐级裁剪模型整体宽度、适当增加中高层卷积深度,在目标检测的分类层和预测框的回归层中使用深度可分离卷积代替传统3×3卷积等措施,先后获取3种改进的精简模型以进行比较。夜间图像测试结果表明,在基本保持网络性能的前提下,改进后的模型大小及计算量均呈现大幅减小,其中最优模型大小从原MobileNet-SSD的15.22 MB减少到1.51 MB,模型的浮点运算量也由原先的1.13×109减少到1.26×108,其平均准确率达90.46%,平均交并比达83.52%,F1分数达92.35%,GPU上的检测速度达179.3帧/s,CPU上的检测速度达到6.49帧/s,与改进前的模型相比具有更好的综合性能,白天图像的试验结果也显示出较好的泛化性能。该文提出的改进模型在大幅减少模型大小及其计算量的同时使模型性能保持在一个较高的水平,更适合部署在移动终端等资源受限设备上,可为金蝉的人工养殖提供有益参考。

       

      Abstract: Abstract: Golden cicada nymphs have been one of the most commonly edible insect species in the world, due to the extremely high medicinal and nutritional value with the unique taste. Therefore, the breeding of golden cicada nymphs can be brought the high economic value in food production. In this study, a rapid and accurate detection was proposed for the golden cicada nymphs in the natural environment. 366 images were captured from the Jiangsu University campus, Zhejiang Province, China, and another 180 images were from the green belt of residential area, and the downloaded from the Internet as supplement. Three procedures were proposed to realize the rapid and accurate detection of golden cicada nymphs, while reducing the model size and computational load. The first procedure was to reduce the number of branches of the original MobileNet-Single-Shot multibox Detection (SSD) network. Six SSD branches were used to detect the objects, each of which was corresponding to the different size of the feature map in original model. A relatively simple task was implemented to avoid the very complex neural network. Therefore, three small-scale branches of 3×3, 2×2, and 1×1 were removed to reduce the size of the model and the calculation amount. The second procedure was to reduce the width of MobileNet-SSD backbone, namely the input/output channels of the convolution layer. The number of features was determined in the network, where the enough width was used to ensure each layer learn the rich features. Among them, the width of the network was dominated the size of the parameters and calculation amount of network. A much wider network was used to extract much more the repetitive features, leading to the higher calculation amount without the substantial contribution to the network. Thus, the width of the backbone network was reduced significantly, due to only one type of object to identify in this case. The third procedure was to adjust the depth (the number) of the convolution layers. The depth of the neural network also determined the non-linear expression of the system. Generally speaking, the deeper neural networks presented the better nonlinear expression capabilities to learn more complex features. Thus, the depth of the convolution layer increased to improve the performance of the network. Three models were then proposed for the detection of golden cicada nymphs. The test results of night images show that the size and calculation amount of the improved model were greatly reduced with the normal network performance. Specifically, the size of the model was reduced from 15.22 to 1.51 MB, the floating-point operation amount was also reduced from 1.13×109 to 1.26×108. The average precision, the average Intersection over union (IoU), and the F1 score were 90.46%, 83.52%, and 92.35%, respectively, while, the GPU and CPU detection speed reached 179.3 and 6.49 frames/s, respectively. The daytime images show that the excellent generalization performance was achieved for the improved model. The improved MobileNet-SSD model presented the higher cost efficiency to greatly reduce the size of the model while reducing the computation load, while only a slight decrease in the performance, compared with the original. A comparative advantage was gained, compared with other lightweight target detection models. Therefore, the model can be widely expected to balance the performance, the size of the model and the computation load. The improved model can be very suitable for the deployment on mobile terminals and other embedded resource-constrained devices, which is conducive to real-time and accurate detection of golden cicada nymphs.

       

    /

    返回文章
    返回