Abstract:
The grape industry plays an important role in rural revitalization and targeted poverty alleviation, but in recent years grape production has been seriously disturbed by pests and diseases. Refined agricultural and biological control measures are required to reduce their occurrence for both economic and food-safety reasons, and accurate identification and diagnosis of grape pests and diseases is a prerequisite for such precise control. Traditional identification relies mainly on visual inspection by experts or technicians, which is time-consuming, laborious, costly, slow, and difficult to apply at scale. Machine learning and computer vision offer a promising alternative for efficient image-based identification, with great potential to improve identification efficiency and accuracy while reducing cost. Conventional machine learning classification combines feature representation with a classifier; however, identification based on hand-crafted features is only suitable for small data sets, and most such algorithms depend on complex image processing and manually designed features, resulting in low robustness, especially in complex field environments. Deep learning, as one of the most advanced and promising technologies in recent years, has therefore been widely applied in agriculture. This study addresses the small data sets and large numbers of model parameters in existing deep learning approaches in order to achieve higher identification accuracy. A data set of grape pests and diseases was constructed, containing healthy leaves, three types of diseased leaves, and 16 types of insect pests, and an identification model for grape pests and diseases based on an improved MobileNet V2 was proposed. Firstly, a Coordinate Attention (CA) module was embedded in the inverted residual block of MobileNet V2 to enhance the feature representation of the model. Secondly, a two-branch feature fusion module was designed using depthwise separable convolution to improve identification performance. Finally, the number of channels was adjusted to streamline the structure of MobileNet V2, yielding a lightweight identification model named MobileNet_Vitis. The results show that the model performed better when the CA module was placed after the 1×1 pointwise convolution (a schematic sketch of this placement follows the abstract): compared with introducing the CA module after the 3×3 depthwise convolution, the accuracy was improved by 1.61 percentage points and the parameter size was reduced by 1.62 MB. The two-branch feature fusion further improved identification accuracy by 1.4 percentage points, and the channel adjustment provided a trade-off between identification accuracy and speed. The identification accuracy and F1 score of MobileNet_Vitis on the data set were 89.16% and 80.44%, respectively, 1.83 and 9.31 percentage points higher than those of the original MobileNet V2. Moreover, the parameter size of MobileNet_Vitis was 7.85 MB, 8.5% smaller than that of MobileNet V2.
Compared with ResNet 101, ShuffleNet V2, MobileNet V3, and GhostNet, MobileNet_Vitis achieved higher identification accuracy and F1 score with a smaller parameter size. Its inference time for a single pest image was 17.53 ms, fully meeting the requirement for fast identification. The improved model therefore identifies grape pests and diseases more accurately with lower complexity, and can be further deployed in a mobile applet for grape pest and disease control.
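As a rough illustration of the CA placement described above, the following PyTorch sketch shows a Coordinate Attention module and an inverted residual block with CA inserted after the 1×1 pointwise convolution. The class names, the reduction ratio of 32, the Hardswish activation, the expansion ratio of 6, and the assumption that "pointwise convolution" refers to the projection (linear bottleneck) convolution are not specified in the abstract; they are illustrative assumptions, not the authors' exact implementation.

```python
import torch
import torch.nn as nn


class CoordinateAttention(nn.Module):
    """Coordinate Attention: pools along height and width separately so the
    attention weights retain positional information in both directions."""

    def __init__(self, channels, reduction=32):  # reduction ratio is an assumption
        super().__init__()
        mid = max(8, channels // reduction)
        self.pool_h = nn.AdaptiveAvgPool2d((None, 1))  # pool over width  -> (B, C, H, 1)
        self.pool_w = nn.AdaptiveAvgPool2d((1, None))  # pool over height -> (B, C, 1, W)
        self.conv1 = nn.Conv2d(channels, mid, kernel_size=1)
        self.bn1 = nn.BatchNorm2d(mid)
        self.act = nn.Hardswish()
        self.conv_h = nn.Conv2d(mid, channels, kernel_size=1)
        self.conv_w = nn.Conv2d(mid, channels, kernel_size=1)

    def forward(self, x):
        b, c, h, w = x.shape
        x_h = self.pool_h(x)                       # (B, C, H, 1)
        x_w = self.pool_w(x).permute(0, 1, 3, 2)   # (B, C, W, 1)
        y = self.act(self.bn1(self.conv1(torch.cat([x_h, x_w], dim=2))))
        y_h, y_w = torch.split(y, [h, w], dim=2)
        a_h = torch.sigmoid(self.conv_h(y_h))                      # (B, C, H, 1)
        a_w = torch.sigmoid(self.conv_w(y_w.permute(0, 1, 3, 2)))  # (B, C, 1, W)
        return x * a_h * a_w


class InvertedResidualCA(nn.Module):
    """MobileNet V2 inverted residual block with CA inserted after the 1x1
    pointwise (projection) convolution; the insertion point is an assumption
    based on the abstract's wording."""

    def __init__(self, in_ch, out_ch, stride=1, expand_ratio=6):
        super().__init__()
        hidden = in_ch * expand_ratio
        self.use_residual = stride == 1 and in_ch == out_ch
        layers = []
        if expand_ratio != 1:  # 1x1 expansion convolution
            layers += [nn.Conv2d(in_ch, hidden, 1, bias=False),
                       nn.BatchNorm2d(hidden), nn.ReLU6(inplace=True)]
        layers += [
            # 3x3 depthwise convolution
            nn.Conv2d(hidden, hidden, 3, stride, 1, groups=hidden, bias=False),
            nn.BatchNorm2d(hidden), nn.ReLU6(inplace=True),
            # 1x1 pointwise projection (linear bottleneck)
            nn.Conv2d(hidden, out_ch, 1, bias=False),
            nn.BatchNorm2d(out_ch),
        ]
        self.block = nn.Sequential(*layers)
        self.ca = CoordinateAttention(out_ch)  # CA applied after the pointwise conv

    def forward(self, x):
        out = self.ca(self.block(x))
        return x + out if self.use_residual else out


# Usage example (hypothetical shapes): a stride-1 block keeps the spatial size.
block = InvertedResidualCA(32, 32, stride=1)
y = block(torch.randn(1, 32, 56, 56))  # -> (1, 32, 56, 56)
```

Placing the attention on the projected output rather than on the expanded depthwise features keeps the attention layers narrow, which is consistent with the abstract's report of both higher accuracy and a smaller parameter size for this variant.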