融合双线性网络和注意力机制的油橄榄品种识别

    Identification of olive cultivars using bilinear networks and attention mechanisms

    • 摘要: 为解决自然条件下的油橄榄品种识别问题,该研究以油橄榄品种佛奥、莱星、皮削利和鄂植8号为研究对象,融合双线性网络与注意力机制,提出双线性注意力EfficientNet模型。针对不同品种油橄榄表型差异很小的特点,搭建双线性网络以充分提取油橄榄图像中的特征信息。在此基础上,选用兼顾了速度和精度的EfficientNet-B0网络为特征提取网络。针对自然条件下油橄榄品种识别易受复杂背景干扰的问题,将CBAM(convolutional block attention module,CBAM)注意力与双线性网络结合,使模型在提取油橄榄图像特征时,能够聚焦到对油橄榄品种识别起关键作用的特征上。经测试,所提模型对4个油橄榄品种识别的总体准确率达到90.28%,推理时间为9.15 ms。Grad-CAM(gradient-weighted class activation mapping,Grad-CAM)热力图可视化结果也表明,所提模型在识别油橄榄品种时重点关注了果实以及部分叶子区域。消融试验结果表明,在EfficientNet模型中引入CBAM注意力和搭建双线性网络后,总体准确率分别提高了5.00和10.97个百分点。并且,对比试验结果表明,与双线性ResNet34、EfficientNet-SE注意力、双线性ResNet18、双线性VGG16和双线性GoogLeNet等模型相比,所提模型的总体识别准确率分别高12.78、11.53、11.11、10.70和5.00个百分点。该研究为解决自然条件下的油橄榄品种识别提供了依据,同时也可为其他作物的品种识别提供参考。

       

      Abstract: The extensive range of olive cultivars available in the market exhibit minor differences in their phenotypic traits. Nonetheless, their quality attributes, particularly the oil content and fatty acid composition, significantly differ among distinct cultivars, resulting in the emergence of use iinferior products as superior products in the market. The accurate and quick identification of olive cultivars holds significant importance in enhancing the production and quality of olives. As such, delving into the study of olive cultivar identification is crucial for the advancement of the olive industry. This study presents a novel approach to address the challenge of identifying olive cultivars in natural conditions. Specifically, a bilinear attentional EfficientNet model is proposed, which incorporates the bilinear network design concept and attention mechanism. The model is trained and evaluated using four commonly planted olive cultivars (i.e., Frantoio, Leccino, Picholine, and Ezhi 8) in Longnan, Gansu. The experimental results demonstrate the effectiveness of the proposed model in accurately and quickly identifying different olive cultivars. A bilinear network has been suggested to comprehensively extract feature information from olive images, for the limited phenotypic differences across different olive cultivars. In light of this, the selection of a feature extraction network has been made with consideration for both speed and accuracy, leading to the selection of the EfficientNet-B0 network. To tackle the challenge of identifying olive cultivars under natural conditions which are prone to intricate background interferences, a novel approach combining convolutional block attention module (CBAM) with bilinear network has been proposed. This approach facilitates the model in selectively focusing on the salient features responsible for cultivar identification during the feature extraction process of olive images. Upon conducting experiments, the bilinear attention EfficientNet model presented in this study has exhibited an overall accuracy of 90.28% and an inference time of 9.15 ms in identifying four distinct olive cultivars. These experiment results demonstrate that the proposed model better achieved better rapaid and accurate identification of olive cultivars under natural conditions. The present study proposes a bilinear attention EfficientNet model for identifying olive cultivars and utilizes gradient-weighted class activation mapping (Grad-CAM) to analyse its performance. The results demonstrate that the proposed model exhibits a greater attention towards fruit regions, as well as some leaf regions, within olive images. These findings are in agreement with the expert knowledge and experience of human practitioners. The analysis of heat maps generated from misidentified olive images revealed that inadequate focus on the fruit and leaf regions, which are pivotal for successful identification of cultivars, was the primary contributing factor to misidentification. The outcomes of the ablation experiments indicated that the bilinear network and the CBAM exhibited a positive impact on the precision of olive variety recognition. To ascertain the efficacy of the method elucidated in this manuscript, a set of comparative experiments has been formulated. The primary objective of these experiments is to juxtapose the proposed bilinear attention EfficientNet model against the conventional cultivar identification models, including bilinear ResNet34, EfficientNet-SE attention, bilinear ResNet18, bilinear VGG16, and bilinear GoogLeNet. The experimental results obtained from the comparison analysis provide that the proposed bilinear attention EfficientNet model exhibits superior performance in terms of overall accuracy for the identification of olive cultivars. The bilinear attention EfficientNet model's accuracy for the identification of olive cultivars exceeded that of bilinear ResNet34, EfficientNet-SE attention, bilinear ResNet18, bilinear VGG16, and bilinear GoogLeNet models by 12.78, 11.53, 11.11, 10.70, and 5.00 percentage points, respectively. The present study establishes a foundation for resolving the challenge of accurately identifying olive cultivars in natural conditions. Moreover, it offers a valuable point of reference for the identification of cultivars of diverse crops.

       

    /

    返回文章
    返回