Qin Lifeng, He Dongjian, Song Huaibo. Bag of words feature multi-PCA subspace adaptive fusion for cucumber diseases identification[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2018, 34(8): 200-205. DOI: 10.11975/j.issn.1002-6819.2018.08.026
    Citation: Qin Lifeng, He Dongjian, Song Huaibo. Bag of words feature multi-PCA subspace adaptive fusion for cucumber diseases identification[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2018, 34(8): 200-205. DOI: 10.11975/j.issn.1002-6819.2018.08.026

    Bag of words feature multi-PCA subspace adaptive fusion for cucumber diseases identification

    • Abstract: Cucumber disease identification in greenhouse is of great significance to cucumber production. The traditional recognition method based on colour and texture features with single classifier has low accuracy in diagnosing many kinds of diseases. In view of this problem, a cucumber disease identification method based on Bag of Words (BoW) model and multi classifiers adaptive fusion on PCA-subspaces, BoW-mPCA, was proposed. First, for each category disease images, the SIFT points were extracted and clustered into a certain number of clusters, and the centres, called visual words (VW), were gathered to establish the category-related BoW (CR-BoW) model. Considering that the image data in the remote transmission can reduce the real-time of the disease recognition, and the disease location plays a decisive role in disease identification, so we cut the diseases location of the sample images by hand into sub-images of different size (minimum 58×68, maximum 562×487), which were collected as the dataset for CR-BoW model construction and experiments. To present the images based on the CR-BoW model, every SIFT point of each image was presented by the nearest VW in the VW space. Then each image was presented as a histogram of the VWs appearing on the image. To solve the problem of the high-dimension BoW-feature of disease images, PCA (Principal Component Analysis) method was adopted to reduce the feature dimensionality to series of subspaces of different size, and on each subspace a BP network was trained. During the identification process, a self-adaptive strategy for classifiers combination was built based on the idea that each single type of disease should be classified on one optimal PCA-subspace. The diseases image was classified by the BP net trained on the smallest PCA-subspace. If the distance of the highest two scores was higher than a given threshold, the image was classified into the category corresponding to the highest score. If not, the image was classified again by the BP net trained on the next larger PCA-subspace, and the scores were confused with the former ones and the highest two were compared to decide the final result. Repeating this process until the highest two scores were obviously discrepant or all the BP nets were used. The experiments were carried on a dataset containing five types of cucumber diseases, angular leaf spot, corynes pora, powder, downy and anthrax. The results showed that the average classification precision of five kinds of cucumber diseases in two PCA subspace fusion with preservation of 75%, and 85% energy, respectively, was 90.38%, higher than the single colour, texture and colour-texture mix feature by 6.97, 26.15, 13.02 percent respectively. Although for single disease of angular leaf Spot, corynes pora and anthrax, BoW-2PCA obtained precisions of 86.96% and 88.00%, slightly lower than colour features (89.78% and 93.20%), the precision on downy was 89.13% comparing to 64.20% from colour feature. This result indicated that BoW-2PCA could deal with the situation that the traditional methods can hardly handle. Moreover, the BOW-2PCA algorithm showed more stability in classification precision over all five types of diseases than the other three traditional methods. These results showed that the proposed CR-BoW model and BoW-mPCA algorithm was effective for diagnosing greenhouse cucumber disease.
    • loading

    Catalog

      /

      DownLoad:  Full-Size Img  PowerPoint
      Return
      Return