Citation: Li Pei, Niu Zhiyou, Tan Hequn, Zhang Weijian, Huangfu Jixuan. Multi-feature data fusion optimization of sensor array of electronic nose for fish meal quality detection[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2019, 35(12): 313-320. DOI: 10.11975/j.issn.1002-6819.2019.12.038

    Multi-feature data fusion optimization of sensor array of electronic nose for fish meal quality detection

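    • Summary: To improve the discrimination ability of a fish meal quality detection device based on bionic olfaction, this paper used a self-designed bionic olfaction detection device to extract response feature information from fish meal samples and performed multi-feature data fusion optimization on its sensor array. Sensor feature values (10×6) were extracted from each sensor's response curve to form the original feature matrix, and the feature values of the sensor array were then normalized. With compactness as the criterion for judging a feature selection method, three single-feature ranking methods (MIC, χ², F-test), three multi-feature ranking methods (RF, LR, SVM), and four recursive feature elimination methods (RFRFE, SVMRFE, DTRFE, LRRFE) were tested for classification accuracy on fish meal of different quality. The random-forest-based recursive feature elimination algorithm (RFRFE) showed the best compactness, with a best classification accuracy of 98.3% at 33 features. The optimized sensor array changed markedly: it went from 10 sensors to 8, with sensors TGS2620 and TGS2600 removed, and the number of feature values fell by 45%. To avoid selection bias, 10-fold cross-validation was applied, and RFRFE again showed better compactness. This feature selection method offers a new approach and reference for feature optimization when bionic olfaction is used to identify other animal-derived raw material samples.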

       

      Abstract: To improve the discrimination ability of a fish meal quality detection device based on bionic olfaction, we used a self-developed bionic olfaction detection device to extract response features from fish meal samples and performed multi-feature data fusion optimization on its sensor array. First, sensor features (10×6) were extracted from each sensor's response curve to form the original feature matrix, and the features were then normalized. Compactness was taken as the criterion for evaluating the feature selection methods. Three single-feature ranking methods (MIC, χ², F-test), three multi-feature ranking methods (RF, LR, SVM), and four recursive feature elimination methods (RFRFE, SVMRFE, DTRFE, LRRFE) were tested for classification accuracy on fish meal of different quality. Among the single-feature ranking methods, MIC reached a best classification accuracy of 98.3% with 55 features, chi-square 98.9% with 40 features, and F-test 98.3% with 50 features; chi-square was therefore more compact than the other two single-feature methods. Among the multi-feature ranking methods, RF reached a best classification accuracy of 98.3% with 38 features, LR 83.3% with 24 features, and SVM 92.2% with 33 features; RF was therefore more compact than the other two multi-feature methods.
Among the recursive feature elimination methods, RFRFE reached a best classification accuracy of 98.3% with 33 features, SVMRFE 92.2% with 34 features, DTRFE 95.6% with 22 features, and LRRFE 83.9% with 37 features. DTRFE and LR thus used the fewest features, but their classification accuracy was low, whereas RFRFE was relatively more compact. The random-forest-based recursive feature elimination algorithm (RFRFE) was therefore adopted to select from the original features, giving a best classification accuracy of 98.3% with 33 features. The method repeatedly builds a model, selects the worst features, sets them aside, and repeats the process on the remaining features until all features have been traversed; the order in which features are eliminated gives the feature ranking. It is thus a greedy algorithm for finding the optimal feature subset. RFRFE uses random forest (RF) as the base model and takes the subset with the best classification accuracy as optimal. The 33 features selected by RFRFE amounted to a 45% reduction (from 10×6 = 60 features) and greatly reduced the irrelevant and redundant information for fish meal quality classification. The optimized sensor array changed markedly: it went from 10 sensors to 8, with sensor 4 (TGS2620) and sensor 6 (TGS2600) removed, which indicated that these two sensors contributed little to the classification of fish meal quality with the RF classifier.
Only sensor 1 (TGS822), sensor 3 (TGS813), and sensor 5 (MQ136) retained all six of their feature values, which indicated that these sensors played an important role in the classification of fish meal quality with the RF classifier. Under 10-fold cross-validation, RFRFE was again found to be more compact. This feature selection method provides a new approach and reference for feature optimization when identifying other animal-derived raw material samples by bionic olfaction.
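
The elimination loop described in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation. Several things here are assumptions made for illustration: a nearest-centroid classifier stands in for the random forest base model, the "worst" feature is taken to be the one whose removal leaves the highest training accuracy, one feature is removed per round, and the toy data and the names `accuracy` and `recursive_elimination` are hypothetical. The paper evaluates with 10-fold cross-validation; the sketch uses plain training accuracy to stay short.

```python
from statistics import mean


def accuracy(X, y, feats):
    """Training accuracy of a nearest-centroid classifier restricted to feats.

    Assumption: this simple model is a stand-in for the random forest
    base model named in the abstract.
    """
    cents = {}
    for label in sorted(set(y)):
        rows = [x for x, t in zip(X, y) if t == label]
        cents[label] = [mean(r[f] for r in rows) for f in feats]
    hits = 0
    for x, t in zip(X, y):
        # Squared Euclidean distance from the sample to each class centroid.
        dist = {c: sum((x[f] - m) ** 2 for f, m in zip(feats, mu))
                for c, mu in cents.items()}
        hits += min(dist, key=dist.get) == t
    return hits / len(y)


def recursive_elimination(X, y):
    """Greedy backward elimination; returns (elimination order, best subset, best accuracy)."""
    remaining = list(range(len(X[0])))
    eliminated = []
    best_subset, best_acc = list(remaining), accuracy(X, y, remaining)
    while len(remaining) > 1:
        # Rebuild the model on the remaining features and pick the worst one:
        # here, the feature whose removal leaves the highest accuracy.
        worst = max(remaining,
                    key=lambda f: accuracy(X, y, [g for g in remaining if g != f]))
        remaining.remove(worst)
        eliminated.append(worst)
        acc = accuracy(X, y, remaining)
        # On ties, prefer the smaller subset (fewer features, same accuracy).
        if acc >= best_acc:
            best_subset, best_acc = list(remaining), acc
    eliminated.extend(remaining)  # the last survivor ranks highest
    return eliminated, best_subset, best_acc


# Hypothetical toy data: feature 0 separates the classes,
# feature 1 is misleading noise, feature 2 is constant.
X = [[0.0, 0.0, 1.0], [0.1, 0.0, 1.0], [0.2, 3.0, 1.0],
     [0.9, 0.0, 1.0], [1.0, 0.0, 1.0], [1.1, 0.0, 1.0]]
y = [0, 0, 0, 1, 1, 1]

order, subset, acc = recursive_elimination(X, y)
print(order, subset, acc)
```

On this toy data the misleading feature is eliminated first and the informative feature survives to the end, so the elimination order doubles as a ranking from least to most useful, as the abstract describes.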

       
