基于近红外光谱和深度学习数据增强的大米品种检测

杨森; 张新奡; 王振民; 类成伟; 宋文龙

doi:10.11975/j.issn.1002-6819.202305088

基于近红外光谱和深度学习数据增强的大米品种检测

Rice variety detection based on near-infrared spectroscopy anddeep learning data augmentation

摘要

摘要: 近红外光谱和深度学习结合的思路是大米品种检测的重要研究方向，其准确检测模型的建立依赖大规模的样本数据，然而采集和预处理样本耗时巨大，对准确性的提升造成限制。为解决上述不足，便于深入探究近红外光谱结合深度学习方法在大米品种检测领域应用的可行性，该研究提出基于近红外光谱结合改进型深度卷积生成式对抗神经网络（deep convolutional generative adversarial network，DCGAN）数据增强的大米品种检测方法。首先，在相同环境下采集4种大米品种的近红外光谱并对原始光谱数据进行预处理，使用去趋势校正（detrend correction，DC）和无信息变量消除算法（uninformative variable elimination，UVE）消除无用光谱特征点。然后，建立改进型DCGAN模型对预处理后的光谱数据进行数据增强，对比试验结果表明，改进型DCGAN相比与传统数据增强方法，改进型DCGAN生成数据的结构相似性指标更优。最后，研究不同数据增强方法结合不同分类方法建立大米品种分类模型的性能，对比试验结果表明，改进型DCGAN数据增强结合一维卷积神经网络（one-dimensional convolution neural network，1D-CNN）分类算法所建模型面向测试集的准确率最高，为98.21%，为简便准确的大米品种检测方案提供了新思路。

Abstract: It is widely recognized that the value of rice in the market is highly depends on its variety, with some varieties commanding significantly higher prices than others. Unfortunately, this has led to a situation where unscrupulous vendors may attempt to pass off inferior or lower-value rice varieties as more valuable ones. This urgently requires a reliable and efficient methods for accurately detecting and identifying different rice varieties. The use of near-infrared spectroscopy and deep learning for rice variety detection is an important research direction in this field. The establishment of an accurate detection model depends on a large-scale sample data, however, the process of collecting and preprocessing samples is very time-consuming, which limits the improvement of accuracy. To address the aforementioned needs and limitations, this paper proposes a rice variety detection method based on near-infrared spectroscopy combined with an improved deep convolutional generative adversarial network (DCGAN) for data augmentation. Based on the classical deep convolution generative adversarial neural network, the network increases the number of convolution layers and adjusts the network structure. Subsequent experiments showed that the network has better generation ability. In this study, we selected four rice varieties, namely Wuchang, Xiangshui, Yinshui, and Yueguang, and collected 80 samples for each variety, amounting to a total of 320 rice powder samples as experimental samples. Firstly, we collected near-infrared spectra of the samples and preprocessed the raw spectral data by applying detrend correction (DC) and uninformative variable elimination (UVE) algorithms to eliminate irrelevant spectral features. Then, we established an improved DCGAN model to augment the preprocessed spectral data. Three traditional data enhancement methods, random left and right translation, superimposed Gaussian noise and translation noise, were used for comparison. Comparative experiments demonstrated that the spectral data generated by the improved DCGAN showed significant superiority in terms of structural similarity compared to traditional data augmentation methods. Finally, we investigated different data augmentation methods combined with different classification methods to establish a rice variety classification model. We employed five classifiers, namely partial least squares discriminant analysis (PLS-DA), k-nearest neighbors (KNN), random forest (RF), decision tree (DT), and one-dimensional convolutional neural network (1D-CNN), and combined them with four data augmentation methods to build the rice variety classification model. The results showed that the model built using DCGAN-augmented data achieved the highest accuracy on the test set, the accuracy rate was 98.21%. Through quantitative analysis of the models built using DCGAN combined with the five classifiers, it was found that the variety detection model established by DCGAN-1D-CNN method performed the best on the test set, with a coefficient of determination R² of 0.9922, mean absolute error (MAE) of 0.0089, and root mean square error (RMSE) of 0.0505, indicating its optimal performance. Based on the comprehensive experimental results, the model established by combining the improved DCGAN data augmentation with the 1D-CNN classification algorithm exhibited the best performance in terms of accuracy on the test set, providing a new approach for convenient and accurate rice variety detection applications. By utilizing near-infrared spectroscopy and deep learning methods, combined with the improved DCGAN for data augmentation, we can enhance the accuracy and reliability of rice variety detection, thus meeting the market demand and consumer expectations.

HTML全文

参考文献(35)

施引文献

资源附件(0)