郭俊先,张振振,韩景,等. 基于傅里叶变换近红外光谱的籽棉含水率无损检测[J]. 农业工程学报,2023,39(21):152-160. DOI: 10.11975/j.issn.1002-6819.202308012
    引用本文: 郭俊先,张振振,韩景,等. 基于傅里叶变换近红外光谱的籽棉含水率无损检测[J]. 农业工程学报,2023,39(21):152-160. DOI: 10.11975/j.issn.1002-6819.202308012
    GUO Junxian, ZHANG Zhenzhen, HAN Jing, et al. Non-destructive detection of seed cotton moisture content based on Fourier transform near-infrared spectroscopy[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2023, 39(21): 152-160. DOI: 10.11975/j.issn.1002-6819.202308012
    Citation: GUO Junxian, ZHANG Zhenzhen, HAN Jing, et al. Non-destructive detection of seed cotton moisture content based on Fourier transform near-infrared spectroscopy[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2023, 39(21): 152-160. DOI: 10.11975/j.issn.1002-6819.202308012

    基于傅里叶变换近红外光谱的籽棉含水率无损检测

    Non-destructive detection of seed cotton moisture content based on Fourier transform near-infrared spectroscopy

    • 摘要: 为了实现对籽棉含水率的快速、无损检测,该研究采用傅里叶变换近红外光谱技术建立籽棉含水率定量检测模型。首先探究了籽棉样本密度对于光谱曲线的影响,该研究发现样本密度大小对光谱曲线影响显著,密度越小光谱信号越强,当样品密度不低于0.0886 g/cm3时,光谱曲线变化趋于平稳。通过采集籽棉样本在3900~11000 cm−1波数范围的吸光度光谱数据,并应用了9种预处理方法对原始光谱数据进行处理。发现一阶导数结合消除趋势(first derivative- detrending,FD-DT)预处理方法在偏最小二乘回归(partial least squares regression,PLSR)模型建立时表现最佳。使用了竞争自适应重复加权法(competitive adaptive reweighted sampling,CARS)、信息增益法(information gain,IG)、连续投影法(successive projections algorithm,SPA)和相关系数(correlation coefficient,CC)等算法,来获取最佳的特征波长。构建PLSR和支持向量机(support vector machine,SVM)的籽棉水分含量预测模型,比较不同分析算法,确定了FD-DT-CARS-PLSR和FD-DT-CARS-SVM两种算法组合作为最佳预测模型,预测集决定系数(R2P)分别为0.933和0.931,预测集均方根误差(root mean square error,RMSE)分别为0.480和0.500,剩余预测偏差(residual prediction deviation,RPD)分别为3.88和3.85。研究结果表明,利用近红外光谱技术可以无损和准确地检测籽棉样本的含水率。

       

      Abstract: Cotton is one of the most crucial global textile raw materials to determine the quality of final products. The moisture content of seed cotton can profoundly impact the storage, transportation, and textile processing of cotton. This study aims to rapidly and non-destructively measure the moisture content of seed cotton. A quantitative detection model was established using Fourier-transformed near-infrared spectroscopy. A series of experiments were conducted to explore the influence of seed cotton sample density on spectral curves. Sample density was significantly dominated in the spectral curves, where the lower densities resulted in stronger spectral signals. However, the fluctuations were stabilized in the spectral curve, when the sample density reached a specific threshold 0.088 60 g/cm3. Data accuracy and comparability were the pivotal reference points. Subsequently, Fourier-transformed near-infrared spectroscopy was employed to collect the absorbance spectral data of seed cotton samples within the range of 3 900-11 000 cm−1 wavelength. The samples were dried in an air blast drying oven (DWG-9240A) under constant temperature. The airflow was used to remove the moisture from the seed cotton. Pre-experimental verification showed that the weight of samples remained relatively unchanged, when the temperature was raised to (105 ±3) ℃ and maintained for three hours. Afterward, the dried samples were removed and weighed to measure their moisture content using a balance with a precision of 0.001 g. Furthermore, nine preprocessing methods were applied to the original spectral data, in order to enhance the data quality. Comparative analysis determined that the best performance was achieved in the first-order derivative combined with detrending (FD-DT) preprocessing using the Partial Least Squares Regression (PLSR) model. The calibration and prediction set determination coefficients were 0.974, and 0.845, respectively, with the root mean square errors of 0.316 and 0.721, respectively, as well as the residual prediction deviation (RPD) of 3.00 after FD-DT preprocessing. The wavenumber range of 4000-10000 cm−1 was selected to extract feature spectral data. The reason was that the lower sensitivity and response of the spectrometer at the beginning and end of the spectra, led to weaker or unstable signals in these regions. The optimal feature wavelengths were then obtained using Competitive Adaptive Reweighted Sampling (CARS), Information Gain (IG), Successive Projections Algorithm (SPA), and Pearson's Correlation Coefficient (CC). The feature wavelength counts of 47, 27, 30, and 27, accounting for 6.03%, 3.46%, 3.85%, and 3.46% of the spectral range, respectively. These features effectively reduced the number of variables for the high efficiency and performance of the model. After the extraction of feature wavelength, the quantitative models were established to predict the moisture content of seed cotton using Partial Least Squares Regression (PLSR) and Support Vector Machine (SVM). Comprehensive analysis of various analytical algorithms showed that the combination of FD-DT-CARS-PLSR and FD-DT-CARS-SVM was the most effective predictive model, where the determination coefficients of 0.933 and 0.931, root mean square errors of 0.480 and 0.500, and residual prediction deviations of 3.88 and 3.85 in the prediction dataset. FD-DT was used to effectively remove the trends and noise from the data for data quality and usability. CARS was used to efficiently select the most relevant feature wavelengths for the performance and prediction accuracy of the model. PLSR demonstrated excellent performance on multicollinear data for better interpretability, but with a relatively weaker performance in fitting nonlinear data. SVM displayed strong capabilities in nonlinear modeling and adaptability to high-dimensional data, but it was relatively difficult to interpret with the slower training times for large datasets. Two models can be combined to effectively predict in this case. In summary, near-infrared spectroscopy can be expected to rapidly, non-destructively, and accurately detect the moisture content of seed cotton .

       

    /

    返回文章
    返回