基于模糊判别成分分析法的高光谱作物信息提取与分类

杨晨; 董丽芳; 赵海士; 常志勇

doi:10.11975/j.issn.1002-6819.2019.21.019

摘要: 针对高光谱遥感影像的高维性、不确定性及作物光谱变异性问题，传统信息提取方法不适用于高光谱遥感影像作物分类，该文基于一种调节学习方法，引入模糊统计学相关理论，提出模糊判别成分分析（fuzzy-statistics-based discriminative component analysis, FS-DCA）用于提取利于作物分类的高光谱遥感特征空间。首先定义模糊统计学数字特征，利用分块统计建立模糊可分特征子空间，抑制噪声像元造成的不确定性。将主成分分析（principal component analysis, PCA）及判别成分分析（discriminative component analysis, DCA）与FS-DCA所提取特征和原始全部波段分别应用于AVIRIS Indian Pines 92AV3C高光谱遥感影像中进行分类。结果表明，利用FS-DCA的7个特征进行作物分类获得的平均总体精度比采用全部波段、PCA和DCA分别高出6.88、3.28和0.5个百分点，种植作物的生产者精度与用户精度比传统方法提高1.37~18.47个百分点。该方法有效减少了高光谱影像维数，可为作物信息提取与分类提供参考。

Abstract: Abstract: Hyperspectral remote sensing technology are typically used to collect data with at least ten spectral bands of relatively narrow bandwidths. The hyperspectral remote sensing images often have extensive interband correlations, and high-dimensional data with uncertainty, which is significant characteristics of hyperspectral remote sensing imagery. As a result, the adjacent bands of the hyperspectral images may contain similar spectral information and spatial structures. Reducing the dimensionality of data therefore becomes one of the most important preprocessing step in hyperspectral image analysis. Since the individual bands of a hyperspectral image share high similarity, statistical data compression tools, such as principal component analysis (PCA), have been widely applied for dimensionality reduction of hyperspectral images. However, it is well known that PCA is sensitive to the presence of outliers and missing data. Meanwhile, the spectral variability from the identification of crop species makes the conventional information extraction methods unsuitable for the crop classification with hyperspectral remote sensing images. During the PCA, the geographical information including remotely sensed data cannot also be accurate, which means that the boundaries between different classes and phenomena are not clear, i.e., fuzzy. The fuzzy approach to statistical analysis rather than other mathematics methods can be quite suitable to process this kind of image data with the fuzziness and statistical property. Recently, a new learning paradigm—adjustment learning (AL) has been studied for the image retrieval as an emerging technology. In the AL scheme, data points can be identified as small group sets, i.e., "chunklets", with equivalence constraints that are known to originate from the same class (but the label is unknown). A non-iterative and efficient metric learning method, called discriminant component analysis (DCA), has been developed based on the AL. In this paper, the fuzzy sets theory and statistics were introduced into the AL-based technique, DCA, and then a hyperspectral imagery feature extraction method called fuzzy-statistics-based discriminant component analysis (FS-DCA) was proposed to effectively distinguish different types of crops via their hyperspectral signal subspace. The statistical characteristics involved the fuzzy mean, as well as fuzzy scatter matrix were defined based on the fuzzy statistics. In the feature space of FS-DCA, the unknown measurement vector (pixel) has membership grade values to describe the distance between the vector and the mean vector. The fuzzy separable subspace was then established to suppress the uncertainty caused by noise pixels. To verify this approach, a widely used hyperspectral imagery data set, i.e., AVIRIS Indian Pines 92AV3C was chosen including nine classes and two types of grow crops. The performance of the proposed FS-DCA method on AVIRIS hyperspectral images classification was compared with those of using all spectral channels and two representative feature extraction methods, the PCA and DCA. Experimental results showed that the proposed FS-DCA provided the better classification performance and lower standard deviation than the previous PCA and DCA. Specifically, the average overall accuracy of FS-DCA was much higher than that using all spectral channels, the PCA and DCA, ranking values in order 7.88, 3.28 and 0.5 percentage points. The producer’s accuracies and user’s accuracies of grow crops were significantly higher than that using the conventional methods, ranging from 1.37~18.47 percentage points. This finding demonstrated that the proposed FS-DCA technique can greatly reduce the dimension of hyperspectral images, and thus provide a promising method for the information extraction and classification of crops using remote sensing data.

基于模糊判别成分分析法的高光谱作物信息提取与分类

Hyperspectral feature extraction using fuzzy-statistics-based discriminative component analysis method for crop classification