基于UNet-ResNet14*半监督学习的无人机影像森林树种分类

陈龙伟; 周小成; 李传昕; 林华章; 王永荣; 崔永红

doi:10.11975/j.issn.1002-6819.202310172

摘要: 无人机遥感在森林树种精细和高效分类制图中具有巨大的潜力。为了快速准确获取森林的优势树种分布信息，该研究探讨了半监督学习方法在树种分类方面的有效性。以福建省福州市、龙岩市和三明市的4个试验区为例，构建精简的ResNet18为主干的UNet树种分类模型（UNet-ResNet14*），使用交叉熵和Dice系数的联合损失函数来优化模型参数，对比分析Self-training和Mean Teacher两种不同的半监督学习方法在无人机影像森林树种分类模型的泛化能力。结果表明，以ResNet14*作为主干的分类模型与其他模型相比精度更高且预测速度更快，当联合损失函数权重值为0.5的情况下模型预测效果最好，总体精度达到了91.15%。经过Self-training的模型在木荷、马尾松、杉木3个样本充足的类别中精度均有所提升，总精度为91.08%，比原始模型略低，但在独立验证区的精度为88.50%，比原始模型高；Mean Teacher方法的总精度为88.56%，在独立验证区的精度为73.56%。因此，研究认为可以采用Self-trainin半监督方法结合UNet-ResNet14*的方案快速得到试验区的树种组成信息。

Abstract: Unmanned aerial vehicle (UAV) remote sensing has the promising potential for the precise and efficient classification and mapping of forest tree species. Deep learning also requires a large number of datasets for training, typically on manual annotation. In this study, the framework of forest tree species classification was proposed to fully utilize a large amount of unlabeled data and a small amount of annotated data using semi-supervised learning. A rapid and accurate classification was also achieved in the high-precision distribution of dominant tree species in forests. The experimental areas were taken as the complex mountainous forest environment in Fujian Province. The composition of tree species was then obtained in a rapid, effective, and cost-saving manner. Taking four experimental areas in Fuzhou, Longyan, and Sanming in Fujian Province as examples, the simplified classification was constructed in the UNet tree species (ResNet14*) model with ResNet18 as the backbone. ResNet14* was different from ResNet18: ResNet14* was used to remove the layer4 part of ResNet18, i.e., the last downsampled cascaded block, which retained slightly higher spatial information; At the end of the layer2 and layer3 sections of ResNet14*, a max pooling layer was added to reduce the training parameters of the neural network while retaining the original features. A joint loss function of cross entropy and Dice coefficient was used to optimize the model parameters. The generalization of Self-training and Mean teacher was evaluated on the classification models with semi-supervised learning using UVA images. The results show that the overall accuracy of the ResNet14* network reached 91.15%, with a Kappa coefficient of 0.827, which was within 1% of the accuracy of the rest ResNet models. At the same time, a smaller number of parameters and the shortest prediction time were achieved to balance the accuracy and efficiency of tree species classification. The best prediction performance of ResNet14* was achieved with the joint loss function weight of 0.5, indicating an overall accuracy of 91.15%. Therefore, the joint loss function weight of 0.5 was an optimal value for semi-supervised learning in this case. Self-training and Mean teacher semi-supervised learning were implemented with UNet (ResNet14*) as the main network. The experiment showed that the overall accuracy of the Self-training on the test set reached 91.08%, slightly lower than the original. The higher category accuracy was also achieved in the categories of Schima superba, Pinus massoniana, and Chinese fir with sufficient samples. Furthermore, the overall accuracy of the self-training with pseudo labels was improved among two semi-supervised models in experimental area D, reaching 88.50% compared with the original; There was a significant decrease in the overall accuracy of the Mean teacher model with consistency loss. The total accuracy of the Mean teacher model was 88.56%, where the accuracy was 73.56% in the independent validation area. Accuracy evaluation was also conducted on an independent validation area. The classification accuracy of above 80% was found in the three types of tree species, namely Schima superba, Pinus Massoniana, and Chinese fir. A relatively large area was accounted for to meet the accuracy requirements of tree species mapping in the experimental area. Therefore, the semi-supervised learning of the Self-training model can be expected to rapidly obtain the composition of tree species in the experimental area.

基于UNet-ResNet14*半监督学习的无人机影像森林树种分类

Classification of tree species based on UNet-ResNet14* semi-supervised learning using UAV images