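YU Taojie, CHEN Jianneng, PENG Weijie, et al. Tea data enhancement method based on Tea DCGAN network and Fake Tea pipeline[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2024, 40(22): 1-10. DOI: 10.11975/j.issn.1002-6819.202405076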

    Tea data enhancement method based on Tea DCGAN network and Fake Tea pipeline

      Abstract: When the original image data of tea leaves are insufficient, deep learning models generalize poorly, and their ability to detect tender tea shoots declines substantially. To address this issue, this study proposes Tea DCGAN (tea deep convolutional generative adversarial network), a generative adversarial network, together with a corresponding data augmentation method. First, a 64×64×64 network layer was added to both the generator and the discriminator of the DCGAN (deep convolutional generative adversarial network) to improve the model's perception and learning of low-dimensional features. In addition, the LeakyReLU (leaky rectified linear unit) function in the DCGAN was replaced with the ELU (exponential linear unit) function, which the authors describe as more linearly controllable, to improve training stability and accuracy. Second, the Fake Tea data augmentation framework was built on the Tea DCGAN network. The framework analyzes the distribution of real tender tea shoots in an existing dataset to obtain the distribution pattern, places the sample images generated by Tea DCGAN into real tea tree images according to that pattern, and automatically assembles a deep learning dataset. Finally, the proposed data augmentation method was evaluated in three experiments: an ablation test of the generative adversarial network, a controlled test on rare tea varieties, and a comparison of multiple data augmentation methods at different dataset scales.
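The activation swap described above can be illustrated with a minimal sketch of the two functions. This is not the authors' code: the slope of 0.2 (a value commonly used with DCGAN) and alpha of 1.0 are assumptions, since the abstract does not state the hyperparameters used. The sketch only shows how the two activations differ for negative inputs: LeakyReLU keeps decreasing linearly, while ELU saturates smoothly toward -alpha.

```python
import math


def leaky_relu(x: float, negative_slope: float = 0.2) -> float:
    """LeakyReLU: identity for x >= 0, a small linear slope for x < 0.

    negative_slope=0.2 is an assumed value, not taken from the paper.
    """
    return x if x >= 0 else negative_slope * x


def elu(x: float, alpha: float = 1.0) -> float:
    """ELU: identity for x > 0, alpha * (exp(x) - 1) otherwise.

    alpha=1.0 is an assumed value, not taken from the paper.
    """
    return x if x > 0 else alpha * (math.exp(x) - 1.0)


if __name__ == "__main__":
    # Compare the two activations on a few sample inputs.
    for x in (-5.0, -1.0, 0.0, 1.0):
        print(f"x={x:+.1f}  leaky_relu={leaky_relu(x):+.4f}  elu={elu(x):+.4f}")
```

For large negative inputs the ELU output is bounded below by -alpha, whereas the LeakyReLU output is unbounded; whether this accounts for the stability gain reported in the study is not stated in the abstract.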
The ablation test showed that Tea DCGAN performed best on the FID (Fréchet Inception Distance) metric. In particular, at 100,000 training epochs, the FID value for the Zijuan tea variety dropped from 322.10 to 265.63, and that for the Longjing 43 tea variety dropped from 396.38 to 323.09, indicating a marked improvement in the quality of the generated images. In the comparison of data augmentation methods across multiple detection models, the Fake Tea method outperformed the other methods on every detection model. Specifically, the Faster R-CNN model reached mAP values of 42.71% and 38.46% on datasets built from 25 Longjing 43 images and 25 Zijuan images, respectively. As the dataset size increased, the performance of all methods improved, but the Fake Tea method maintained the highest mAP at every dataset size. Notably, with 200 original images, the mAP reached 89.41%, which makes the method suitable for intelligent tea picking. These results demonstrate the effectiveness and superiority of Tea DCGAN and the Fake Tea data augmentation method in tea image generation and object detection. The proposed methods can effectively alleviate the difficulty of data acquisition and the shortage of samples, and improve the detection accuracy of tender tea shoots when few samples are available.
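For reference, FID is conventionally computed from Inception-network feature statistics of real and generated images. The expression below is the standard definition, which the abstract does not restate; the symbols are introduced here for illustration, with \mu_r, \Sigma_r the feature mean and covariance of the real images and \mu_g, \Sigma_g those of the generated images.

```latex
\mathrm{FID} = \lVert \mu_r - \mu_g \rVert_2^2
  + \operatorname{Tr}\!\left( \Sigma_r + \Sigma_g - 2\,(\Sigma_r \Sigma_g)^{1/2} \right)
```

A lower value means the generated feature distribution is closer to the real one, so the reported drops (322.10 to 265.63 and 396.38 to 323.09) correspond to improved image quality.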

       
