周亚同, 赵翔宇, 何峰, 石超君. 基于高斯过程混合模型的大气温湿度预测[J]. 农业工程学报, 2018, 34(5): 219-226. DOI: 10.11975/j.issn.1002-6819.2018.05.029
    引用本文: 周亚同, 赵翔宇, 何峰, 石超君. 基于高斯过程混合模型的大气温湿度预测[J]. 农业工程学报, 2018, 34(5): 219-226. DOI: 10.11975/j.issn.1002-6819.2018.05.029
    Zhou Yatong, Zhao Xiangyu, He Feng, Shi Chaojun. Atmospheric temperature and humidity prediction of Gaussian process mixed model[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2018, 34(5): 219-226. DOI: 10.11975/j.issn.1002-6819.2018.05.029
    Citation: Zhou Yatong, Zhao Xiangyu, He Feng, Shi Chaojun. Atmospheric temperature and humidity prediction of Gaussian process mixed model[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2018, 34(5): 219-226. DOI: 10.11975/j.issn.1002-6819.2018.05.029

    基于高斯过程混合模型的大气温湿度预测

    Atmospheric temperature and humidity prediction of Gaussian process mixed model

    • 摘要: 温湿度预测在国民经济各领域有重要作用,实现温湿度精准预测可有效提高农业生产及保障行人安全,室内温湿度预测有助于植物健康生长,减少经济损失;室外温湿度预测对行人安全及航空等科研起保障作用。针对现有温湿度预测效果不佳且不能实现多模态预测,该文采用高斯过程混合(gaussian process mixture, GPM)模型进行大气温湿度多模态预测。另外为了提升模型学习效率,给GPM模型提出了的一种隐变量后验硬划分迭代学习算法。该算法采用一种新的近似策略,利用最大后验估计不断矫正样本划分,借助迭代学习实现样本最优分组。在用自相关函数和最大Lyapunov指数等解析评价温湿度序列基础上,将GPM模型与核回归(kernel-regression, K-R)、最小最大概率机回归(minimax probability machine regression, MPMR)、线性回归(linear- regression, L-R)、高斯过程(gaussian process, GP)等传统预测模型进行比较。结果表明GPM不仅能够实现多模态预测,而且预测准确率要明显优于其它传统模型。最终湿度预测最优结果=0.062 0、=0.936 2,训练耗时为113.417 5 s;温度预测最优结果=0.042 6、=0.966 6,训练耗时为90.004 9 s。由于GPM为无环境因子输入模型,因此该文的研究不仅对大气温湿度预测有促进作用,同时对室内及固体表面温湿度预测具有一定借鉴价值。

       

      Abstract: Abstract: Temperature and humidity prediction plays an important role in various fields of the national economy. In view of the poor predictions of temperature and humidity and the failure to achieve multimodal prediction, this paper uses the Gaussian mixture process (GPM) model to predict the multimodal temperature and humidity. GPM model was developed from the Gaussian process (GP) model. It was proposed by Tresp in 2000. The main idea of GPM is to fit the samples in various input regions with different GP models in order to fit different patterns in these regions. However, the existing GPM models are rather complicated and thus the learning algorithms involve various approximation schemes, which can lead to much computation or rough prediction. So, in order to improve the model learning efficiency, a variable-hidden posterior hard-cut iterative training algorithm is proposed for the GPM model. This algorithm improves the EM (expectation maximization) learning with a new approximation strategy, which achieves the optimal grouping of samples through iterative learning and using the maximum a posteriori estimation. Prior to the experiment, the autocorrelation function and partial autocorrelation function were used to verify the nonlinear and non-stationary property of humidity and temperature sequences. Largest Lyapunov, saturation correlation dimension and recursive graph were used to verify the chaotic characteristic of humidity and temperature sequences. Finally, the GPM proposed in this paper is compared with the kernel regression (K-R), the minimax probability machine regression (MPMR), the linear regression (L-R), and GP models. K-R is a kind of prediction model based on kernel function. By adjusting the optimal window width, K-R gradually gets the corresponding prediction result. K-R has been the basis of predictive model because it can predict the linear and non-linear time series simultaneously. MPMR does not need to make specific assumptions about the model distribution. It only needs to know the mean and covariance matrix of the data distribution. This model improves the neural network easy to fall into the minimum and over-fitting. L-R is a very traditional model, using the quantitative relationship between variables to obtain a linear expression and then predict the test sample. The results show that not only GPM can perform multimodal prediction, but also the accuracy of the model is significantly better than other traditional models. In the humidity sequence prediction, the minimum of RMSE can reach 0.0620 and the maximum of R2 can reach 0.9362. In the temperature sequence prediction, the minimum of RMSE can reach 0.0426 and the maximum of R2 can reach 0.9666. In the prediction time consumed, GPM is a little more than the other 4 traditional models. The main reason is that GPM requires iterative learning to achieve the optimal grouping of samples. Finally, humidity prediction needs 113.4175 s and temperature prediction needs 90.0049 s. The research in this paper not only can promote atmospheric temperature and humidity forecasting, but also has some reference value for indoor and solid surface temperature and humidity forecasting because GPM is the prediction model without environment factors input.

       

    /

    返回文章
    返回