中国农业科学 ›› 2021, Vol. 54 ›› Issue (18): 3860-3870.doi: 10.3864/j.issn.0578-1752.2021.18.006

• 植物保护 • 上一篇    下一篇

基于增强回归树的海河平原小麦赤霉病预测模型构建与验证

陶晡1(),齐永志1(),屈赟2,曹志艳1,赵绪生1,甄文超3()   

  1. 1河北农业大学植物保护学院,河北保定 071001
    2河北农业大学现代教育技术中心,河北保定 071001
    3河北农业大学农学院/华北作物改良与调控国家重点实验室/河北省作物生长调控重点实验室,河北保定 071001
  • 收稿日期:2021-02-01 接受日期:2021-03-02 出版日期:2021-09-16 发布日期:2021-09-26
  • 联系方式: 陶晡,Tel:0312-7526131;E-mail: taobu@hebau.edu.cn。|齐永志,E-mail: qiyongzhi1981@163.com。
  • 基金资助:
    国家重点研发计划(2017YFD0300906);国家重点研发计划(2018YFD0300502);河北省现代农业产业技术体系(HBCT2018010205)

Construction and Verification of Fusarium Head Blight Prediction Model in Haihe Plain Based on Boosted Regression Tree

TAO Bu1(),QI YongZhi1(),QU Yun2,CAO ZhiYan1,ZHAO XuSheng1,ZHEN WenChao3()   

  1. 1College of Plant Protection, Hebei Agricultural University, Baoding 071001, Hebei
    2Modern Educational Technology Center, Hebei Agricultural University, Baoding 071001, Hebei
    3College of Agronomy, Hebei Agricultural University/State Key Laboratory of North China Crop Improvement and Regulation/Key Laboratory of Regulation and Control of Crop Growth of Hebei, Baoding 071001, Hebei
  • Received:2021-02-01 Accepted:2021-03-02 Published:2021-09-16 Online:2021-09-26

摘要:

【背景】自1995年至今,小麦赤霉病(Fusarium head blight,FHB)逐渐在海河平原蔓延,由零星出现演变成连片发生,在流行年份呈现出暴发快、面积大、损失重的特点,小麦赤霉病已由次要病害上升为主要病害之一。准确的预测预报是有效控制小麦赤霉病发生与发展的关键和难点。【目的】根据海河平原小麦赤霉病发生情况的监测分析,构建适宜的小麦赤霉病预测模型,为科学防控赤霉病提供技术支撑。【方法】基于2001—2016年海河平原21个小麦主产县(市)的赤霉病病穗率数据,以及小麦关键生育期内的气象数据,采用逐步回归分析,筛选影响小麦赤霉病发生的关键气象因子,构建基于多元线性回归模型和增强回归树模型的小麦赤霉病发生预测模型。【结果】明确了增强回归树模型的学习效率(lr)为0.005、树的复杂度(tc)为6时,模型的预测偏差最低,残差标准误为0.006311;筛选出8个对海河平原小麦赤霉病发生影响显著的关键气象因子,即MRH15、Rain-35、MRH-55、SD15、LT-65、MWS-55、MT-25、DRain15,并构建了含有8个预测变量的多元线性回归模型(R2=0.8158,矫正R2=0.8018,P<2.2×10-16)。同时,应用增强回归树模型评估了上述8个关键气象因子的重要性,分别为69.62%、14.08%、4.89%、4.34%、3.35%、2.02%、1.20%、0.50%;根据重要的预测变量进一步简化预测模型,构建了含有4个预测变量的多元线性回归模型(y=-19.45376+0.11689MRH15+0.17346Rain-35+0.04185SD15+0.26592MRH-55,R2=0.7575,矫正R2=0.7468,P<2.2×10-16);当预测变量由8个调减至4个时,利用2008、2010、2012年安新、定州、馆陶等地历史数据验证模型预测病穗率的准确度,多元线性回归模型预测准确度由88.43%降至85.90%,增强回归树模型预测准确度由87.72%升至91.23%;利用2001—2016年正定、栾城的历史数据验证模型预测病穗率的准确度,两个模型预测准确度无显著变化,多元线性回归模型预测准确度由87.53%变为87.42%,增强回归树模型预测准确度由89.20%变为89.21%。整体而言,多元线性回归模型预测准确度呈下降趋势,而增强回归树模型预测准确度呈上升趋势。【结论】研究构建了含有4个预测变量的增强回归树模型,其预测准确度达89.21%,病穗率预测值与实际观测值的波动趋势基本一致,表明增强回归树模型在海河平原小麦赤霉病预测预报中具有很好的应用前景。

关键词: 小麦赤霉病, 禾谷镰孢, 预测模型, 增强回归树

Abstract:

【Background】 Since 1995, Fusarium head blight (FHB) has gradually spread and risen from a secondary disease to a major disease in Haihe Plain, from sporadic occurrence to continuous occurrence, showing the characteristics of rapid outbreak, large area and heavy loss in epidemic years. To realize effective prevention and control of FHB, accurate forecasting technology is an important prerequisite for controlling the occurrence and development of FHB. 【Objective】According to the occurrence of FHB in Haihe Plain, the prediction model of FHB suitable for Haihe Plain was established to provide technical supports for scientific prevention and control of FHB.【Method】Based on the data about spike rate of FHB and meteorological factors of key growth stage of wheat in 21 counties of Haihe Plain from 2001 to 2016, the key meteorological factors which have significant influences on the FHB occurrence in Haihe Plain were screened by stepwise regression analysis, and the prediction models of FHB occurrence based on multiple linear regression model and boosted regression tree model were constructed, respectively.【Result】When the learning efficiency (lr) of the boosted regression tree model was 0.005 and the complexity (tc) of the tree was 6, the prediction deviation of the model was the lowest, and the residual standard error was 0.006311. Eight key meteorological factors, including MRH15, Rain-35, MRH-55, SD15, LT-65, MWS-55, MT-25 and DRain15, which had a significant impact on the occurrence of FHB in Haihe Plain, were screened out, and a multiple linear regression model with eight predictive variables was established (R2=0.8158, corrected R 2=0.8018, P<2.2×10 -16). Meanwhile, the importance of each key meteorological factor was evaluated by using the boosted regression tree model, with the values of 69.62%, 14.08%, 4.89%, 4.34%, 3.35%, 2.02%, 1.20% and 0.50%, respectively. According to the key predictive variables, the prediction model was further simplified, and a multiple linear regression model with four predictive variables was constructed (y=-19.45376+0.11689MRH15+0.17346Rain-35+0.04185SD15+0.26592MRH-55, R2=0.7575, corrected R 2=0.7468, P<2.2×10 -16). When the prediction variables was reduced from 8 to 4, the prediction accuracy of the multiple linear regression model decreased from 88.43% to 85.90%, but the prediction accuracy on the disease spike rate of the boosted regression tree model increased from 87.72% to 91.23%, which was verified by using the historical data of Anxin, Dingzhou and Guantao, etc in 2008, 2010 and 2012. The prediction accuracy on the disease spike rate of the multiple linear regression model and the boosted regression tree model changed from 87.53% to 87.42% and from 89.20% to 89.21%, respectively, but there was no significant difference between the multiple linear regression model and the boosted regression tree model, when they were verified with the historical data of Zhengding and Luancheng from 2001 to 2016. In a word, the prediction accuracy of multiple linear regression model showed a downward trend, while the prediction accuracy of boosted regression tree model showed an upward trend.【Conclusion】In this study, the boosted regression tree model with four predictive variables was constructed, with the prediction accuracy of 89.21%. At the same time, the disease spike rate predicted by the boosted regression tree model was basically consistent with the observed fluctuation trend, indicating that the boosted regression tree model had a good application prospect in the prediction of FHB in Haihe Plain.

Key words: Fusarium head blight (FHB), Fusarium graminearum, prediction model, boosted regression tree (BRT)