Journal of Integrative Agriculture, 2020, Vol. 19, Issue (8): 2127-2136. DOI: 10.1016/S2095-3119(19)62857-1

Special topic: Agro-ecosystem & Environment: Remote sensing

• Article

  


A case-based method of selecting covariates for digital soil mapping

LIANG Peng (1, 2), QIN Cheng-zhi (1, 2, 3), ZHU A-xing (1, 2, 3, 4, 5), HOU Zhi-wei (1, 2), FAN Nai-qing (1, 2), WANG Yi-jie (1, 2)
  

    1 State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, P.R.China
    2 University of Chinese Academy of Sciences, Beijing 100049, P.R.China
    3 Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, School of Geography, Nanjing Normal University, Nanjing 210097, P.R.China
    4 Key Laboratory of Virtual Geographic Environment, Ministry of Education, Nanjing Normal University, Nanjing 210023, P.R.China
    5 Department of Geography, University of Wisconsin-Madison, Madison, WI 53706, USA
  • Received: 2019-06-11 Online: 2020-06-21 Published: 2020-08-01
  • Correspondence: QIN Cheng-zhi, Tel: +86-10-64888959, E-mail: qincz@lreis.ac.cn
  • About author: LIANG Peng, E-mail: liangp@lreis.ac.cn
  • Supported by:
    This work was supported by grants from the National Natural Science Foundation of China (41431177 and 41871300), the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD), China, the Innovation Project of State Key Laboratory of Resources and Environmental Information System (LREIS), China (O88RA20CYA), and the Outstanding Innovation Team in Colleges and Universities in Jiangsu Province, China.

Abstract:

Selecting a proper set of covariates is one of the most important factors influencing the accuracy of digital soil mapping (DSM). Statistical and machine learning methods for selecting DSM covariates are not applicable in situations with limited samples. To solve this problem, this paper proposed a case-based method that formalizes the covariate selection knowledge contained in practical DSM applications. The proposed method trained Random Forest (RF) classifiers with DSM cases extracted from practical DSM applications and then used the trained classifiers to determine whether each potential covariate should be used in a new DSM application. In this study, we took topographic covariates as example covariates and extracted 191 DSM cases from 56 peer-reviewed journal articles to evaluate the performance of the proposed method by leave-one-out cross-validation. Compared with a way of selecting DSM covariates commonly used by novices, the proposed case-based method improved accuracy by more than 30% according to three quantitative evaluation indices (recall, precision, and F1-score). The proposed method could also be applied to selecting a proper set of covariates in other similar geographical modeling domains, such as landslide susceptibility mapping and species distribution modeling.
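
The evaluation loop described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes the three indices compare the recommended covariate set with the set actually used in the held-out case, and it replaces the Random Forest classifiers with a simple majority-vote stand-in so the sketch needs only the Python standard library. The names `Case`, `set_scores`, `majority_vote`, and `leave_one_out` are hypothetical.

```python
from typing import Callable, Dict, FrozenSet, List, Sequence, Tuple

# Hypothetical DSM case: (features describing the application, covariates it used).
Case = Tuple[Dict[str, float], FrozenSet[str]]
Scores = Tuple[float, float, float]


def set_scores(recommended: FrozenSet[str], used: FrozenSet[str]) -> Scores:
    """Return (recall, precision, F1) of a recommended set against the set actually used."""
    hits = len(recommended & used)
    recall = hits / len(used) if used else 0.0
    precision = hits / len(recommended) if recommended else 0.0
    f1 = 2 * precision * recall / (precision + recall) if hits else 0.0
    return recall, precision, f1


def majority_vote(train: Sequence[Case], covariate: str) -> Callable[[Dict[str, float]], bool]:
    """Stand-in for a trained classifier (NOT Random Forest): answers yes if more
    than half of the training cases used the covariate, ignoring the features."""
    share = sum(covariate in used for _, used in train) / len(train) if train else 0.0
    return lambda features: share > 0.5


def leave_one_out(
    cases: List[Case],
    candidates: Sequence[str],
    fit: Callable[[Sequence[Case], str], Callable[[Dict[str, float]], bool]] = majority_vote,
) -> List[Scores]:
    """Hold out each case in turn, fit one yes/no classifier per candidate
    covariate on the remaining cases, and score the recommended set."""
    results = []
    for i, (features, used) in enumerate(cases):
        train = cases[:i] + cases[i + 1:]
        recommended = frozenset(c for c in candidates if fit(train, c)(features))
        results.append(set_scores(recommended, used))
    return results
```

To move closer to the described method, `fit` could be swapped for a function that trains one binary Random Forest per candidate covariate on the case features; the feature encoding of a DSM case is not specified in the abstract and would have to be chosen separately.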
 

Key words: digital soil mapping, covariates, case-based reasoning, Random Forest