中国农业科学 ›› 2025, Vol. 58 ›› Issue (20): 4054-4069.doi: 10.3864/j.issn.0578-1752.2025.20.003

• 盐碱地智慧监测 • 上一篇    下一篇

基于土壤高光谱特征波段优选的盐分预测模型构建

李明丽1,2(), 温彩运1, 马东豪3, 李存军4, 王宇文1, 康璐1, 陆苗1,2()   

  1. 1 北方干旱半干旱耕地高效利用全国重点实验室(中国农业科学院农业资源与农业区划研究所),北京 100081
    2 国家盐碱地综合利用技术创新中心,山东东营 257347
    3 中国科学院南京土壤研究所/土壤与农业可持续发展重点实验室,南京 211135
    4 北京市农林科学院信息技术研究中心,北京 100097
  • 收稿日期:2025-07-23 接受日期:2025-09-24 出版日期:2025-10-16 发布日期:2025-10-14
  • 通信作者:
    陆苗,E-mail:
  • 联系方式: 李明丽,E-mail:2236072794@qq.com。
  • 基金资助:
    本文为退化耕地监测研究成果; 中国农业科学院重大科技任务(CAAS-ZDRW202407)

Construction of Salinity Prediction Model Based on Optimal Selection of Soil Hyperspectral Characteristic Bands

LI MingLi1,2(), WEN CaiYun1, MA DongHao3, LI CunJun4, WANG YuWen1, KANG Lu1, LU Miao1,2()   

  1. 1 State Key Laboratory of Efficient Utilization of Arid and Semi-Arid Arable Land in Northern China (Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences), Beijing 100081
    2 National Center of Technology Innovation for Comprehensive Utilization of Saline-Alkali Land, Dongying 257347, Shandong
    3 Institute of Soil Science, Chinese Academy of Sciences/Key Laboratory of Soil and Sustainable Agriculture, Nanjing 211135
    4 Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097
  • Received:2025-07-23 Accepted:2025-09-24 Published:2025-10-16 Online:2025-10-14

摘要:

【目的】土壤盐渍化导致土壤结构恶化、作物减产及生态系统退化,是威胁干旱区农业可持续发展的关键环境问题。本研究旨在利用光谱变换、波段筛选以及多种机器学习方法,构建土壤盐分预测模型,快速、准确地预测土壤盐分,为盐渍化农田的科学管理提供技术支撑。【方法】以达拉特旗农田土壤为研究对象,系统采集土壤样品并测定其电导率(EC)与光谱反射率数据。首先,采用Savitzky-Golay(S-G)滤波对原始光谱(R)进行平滑去噪,在此基础上系统开展包括倒数、对数、一阶微分、二阶微分等在内的12种光谱变换处理,以挖掘隐含光谱特征。进而,分别采用相关性分析(CA)和最小角回归(LAR)方法进行特征降维,并结合竞争性自适应重加权采样(CARS)算法进一步筛选敏感特征波段。最后,基于优选特征分别构建偏最小二乘回归(PLSR)、支持向量机(SVM)、反向传播神经网络(BPNN)和随机森林(RF)模型,通过决定系数(R2)和均方根误差(RMSE)综合评价模型性能,对比特征集在不同算法中的建模效果。【结果】原始光谱经过光谱变换后,相关系数均有不同程度的提升,表明光谱变换能显著增强土壤盐分与光谱特征的相关性;在使用竞争性自适应重加权采样进行特征波段优选时,最小角回归比相关性分析具有更好的特征降维效果;倒数对数一阶微分(ATFD)结合偏最小二乘回归模型表现最优,其验证集精度为R2=0.81、RMSE=2.04 dS·m-1;不同建模方法对比显示,偏最小二乘回归模型的预测性能优于其他3种模型(反向传播神经网络/随机森林/支持向量机),表明偏最小二乘回归模型更适合该区域土壤盐分的预测。【结论】基于ATFD-LAR-CARS-PLSR的土壤盐分高光谱预测模型精度高、预测能力最优,证实了高光谱技术结合多维度特征优化可有效实现干旱区土壤盐分预测。

关键词: 土壤盐分, 光谱变换, 特征波段选择, 相关性分析, 最小角回归, 竞争性自适应重加权采样

Abstract:

【Objective】Soil salinization is a key environmental problem threatening the sustainable development of agriculture in arid areas, leading to the deterioration of soil structure, crop yield reduction and ecosystem degradation. The purpose of this study is to use spectral transformation, band selection and a variety of machine learning methods to build a soil salinity prediction model, which can quickly and accurately estimate soil salinity, and provide technical support for the scientific management of salinized farmland.【Method】Taking farmland soil in Dalate Banner as the research object, soil samples were collected systematically and their electrical conductivity (EC) and spectral reflectance data were measured. Firstly, Savitzky-Golay (S-G) filter was used to smooth the original spectrum (R). On this basis, 12 kinds of spectral transformation processing including reciprocal, logarithmic, first-order differential and second-order differential were carried out to mine the hidden spectral features. Then, the correlation analysis (CA) and least angle regression (LAR) methods were used to reduce the feature dimension, and the competitive adaptive reweighted sampling (CARS) algorithm was combined to further screen the sensitive feature bands. Finally, partial least squares regression (PLSR), support vector machine (SVM), back propagation neural network (BPNN) and random forest (RF) models were constructed based on the optimal features. The performance of the model was comprehensively evaluated by determination coefficient (R2) and root mean square error (RMSE), and the modeling effects of feature sets in different algorithms were compared.【Result】After spectral transformation, the correlation coefficients of the original spectrum were improved in varying degrees, indicating that spectral transformation could significantly enhance the correlation between soil salinity and spectral characteristics; When CARS was used for feature band optimization, LAR had better feature dimension reduction effect than CA; The reciprocal logarithmic first-order differential (ATFD) combined with PLSR model performed best, and its validation set accuracy was R2=0.81, RMSE=2.04 dS·m-1; The comparison of different modeling methods showed that the performance of PLSR prediction model was better than the other three models (BPNN/RF/SVM), indicating that PLSR model was more suitable for the prediction of soil salinity in this region.【Conclusion】The hyperspectral prediction model of soil salinity based on ATFD-LAR-CARS-PLSR has high accuracy and optimal prediction ability, which proves that hyperspectral technology combined with multi-dimensional feature optimization can effectively realize the prediction of soil salinity in arid areas.

Key words: soil salinity, spectral transformation, characteristic band selection, correlation analysis (CA), least angle regression (LAR), competitive adaptive reweighted sampling (CARS)