光谱学与光谱分析, 2016, 36 (3): 691, 网络出版: 2016-12-09   

基于可见-近红外光谱变量选择的荒漠土壤全磷含量估测研究

Study on Estimation of Deserts Soil Total Phosphorus Content by Vis-NIR Spectra with Variable Selection
杨爱霞 1,2,*丁建丽 1,2李艳红 3,4邓凯 1,2
作者单位
1 新疆大学资源与环境科学学院, 新疆 乌鲁木齐 830046
2 绿洲生态教育部重点实验室, 新疆 乌鲁木齐 830046
3 新疆师范大学地理科学与旅游学院, 新疆 乌鲁木齐 830054
4 自治区重点实验室“新疆干旱区湖泊环境与资源实验室”, 新疆 乌鲁木齐 830054
摘要
以新疆艾比湖湿地保护区采集的300个荒漠土壤样品为研究对象, 利用ASD Field Spec○R 3 HR光谱仪获取的土壤可见-近红外光谱数据以及化学分析获取的土壤全磷数据为数据源, 将原始光谱数据经过卷积平滑、 标准正态变量变换以及一阶微分预处理后, 采用蚁群-遗传结合区间偏最小二乘法提取荒漠土壤全磷含量特征波长, 构建土壤全磷含量偏最小二乘回归预测模型; 并与全谱偏最小二乘、 蚁群-区间偏最小二乘、 遗传-偏最小二乘模型进行比较。 结果表明: 经蚁群-区间偏最小二乘法筛选后, 荒漠土壤全磷特征波段为500~700, 1 101~1 300, 1 501~1 700, 1 901~2 100 nm; 进一步采用遗传-区间偏最小二乘法进行变量选择, 得到共线性最小的13个有效波长, 分别为: 1 621, 546, 1 259, 573, 1 572, 1 527, 564, 1 186, 1 988, 1 541, 2 024, 1 118和1 191 nm。 建模方法比较显示, 采用蚁群-遗传结合区间偏最小二乘法选择的特征变量, 建立的模型精度最高, 其次是遗传算法、 蚁群算法和全光谱。 蚁群-遗传结合区间偏最小二乘法建立的土壤全磷含量的模型, 效验证均方根误差RMSECV以及预测集均方根误差RMSEP分别为0.122和0.108 mg·g-1, 效验证相关系数Rc以及预测集的相关系数Rp分别为0.535 7, 0.555 9。 因此, 经过卷积平滑、 标准正态变量变换以及一阶微分预处理, 并利用蚁群-遗传结合区间偏最小二乘法建立的模型不仅简单, 而且具有较高的预测精度和较好的稳健性, 可以估算荒漠土壤全磷含量。
Abstract
In this paper, 300 samples of desert soil collected in the Ebinur Lake Wetland Nature Reserve of Xinjiang were used as the research subject, and the visible/near-infrared spectra data about the soil obtained with the ASD Field Spec○R 3 HR spectrometer and the data about total phosphorus in the soil obtained through chemical analysis were used as the data sources; following Savizky-Golay smoothing, standard normal variation transformation and the first-order differential pretreatment, the combination of ant colony optimization interval partial least squares (ACO-iPLS) and genetic algorithm interval partial least squares (GA-iPLS) were employed to extract the characteristic wavelengths of the total phosphorus content in desert soil, before the partial least squares regression model for predicting the total-phosphorus content in soil was constructed; and this model was compared with the full-spectrum partial least squares model, ACO-iPLS and GA-iPLS. According to the results: through filtering with ACO-iPLS, the total-phosphorus characteristic wavebands in the desert soil were 500~700, 1 101~1 300, 1 501~1 700, and 1 901~2 100 nm; through further variable selection with GA-iPLS, 13 effective wavelengths with the minimum colinearity were selected, which were respectively: 1 621, 546, 1 259, 573, 1 572, 1 527, 564, 1 186, 1 988, 1 541, 2 024, 1 118, and 1 191 nm. According to the comparison of modeling methods, the most accurate model was the one based on the characteristic variables selected with the combination of ACO-iPLS and GA-iPLS, followed by the ones with genetic algorithm, ant colony optimization algorithm and the full spectrum method. For the total phosphorus content in soil model established with the combination of ACO-iPLS and GA-iPLS, the root mean square error of cross validation (RMSECV) and the root mean square error of prediction (RMSEP) were respectively 0.122 and 0.108 mg·g-1, and the related coefficient for cross validation (Rc) and the related coefficient for prediction (Rp) were 0.535 7 and 0.555 9, respectively. Therefore, it can be seen that the model constructed through Savizky-Golay smoothing, standard normal variation transformation and the first-order differential pretreatment and by using the combination of ACO-iPLS and GA-iPLS has simple structure, high prediction accuracy and good robustness, and can be used for estimating the total phosphorus content in desert soil.

杨爱霞, 丁建丽, 李艳红, 邓凯. 基于可见-近红外光谱变量选择的荒漠土壤全磷含量估测研究[J]. 光谱学与光谱分析, 2016, 36(3): 691. YANG Ai-xia, DING Jian-li, LI Yan-hong, DENG Kai. Study on Estimation of Deserts Soil Total Phosphorus Content by Vis-NIR Spectra with Variable Selection[J]. Spectroscopy and Spectral Analysis, 2016, 36(3): 691.

本文已被 5 篇论文引用
被引统计数据来源于中国光学期刊网
引用该论文: TXT   |   EndNote

相关论文

加载中...

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!