光谱学与光谱分析, 2018, 38 (12): 3883, 网络出版: 2018-12-16   

混合式随机森林的土壤钾含量高光谱反演

Random Forests-Based Hybrid Feature Selection Algorithm for Soil Potassium Content Inversion Using Hyperspectral Technology
作者单位
1 中国海洋大学环境科学与工程学院海洋环境与生态教育部重点实验室, 山东 青岛 266100
2 青岛农业大学理学与信息科学学院, 山东 青岛 266109
3 山西工程技术学院信息工程与自动化系, 山西 阳泉 045000
4 国家海洋局北海环境监测中心, 山东 青岛 266033
摘要
从土壤速效钾光谱中挖掘关键特征较为困难, 导致高光谱反演模型预测精度较低。 针对此问题, 提出了一种混合式随机森林特征选择算法。 首先采用封装式特征选择方法进行特征预选, 快速去除冗余并保留相关特征, 然后再利用改进的随机森林特征选择算法对预处理后的特征进行精选, 通过增大关键特征与冗余特征的区分度以及采用迭代特征选择的方式, 使精选后的特征具有更好的鲁棒性与区分性, 较好的解决了土壤速效钾高光谱反演模型精度较低的问题。 为了验证所提出算法的有效性, 选取了青岛市大沽河流域具有代表性的124个土壤样品为实验对象, 利用提出的算法从2 051个原始波段选出含有13个敏感波段的最优光谱子集建立土壤速效钾反演模型, 并与现有特征选择算法所建模型进行对比分析。 结果表明: 该算法构建的回归模型具有较低的预测均方根误差RMSEP(9.661 5), 较高的相关系数r(0.936 9)和预测分析相对误差RPD(2.14)。 混合式随机森林特征选择算法以较少的特征波长数实现了较好的预测效果, 可为土壤养分实时光谱传感器的设计提供一定的理论依据。
Abstract
In order to solve the problem of lower prediction performance caused by the difficulty in retrieving the key features from hyperspectral data of soil available potassium, this paper proposes a novel hybrid feature selection algorithm based on Random Forests. Firstly, wrapper-based feature selection methods were applied to rapidly remove the redundancies and preserve the related features. Secondly, an Improved-RF feature selection algorithm was applied to further accurately select the wavelength variables from the pre-selected feature sets. In this step, characteristic wavelength with strong robustness and discriminative could be selected through improving the dipartite degree between the key and redundant features and using an iterative feature selection method. Therefore, the problem of low prediction performance in the soil available potassium inversion model could be better solved by using our hybrid feature selection algorithm. In order to verify the validity of our algorithm, 124 representative soil samples collected from the Dagu River Basin were chosen. Using our algorithm, the optimal feature subset which contained 13 sensitive bands have been selected and used to build soil available potassium content inversion model. This work compared the model performance of full bands, current feature selection algorithms and our algorithm. The comparison results indicated that our algorithm not only selects minimum numbers of wavelength features and reduces the dimension of full bands, but also achieves better prediction performance with lower RMSEP (9.661 5), higher R (0.936 9) and RPD (2.14). As an effective method of soil available potassium inversion model, the algorithm proposed in this paper can provide theoretical basis for the design of real-time soil nutrient sensors.

王轩慧, 郑西来, 韩仲志, 王轩力, 王娟. 混合式随机森林的土壤钾含量高光谱反演[J]. 光谱学与光谱分析, 2018, 38(12): 3883. WANG Xuan-hui, ZHENG Xi-lai, HAN Zhong-zhi, WANG Xuan-li, WANG Juan. Random Forests-Based Hybrid Feature Selection Algorithm for Soil Potassium Content Inversion Using Hyperspectral Technology[J]. Spectroscopy and Spectral Analysis, 2018, 38(12): 3883.

本文已被 3 篇论文引用
被引统计数据来源于中国光学期刊网
引用该论文: TXT   |   EndNote

相关论文

加载中...

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!