光学 精密工程, 2020, 28 (12): 2719, 网络出版: 2021-01-19  

面向标签恢复的子集划分迭代投影集成

S u b set -d ivided iterative p rojection b aggin g for n oisy -label recovery
作者单位
1 华大学信息科学与技术学院, 上海 201620
2 人工智能教育部重点实验室, 上海 200240
摘要
在图像特征提取中, 样本标签并非完全真实有效, 可能导致图像归类框架的分类精度大幅下降, 而现有标签恢复算法面临含噪样本难以高效再利用的瓶颈问题。为此, 本文提出一种基于子集划分迭代投影集成的标签恢复算法。该算法首先随机多次地提取小规模子集信息, 然后综合主成分分析、邻域图正则化及 K-近邻算法等技术进行样本图像的可靠降维与迭代投影集成, 最后遵循多数投票原则实现标签复原。本文选取两大代表性的人脸数据库, 对多种标签恢复算法进行了不同指标下的大量对比分析。实验结果证明, 本文算法能够有效地校正样本的含噪标签, 在同一图像归类框架下针对 Yale B与 AR数据库分别使分类精度提升了 16. 9%与 8. 1%。相较于目前最好的标签恢复算法, 本文子集划分迭代投影集成算法可以提升 4. 3%~4. 7%的分类精度, 且在确保样本数据完整性的同时具备了一定的可扩展性。
Abstract
In image feature extraction, sample labels are rarely completely true and effective. This often leads to a significant decrease in the accuracy of an image classification framework. In addition, existing label recovery algorithms often must deal with a bottleneck problem in which noisy samples are difficult to reuse. Therefore, this paper proposes a subset-divided iterative projection bagging algorithm for noisy-label recovery. First, the proposed algorithm extracts small-scale subset information randomly and re. peatedly. It then integrates principal component analysis, neighbor graph regularization, K-nearest neigh. bor, and other techniques to achieve effective dimension reduction and iterative projection integration of sample images. Finally, class-label recovery is conducted by implementing the majority voting principle. This study uses common databases as experimental objects and conducts several comparisons and analy. ses of various recovery algorithms using different indicators. Experimental results show that the proposed algorithm effectively corrects the noisy labels of samples, and the classification accuracy of the default framework is improved by as much as 16. 9% and 8. 1% for the Yale B and AR databases, respective. ly. Compared with the state-of-the-art algorithm, the classification accuracy of the proposed algorithm is improved by 4. 3-4. 7%. The proposed algorithm also has good scalability and can ensure the integrity of sample data.

应晓清, 刘浩东, 袁文野, 杨正成. 面向标签恢复的子集划分迭代投影集成[J]. 光学 精密工程, 2020, 28(12): 2719. YING Xiao-qing, LIU hao, YUAN Wen-ye, YANG Zheng-cheng. S u b set -d ivided iterative p rojection b aggin g for n oisy -label recovery[J]. Optics and Precision Engineering, 2020, 28(12): 2719.

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!