Multi-label Feature Selection Based on Label Complementarity
Author:
Affiliation:
School of Software, East China Jiaotong University, Nanchang 330013, China
Fund Project:
摘要
|
图/表
|
访问统计
|
参考文献
|
相似文献
|
引证文献
|
资源附件
摘要:
已有的多标记特征选择方法主要根据特征与标记之间的依赖度以及特征与特征之间的冗余度确定每个特征的重要度,然后根据重要度进行特征选择,常常忽略标记关系对特征选择的影响。针对上述问题,引入邻域互信息设计了基于标记补充的多标记特征选择算法(Multi-label feature selection algorithm based on label complementarity,MLLC),该算法将依赖度、冗余度以及标记关系作为特征重要度的评价要素,然后基于这3个要素重新设计特征重要度评估函数,使得选取的特征能够获得更佳的分类性能。最后,在6个多标记数据集上验证了MLLC算法的有效性和鲁棒性。
Abstract:
Multi-label feature selection is an important research component in the field of multi-label learning. Existing multi-label feature selection methods mainly measure the importance of each feature based on the dependency between features and labels, and the redundancy among features. Then, feature ranking is performed based on feature importance, often ignoring the influence of label relationships on feature importance. To solve this problem, a multi-label feature selection algorithm based on label complementarity(MLLC) is designed, which introduces neighbourhood mutual information. The algorithm takes dependency, redundancy and label relationships as the evaluation elements of feature importance. And then it redesigns the feature importance evaluation function based on these three elements, so as to select features with stronger discriminative power and achieve better classification performance. Finally, the effectiveness and robustness of the algorithm are verified on six classical multi-label datasets.