模糊粗糙集中基于条件熵的特定类属性约简
作者:
作者单位:

1四川师范大学计算机科学学院,成都 610101;2四川师范大学数学科学学院,成都 610066

作者简介:

通讯作者:

基金项目:

教育部人文社科规划基金(23YJA630114);四川省自然科学基金(2024NSFSC0486,2026NSFSC0444);四川师范大学研究生创新能力培养项目(KY2025016)。


Class-Specific Attribute Reduction Based on Conditional Entropy in Fuzzy Rough Sets
Author:
Affiliation:

1School of Computer Science, Sichuan Normal University, Chengdu 610101, China;2School of Mathematical Science, Sichuan Normal University, Chengdu 610066, China

Fund Project:

Humanities and Social Sciences Planning Fund of Ministry of Education(No.23YJA630114);Sichuan Provincial Natural Science Foundation(Nos.2024NSFSC0486,2026NSFSC0444);Sichuan Normal University Postgraduate Innovation Ability Training Program(No.KY2025016).

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
    摘要:

    传统的属性约简方法面向决策分类构建统一的属性约简集,忽视了不同决策类间属性的差异性表征,导致特定类的关键属性被冗余覆盖,且特定类分类准确率不高等问题。针对以上问题,考虑到模糊粗糙集在广泛存在的数值型和模糊性数据上的处理优势,本文提出了一种条件熵驱动基于模糊粗糙集的特定类属性约简方法。首先,通过融合模糊粗糙集的决策包含度与信息熵理论,定义了面向特定决策类的条件熵,量化了条件属性对目标类别的局部区分能力;其次,给出了基于条件熵的特定类属性约简条件,并分别定义了基于特定类条件熵的内部和外部属性重要度,进一步提出了基于属性重要度的特定类属性约简前向算法(Forward algorithm based on class-specific conditional entropy,FA-CE)和后向算法(Backward algorithm based on class-specific conditional entropy,BA-CE)。最后,在UCI的7个数据集和特征选择数据的2个数据集上与邻域条件熵、互信息、邻域粗糙集和一种传统依赖度约简方法进行了特定类约简比较,并在支持向量机(Support vector machine,SVM)、K最近邻(K-nearest neighbor,KNN)和分类与回归树(Classification and regression tree,CART)3种分类器上比较了特定类分类的准确率和F1-score,验证了本文提出的特定类属性约简方法的合理性与有效性。

    Abstract:

    Traditional attribute reduction methods construct a unified attribute reduction set for decision classification, ignoring the differentiated representation of attributes among various decision classes, which often results in the pivotal attributes of specific classes being redundantly covered and leads to suboptimal classification accuracy for those specific classes. To address these issues, this paper proposes a class-specific attribute reduction method driven by conditional entropy based on fuzzy rough sets, considering the advantages of fuzzy rough sets in handling widely existing numerical and fuzzy data. Firstly, by integrating the decision inclusion degree of fuzzy rough sets with information entropy theory, a class-specific conditional entropy is defined to quantify the local discriminative power of conditional attributes with respect to the target class. Secondly, the paper presents a class-specific attribute reduction condition based on conditional entropy and defines both internal and external attribute significance measures based on this class-specific conditional entropy. Furthermore, forward (FA-CE) and backward (BA-CE) attribute reduction algorithms are proposed based on attribute significance. Finally, the class-specific attribute reduction is conducted on seven UCI datasets and two feature selection datasets, and comparative analyses are performed against methods based on neighborhood conditional entropy, mutual information, neighborhood rough sets, and a conventional dependency-based reduction approach. The classification accuracy and F1-score of the proposed method are evaluated using support vector machine(SVM), K-nearest neighbor(KNN) and classification and regression tree(CART) classifiers, demonstrating the rationality and effectiveness of the proposed class-specific attribute reduction approach.

    参考文献
    相似文献
    引证文献
引用本文

闵艳玲,杨霁琳,懂梦梦,张贤勇.模糊粗糙集中基于条件熵的特定类属性约简[J].数据采集与处理,2026,(3):825-840

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
历史
  • 收稿日期:2025-06-12
  • 最后修改日期:2025-08-15
  • 录用日期:
  • 在线发布日期: 2026-06-10