基于相同稀疏模式的稀疏主成分分析算法
作者:
作者单位:

1.昆明理工大学信息工程与自动化学院,昆明 650500;2.云南电网有限责任公司红河供电局,红河 661100

作者简介:

通讯作者:

基金项目:

国家自然科学基金(61763020)。


Sparse Principal Component Analysis Algorithm Based on Same Sparse Pattern
Author:
Affiliation:

1.Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China;2.Honghe Power Supply Bureau, Yunnan Power Grid Co., Ltd., Honghe 661100,China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
    摘要:

    稀疏主成分分析是一种用于降维和特征选择的无监督方法。由于计算多个主成分时主载荷向量间不具有相同的稀疏模式,导致难以从原始特征空间中确定出对主成分贡献最大的小部分变量,为解决此问题,提出一种自适应稀疏主成分分析(Adaptive sparse principal component analysis, ASPCA)算法。首先使用组套索模型,通过在载荷向量上施加块稀疏约束得出自适应稀疏主成分分析公式,随后对稀疏矩阵的不同列使用不同的调整参数获得自适应惩罚,最后运用块坐标下降法对自适应稀疏主成分分析公式进行两阶段优化,从而找到稀疏载荷矩阵和正交矩阵,实现降维的最优化。对稀疏主成分分析(Sparse principal component analysis, SPCA)算法、结构化且稀疏的主成分分析(Structured and sparse principal component analysis, SSPCA)算法和ASPCA算法进行仿真比较,结果表明ASPCA算法的降维性能更优,能提取更有价值的特征,从而显著提高了分类模型的平均分类准确率。

    Abstract:

    Sparse principal component analysis is an unsupervised method for dimensionality reduction and feature selection. An adaptive sparse principal component analysis (ASPCA) algorithm is proposed, because the principal load vectors do not have the same sparse pattern when calculating multiple principal components, and it is difficult to determine a small number of the variables that contribute the most to the principal components from the original feature space. Firstly, the group lasso model is used, and the ASPCA formula is obtained by applying block sparse constraints on the load vector. Subsequently, different adjustment parameters are used for different columns of the sparse matrix to obtain adaptive penalty. Finally, the block-coordinate descent method is used to optimize the adaptive sparse principal component analysis formula in two stages, so as to find the sparse load matrix and the orthogonal matrix and achieve the optimization of dimensionality reduction. The comparison results of the sparse principal component analysis (SPCA) algorithm, the structured and sparse principal component analysis (SSPCA) algorithm and the ASPCA algorithm show that the ASPCA algorithm has better dimensionality reduction performance and can extract more valuable features, thereby effectively improving the average classification accuracy of the classification model.

    参考文献
    相似文献
    引证文献
引用本文

邵剑飞,浦蓉,黄伟,季建杰,郭鹏.基于相同稀疏模式的稀疏主成分分析算法[J].数据采集与处理,2022,37(5):1084-1091

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
历史
  • 收稿日期:2020-09-07
  • 最后修改日期:2020-12-10
  • 录用日期:
  • 在线发布日期: 2022-09-25