一种基于压缩感知的说话人识别参数分析
作者:
作者单位:

作者简介:

通讯作者:

基金项目:


Parameter of Speaker Recognition Based on Compressed Sensing
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
    摘要:

    本文为在传统的说话人识别理论研究中“较少的特征参数量不能与较高的识别率共存”的难题找到了一种解决方案。本文基于压缩感知的理论,利用行阶梯观测矩阵进行信号的投影,改变了传统的梅尔频率倒谱系数(Mel-frequency cepstral coefficient, MFCC)参数,从而提出了一种新的识别参数CS-MFCC(Compressed sensing-MFCC)。该参数不仅使得参数存储量降低到少于原存储量的1/n(n为行阶梯观测矩阵的压缩比),而且明显提高了系统的鲁棒性。通过仿真 实验证明了当压缩比n为4时,平均识别率能够提高到96%以上。

    Abstract:

    A solution is proposed to deal with the problem that ″less number of features cannot coexist with higher recognition rate″ in the traditional theory of speaker recognition. Ladder observation matrix projection is used to change the traditional Mel-frequency cepstral coefficient (MFCC) parameters based on compressed sensing theory, presenting a new recognition parameters named compressed sensing MFCC (CS-MFCC) parameters. These parameters make storage capacity decrease to less than 1/n of the original, here n is the compression ratio of the line ladder matrix, and also greatly increase the robustness of the system. Furthermore simulation results prove that when n is 4, the recognition rate increases to 96% above.

    参考文献
    相似文献
    引证文献
引用本文

潘海琦 杨震 徐珑婷 朱俊华.一种基于压缩感知的说话人识别参数分析[J].数据采集与处理,2015,30(2):399-407

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2015-04-23