Noise Robust Restoration of Electronic Disguised Voices Based on i-vector
Affiliation:

College of Command and Control Engineering, Army Engineering University, Nanjing, 210007, China

Fund Project:

National Natural Science Foundation of China (61471394, 62071484); Jiangsu Province Excellent Youth Fund (BK20180080).

    Abstract:

    Electronic voice disguise alters a speaker's individual characteristics with voice-changing devices or speech-processing software in order to deliberately conceal the speaker's identity. Restoration of an electronically disguised voice converts it back to the original voice, which is important for voice-based identity verification. This paper formulates the restoration of voices disguised in the frequency domain and in the time domain as the estimation of a disguising factor. The disguising factor is estimated with i-vector-based automatic speaker verification, and a symmetric transformation is introduced to further improve the estimate. By exploiting the noise robustness of i-vectors, the method improves the estimation accuracy of the disguising factor in real noisy conditions and thereby improves the restoration of disguised voices under noise. With the i-vector model trained on the clean TIMIT corpus and tested on the noisy VoxCeleb1 corpus, the error rate of disguising-factor estimation drops from 9.19% for the baseline system to 4.49%. The restored voices also improve in terms of the equal error rate of automatic speaker verification and in auditory perception.

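Cite this article

Zheng Linlin, Zhang Xiongwei, Sun Meng, Li Jiakang, Zhang Xingyu. Noise Robust Restoration of Electronic Disguised Voices Based on i-vector[J]. Journal of Data Acquisition and Processing (数据采集与处理), 2020, 35(5): 880-891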

History
  • Received: 2020-01-09
  • Last revised: 2020-05-16
  • Published online: 2020-10-22