Multi-channel Linear Prediction for Speech Dereverberation Using Cross-Band Filters and Sparse Priors
CSTR:
Author:
Affiliation:

1.Digitalization Department, Open University of China, Beijing 100039, China;2.Center for Machine Vision and Signal Analysis, University of Oulu, Oulu 90570, Finland;3.Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, China

Clc Number:

TN912.3

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    The multi-channel linear prediction (MCLP) is one of the most popular speech dereverberation methods. The band-to-band spectral subtraction model has been adopted by most existing studies to obtain the desired speech signal in each frequency band, but it neglects the interaction between different frequencies. This paper proposes a MCLP-based speech dereverberation method using the cross-band spectral subtraction model instead of the widely adopted band-to-band spectral subtraction model. The proposed model employs cross-band filters to account for the interactions between different frequencies. We model the desired signal using the complex generalized Gaussian (CGG) distribution. Compared with the Gaussian distribution, the CGG distribution can capture the sparse nature of speech signals using a suitable shape parameter. Within the maximum likelihood estimation framework, the speech dereverberation problem is formulated as an optimization problem involving the band-to-band and cross-band filters. An optimization algorithm with guaranteed convergence is derived based on the majorization-minimization method. A series of speech dereverberation experiments under various reverberation times, different channel numbers and different source-to-microphone distances demonstrate that the proposed method significantly outperforms traditional methods in terms of dereverberation performance.

    Reference
    Related
    Cited by
Get Citation

KANG Yao, KANG Fang, YANG Feiran. Multi-channel Linear Prediction for Speech Dereverberation Using Cross-Band Filters and Sparse Priors[J].,2024,39(5):1135-1146.

Copy
Related Videos

Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:June 19,2024
  • Revised:August 23,2024
  • Adopted:
  • Online: October 14,2024
  • Published:
Article QR Code