Multi-channel Linear Prediction for Speech Dereverberation Using Cross-Band Filters and Sparse Priors

doi:10.16337/j.1004-9037.2024.05.007

Home > Archive>Volume 39, Issue 5, 2024 >1135-1146. DOI:10.16337/j.1004-9037.2024.05.007

Multi-channel Linear Prediction for Speech Dereverberation Using Cross-Band Filters and Sparse Priors
DOI:
                        10.16337/j.1004-9037.2024.05.007
                    
CSTR:
                        
Author:
                        
Affiliation:1.Digitalization Department, Open University of China, Beijing 100039, China;2.Center for Machine Vision and Signal Analysis, University of Oulu, Oulu 90570, Finland;3.Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, China
Clc Number:TN912.3
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

The multi-channel linear prediction （MCLP） is one of the most popular speech dereverberation methods. The band-to-band spectral subtraction model has been adopted by most existing studies to obtain the desired speech signal in each frequency band， but it neglects the interaction between different frequencies. This paper proposes a MCLP-based speech dereverberation method using the cross-band spectral subtraction model instead of the widely adopted band-to-band spectral subtraction model. The proposed model employs cross-band filters to account for the interactions between different frequencies. We model the desired signal using the complex generalized Gaussian （CGG） distribution. Compared with the Gaussian distribution， the CGG distribution can capture the sparse nature of speech signals using a suitable shape parameter. Within the maximum likelihood estimation framework， the speech dereverberation problem is formulated as an optimization problem involving the band-to-band and cross-band filters. An optimization algorithm with guaranteed convergence is derived based on the majorization-minimization method. A series of speech dereverberation experiments under various reverberation times， different channel numbers and different source-to-microphone distances demonstrate that the proposed method significantly outperforms traditional methods in terms of dereverberation performance.

Reference

Cited by

Get Citation

KANG Yao, KANG Fang, YANG Feiran. Multi-channel Linear Prediction for Speech Dereverberation Using Cross-Band Filters and Sparse Priors[J]. Journal of Data Acquisition and Processing,2024,39(5):1135-1146.

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:June 19,2024
Revised:August 23,2024
Adopted:
Online: October 14,2024
Published:

For Authors

Get Citation

Related Videos

Share

Article Metrics

History

Article QR Code