An Acoustic Echo Cancellation System for Double-Talk Scenario
Author:
Affiliation:
School of Electronics and Information Engineering, Sichuan University, Chengdu 610065, China
Fund Project:
摘要
|
图/表
|
访问统计
|
参考文献
|
相似文献
|
引证文献
|
资源附件
摘要:
声学回声消除中存在的双端讲话情况会导致回声消除器的性能发生恶化,而传统的双端话音检测及其控制算法无法有效应对双讲。针对这一问题,提出一种根据频谱信扰比(近端话音功率谱值和回声功率谱值之比)调整自适应步长的方法。为减少计算量以及降低处理时延,采用分段频域块最小均方(Partitioned frequency block least mean square, PFBLMS)算法作为自适应滤波算法,因而系统的自适应步长在频域上进行调整。首先建立频谱信扰比和相干函数之间的关系,通过计算出的相干函数值得到步长因子,最后根据步长因子实时调整每个频点的自适应步长。此外,还结合双滤波器和稀疏控制算法进一步提高系统的鲁棒性和收敛性能。仿真实验表明,该系统在双讲状态下依然能够保证良好的回声抑制能力,并能够及时跟踪回声信道的变化。相比于基于归一化互相关函数的双端话音检测算法以及开源项目Speex中的回声消除算法,系统在双讲情况下的回波抑制比(Echo return loss enhancement, ERLE)和语音质量得分(Perceptual evaluation of speech quality, PESQ)均有所提高。
Abstract:
The double-talk scenario will deteriorate the performance of echo canceller in acoustic echo cancellation, while traditional double-talk detection and other methods of controlling the adaptive step-size cannot effectively deal with it. To solve this problem, a method of adjusting the adaptive step-size according to the spectral signal-to-interference ratio (the ratio of the near-end speech’s power spectrum to the echo’s power spectrum) is proposed. In order to reduce computational complexity and processing delay, the partitioned frequency block least mean square (PFBLMS) algorithm is used as the adaptive filtering algorithm. So the adaptive step-size is adjusted in the frequency domain. First, the relationship between the spectral signal-to-interference ratio and the coherence function is established. Second, the step-size is obtained through the coherence function. Third, the adaptive step-size of each frequency point is adjusted in real time according to the calculated value. In addition, the dual filter and the sparse control algorithms are combined to further improve robustness and convergence performance of the system. The computer simulation shows that the system can not only guarantee good echo suppression ability in the double-talk scenario, but also track the changes of the echo channel in time. Compared with the double-talk detection method based on the normalized cross-correlation function and the echo cancellation algorithm in the open source project Speex, the proposed system achieves better echo return loss enhancement (ERLE) and perceptual evaluation of speech quality (PESQ).