基于门控混合膨胀卷积的轻量级语音增强
DOI:
作者:
作者单位:

南京邮电大学

作者简介:

通讯作者:

基金项目:

国家自然科学基金(61901227),江苏省科技重大专项(BG2024027)


Lightweight Speech Enhancement Based on Gated Hybrid Dilated Convolution
Author:
Affiliation:

Nanjing University of Posts and Telecommunications

Fund Project:

The National Natural Science Foundation of China( 61901227),Jiangsu Provincial Major Science and Technology Project (BG2024027)

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
    摘要:

    针对主流语音增强模型存在的参数量膨胀以及计算复杂度剧增的问题,本文提出了一种基于门控混合膨胀卷积的轻量级语音增强网络。首先,设计了一种门控混合膨胀卷积模块,该模块结合门控线性单元与混合膨胀卷积,实现对语音信号的多尺度特征提取以及对噪声敏感区域的精准抑制,从而在有效保留语音长短时特征的同时,增强模型的鲁棒性;其次,设计了一种层级通道注意力模块,通过层级式特征融合,在低参数量条件下提升对通道维度中语音特征相关性的捕捉能力。在VoiceBank+DEMAND数据集上进行的实验结果表明,本文模型以仅0.41M的参数量,在PESQ、STOI、CSIG、CBAK和COVL五项指标上取得良好表现,实现了模型轻量化与优性能的有机结合。

    Abstract:

    To address the issues of parameter inflation and soaring computational complexity in mainstream speech enhancement models, a lightweight speech enhancement network based on gated hybrid dilated convolution is proposed in this paper. Firstly, a gated hybrid dilated convolution module is designed, which integrates gated linear units with hybrid dilated convolution to achieve multiscale feature extraction of speech signals and precise suppression of noise-sensitive regions, thereby effectively preserving both long-term and short-term speech characteristics while enhancing model robustness. Secondly, a hierarchical channel attention module is proposed to enhance the capture of speech feature correlations in channel dimensions through hierarchical feature fusion, while maintaining low parameter complexity. Experimental results on the VoiceBank+DEMAND dataset demonstrate that our proposed model, with only 0.41 million parameters, achieves competitive performance on the PESQ, STOI, CSIG, CBAK, and COVL metric, thus achieving an organic integration of model lightweighting and high-precision performance.

    参考文献
    相似文献
    引证文献
引用本文
分享
文章指标
  • 点击次数:
  • 下载次数:
历史
  • 收稿日期:2025-05-21
  • 最后修改日期:2025-08-30
  • 录用日期:2025-11-10
  • 在线发布日期: