Lightweight Speech Enhancement Based on Gated Hybrid Dilated Convolution
CSTR:
Author:
Affiliation:

School of Communications and Information Engineering,Nanjing University of Posts and Telecommunications, Nanjing 210003, China

Clc Number:

TN912.3

Fund Project:

Jiangsu Provincial Major Science and Technology Project (No.BG2024027);National Natural Science Foundation of China (No.61901227).

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    To address the issues of parameter inflation and soaring computational complexity in mainstream speech enhancement models, a lightweight speech enhancement network based on gated hybrid dilated convolution is proposed in this paper. Firstly, a gated hybrid dilated convolution module is designed, which integrates gated linear units with hybrid dilated convolution to achieve multiscale feature extraction of speech signals and precise suppression of noise-sensitive regions, thereby effectively preserving both long-term and short-term speech characteristics while enhancing model robustness. Secondly, a hierarchical channel attention module is proposed to enhance the capture of speech feature correlations in channel dimensions through hierarchical feature fusion, while maintaining low parameter complexity. Experimental results on the VoiceBank+DEMAND dataset demonstrate that the proposed model, with only 0.41 million parameters, achieves competitive performance on the perceptual evaluation of speech quality (PESQ), the short-time objective intelligibility (STOI), cepstral signal-to-noise ratio (CSIG), cepstral background noise(CBAK) and cepstral overall loudness (COVL), thus achieving an organic integration of model lightweighting and high-precision performance.Highlights:1. Propose a lightweight speech enhancement network with gated hybrid dilated convolution.2. Integrate multiscale feature extraction, channel attention, and Ghost convolution for efficient feature modeling.3. Achieve a good balance between enhancement performance and model complexity on VoiceBank+DEMAND.

    Reference
    Related
    Cited by
Get Citation

SUN Linhui, WEI Pengbin, WANG Chunyan, YE Lei, SHAO Xi. Lightweight Speech Enhancement Based on Gated Hybrid Dilated Convolution[J]. Journal of Data Acquisition and Processing,2026,(3):814-824.

Copy
Related Videos

Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:May 21,2025
  • Revised:August 30,2025
  • Adopted:
  • Online: June 10,2026
  • Published:
Article QR Code