Language Identification Method for Multi-task Learning Based on Contrastive Predictive Coding Model
CSTR:
Author:
Affiliation:

1.School of Mathematics and Big Data, Guizhou Education University, Guiyang 550018, China;2.Big Data Science and Intelligent Engineering Research Institute, Guizhou Education University, Guiyang 550018, China;3.School of Computer Science and Technology, Harbin Institute of Technology(Shenzhen), Shenzhen 518000,China

Clc Number:

TN912.34

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    The key of language identification is to extract useful features from speech fragments. The time-delayed neural network (TDNN) can extract feature vectors, which contain rich context and improve system performance effectively. This paper proposes a multi-task learning method of ECAPA(Emphasized channel attention)-TDNN+contrastive predictive coding(CPC) network for language identification. ECAPA-TDNN is the main network to extract the global features of language. The improved CPC model is the auxiliary network, and the frame level features extracted by ECAPA-TDNN are compared and predicted. Finally, the joint loss function is used to optimize the network. The proposed method is tested on the 10 language data sets provided by the AP17-OLR data set.The result shows that the identification accuracy of the proposed network is higher than baseline on the 1 s, 3 s and All test data sets of AP17-OLR.

    Reference
    Related
    Cited by
Get Citation

ZHAO Jianchuan, YANG Haoquan, XU Yong, WU Lian, CUI Zhongwei. Language Identification Method for Multi-task Learning Based on Contrastive Predictive Coding Model[J].,2022,37(2):288-297.

Copy
Related Videos

Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:January 17,2022
  • Revised:February 19,2022
  • Adopted:
  • Online: March 25,2022
  • Published:
Article QR Code