Zero Resource Korean ASR Based on Acoustic Model Sharing
CSTR:
Author:
Affiliation:

1.Beijing National Research Center for Information Science and Technology, Department of Electronic Engineering, Tsinghua University, Beijing 100084, China;2.Beijing Haitian Ruisheng Science Technology Ltd., Beijing 100083, China

Clc Number:

TN912

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    A precise speech recognition system usually is based on a large amount of training data with handcrafted transcription, which sets a barrier to the recognition of many low-resource languages. Acoustic model sharing, which is based on the similarity of certain rich and low resource language pair, provides a new method to solve the problem and helps to build an automatic speech recognition (ASR) system without any training data of the given low resource language. This paper expands the method to Korean speech recognition. Specifically, we train an acoustic model on Mandarin data, and lay down a set of mapping rules between Mandarin and Korean phonemes. A character error rate (CER) of 27.33% is achieved on Zeroth Korean test set without using any Korean speech data. Moreover, we also test the difference between source-to-target and target-to-source phoneme mapping rules, and prove that the latter is more appropriate for acoustic model sharing.

    Reference
    Related
    Cited by
Get Citation

Wang Haoyu, Jeon Eunah, Zhang Weiqiang, Li Ke, Huang Yukai. Zero Resource Korean ASR Based on Acoustic Model Sharing[J].,2023,38(1):93-100.

Copy
Related Videos

Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:October 19,2021
  • Revised:November 04,2021
  • Adopted:
  • Online: January 25,2023
  • Published:
Article QR Code