Spoken Document Classification Based on Fusion of Acoustic Features and Deep Features
CSTR:
Author:
Affiliation:

National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Heifei 230027, China

Clc Number:

TN911

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    Traditional speech document classification systems are usually completed through the transcribed text from speech recognition systems, which suffer from the recognition errors. Although the fusion of speech and recognized text can reduce the impact of recognition errors to some extent, the fusion that is made at the level of representation vector does not take full advantage of the complementarity between speech and text information. A neural network spoken document classification system based on the fusion of acoustic feature and deep feature is proposed in this paper. In the training procedure of the neural network,a trained acoustic model is first adopted to generate deep feature that contains semantic information for each document. Then acoustic feature and deep feature of each spoken document are fused frame by frame through the gating mechanism. Finally, the fused feature is used for spoken document classification. The proposed system is evaluated on a speech news broadcast corpus. The experimental result showed that the proposed system was obviously superior to the spoken document classification systems based on the fusion of speech and text, and the final accuracy reached 97.27%.

    Reference
    Related
    Cited by
Get Citation

LIU Tan, GUO Wu. Spoken Document Classification Based on Fusion of Acoustic Features and Deep Features[J].,2021,36(5):932-938.

Copy
Related Videos

Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:January 12,2021
  • Revised:July 12,2021
  • Adopted:
  • Online: September 25,2021
  • Published:
Article QR Code