Unsupervised Query-by-Example Spoken Term Detection Based on Acoustic Segment Models




Abstract:

This paper presents a study of acoustic segment models (ASMs) for unsupervised query-by-example spoken term detection. First, a Gaussian mixture model (GMM) is trained without any transcription information and used to label speech frames with Gaussian posteriorgrams. Hierarchical agglomerative clustering then decomposes the posterior features into acoustically homogeneous segments. Each resulting segment is assigned a label by k-means clustering, and the labeled posteriorgrams are used to train the ASMs. In the query matching phase, Viterbi decoding is used to represent the query and test posteriorgrams as ASM sequences. Dynamic match lattice spotting based on minimum edit distance is then used to locate possible occurrences of the query term. Experimental results show that the proposed method outperforms conventional GMM and ASM tokenizers.
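
The matching step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the query and the test utterance have already been decoded into single-best sequences of ASM labels (plain integers here), it uses uniform substitution, insertion, and deletion costs, and it searches one token sequence rather than a lattice. The function name `spot_query` and its parameters are hypothetical.

```python
def spot_query(query, test, sub_cost=1, ins_cost=1, del_cost=1):
    """Find the best approximate occurrence of `query` inside `test`.

    Both arguments are sequences of ASM labels (assumed to be 1-best
    decoder output). Returns (min_edit_cost, end_index), where end_index
    is the number of test tokens consumed when the best match ends.
    """
    # prev[i]: cheapest cost of aligning query[:i] with a test substring
    # that ends at the previous test position.
    prev = [i * del_cost for i in range(len(query) + 1)]
    best_cost, best_end = prev[-1], 0

    for j, t in enumerate(test, start=1):
        # curr[0] = 0 lets a match start at any test position for free.
        curr = [0]
        for i, q in enumerate(query, start=1):
            curr.append(min(
                prev[i - 1] + (0 if q == t else sub_cost),  # match or substitution
                prev[i] + ins_cost,                         # extra token in test
                curr[i - 1] + del_cost,                     # query token missing in test
            ))
        # Keep the earliest end position with the lowest total cost.
        if curr[-1] < best_cost:
            best_cost, best_end = curr[-1], j
        prev = curr

    return best_cost, best_end
```

In this sketch, a candidate occurrence would be reported whenever the returned cost falls below a chosen threshold. A fuller system would typically replace the uniform costs with label-dependent costs and search over multiple decoding hypotheses.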


Citation:

Li Bohao, Zhang Lianhai, Zheng Yongjun. Unsupervised Query-by-Example Spoken Term Detection Based on Acoustic Segment Models[J]. Journal of Data Acquisition and Processing, 2016, 31(2): 407-414.


History

Online: April 09, 2018
