基于分段动态时间规整的语音样例快速检索
DOI:
作者:
作者单位:

作者简介:

通讯作者:

基金项目:


Fast Query-by-Example Spoken Term Detection Using SegmentalDynamic Time Warping
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
    摘要:

    提出了一种融合下界估计和分段动态时间规整的语音样例快速检索方法。该方法针对缺乏合适的训练数据等语音资源较为有限的语言进行快速检索所设计。此方法首先提取查询样例和测试集的音素后验概率;然后,根据限制条件在测试语句中选定候选分段,并计算查询样例和每个候选分段之间实际动态时间规整得分的下界估计,再运用K最近邻搜索算法搜索与查询样例相似度最高的分段;最后,使用虚拟相关反馈技术对检索结果进行修正。实验结果表明:尽管此方法的检索精度略低于直接运用动态时间规整进行检索的检索精度,但其检索速度大大优于后者,且检索结果经过虚拟相关反馈技术修正后,其检索精度也得到有效提升。

    Abstract:

    This paper presents a method of query-by-example spoken term detection(QbE STD) using segmental dynamic time warping(SDTW) and lower-bound estimate(LBE). The approach is designed for low-resource situations in which limited or no in-domain training material is available. According to this method, the phone posterior probabilities of query examples and test materials should first of all be got, and then the candidate segments are selected in test materials and the lower-bound estimates of actual DTW scores are computed between the query example and all candidate segments in test materials quickly. the K nearest neighbor (KNN) search algorithm is chosen to search for the segments that have maximal similarity. Finally, the retrieval results can be modified by pseudo relevance feedback(PRF). The experimental result indicates that although there is a slightly degraded in retrieval precision when compared with formulating a DTW procedure directly, the retrieval speed of the method presented by this paper has a big advantage over the latter, and the retrieval precision can be enhanced availably after the retrieval results modified by PRF. .

    参考文献
    相似文献
    引证文献
引用本文

冯志远,张连海.基于分段动态时间规整的语音样例快速检索[J].数据采集与处理,2014,29(2):274-279

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2014-05-08