首页    期刊浏览 2024年07月22日 星期一
登录注册

文章基本信息

  • 标题:Spoken Term Detection of Zero-Resource Language Using Posteriorgram of Multiple Languages
  • 本地全文:下载
  • 作者:Satoru MIZUOCHI ; Takashi NOSE ; Akinori ITO
  • 期刊名称:Interdisciplinary Information Sciences
  • 印刷版ISSN:1340-9050
  • 电子版ISSN:1347-6157
  • 出版年度:2022
  • 卷号:28
  • 期号:1
  • 页码:1-13
  • DOI:10.4036/iis.2022.A.04
  • 语种:English
  • 出版社:The Editorial Committee of the Interdisciplinary Information Sciences
  • 摘要:We propose in this paper a query-by-example spoken term detection (QbE-STD) method for keyword detection from zero-resource language speech databases. The proposed method employs the phonetic posteriorgram (PPG) trained with multiple resource-rich languages and combines multilingual PPGs for speech representation. The keywords are detected using the dynamic time warping method. We examined three types of combination of multiple languages such as concatenation of PPG (PPG CONC), a combination of language resources to calculate multilingual PPG (PPG ALL), and multi-task training of PPG using multiple languages (PPG DIV). We carried out an experiment of the QbE-STD from Kaqchikel speech. As a result, the use of PPG showed better detection performance than the method based on the conventional speech feature (MFCC), and the use of multiple languages gave a further improvement of detection.
  • 关键词:spoken term detection;query by example;zero-resource language;Kaqchikel;phonetic posteriorgram
国家哲学社会科学文献中心版权所有