首页    期刊浏览 2024年10月06日 星期日
登录注册

文章基本信息

  • 标题:PodCastle: A Spoken Document Retrieval System Improved by User Contributions
  • 本地全文:下载
  • 作者:Masataka Goto ; Jun Ogata ; Kouichirou Eto
  • 期刊名称:人工知能学会論文誌
  • 印刷版ISSN:1346-0714
  • 电子版ISSN:1346-8030
  • 出版年度:2010
  • 卷号:25
  • 期号:1
  • 页码:104-113
  • DOI:10.1527/tjsai.25.104
  • 出版社:The Japanese Society for Artificial Intelligence
  • 摘要:In this paper, we describe a public web service, ``PodCastle'' , that provides full-text searching of speech data (Japanese podcasts) on the basis of automatic speech recognition technologies. This is an instance of our research approach, ``Speech Recognition Research 2.0'' , which is aimed at providing users with a web service based on Web 2.0 so that they can experience state-of-the-art speech recognition performance, and at promoting speech recognition technologies in cooperation with anonymous users. PodCastle enables users to find podcasts that include a search term, read full texts of their recognition results, and easily correct recognition errors by simply selecting from a list of candidates. Even if a state-of-the-art speech recognizer is used to recognize podcasts on the web, a number of errors will naturally occur. PodCastle therefore encourages users to cooperate by correcting these errors so that those podcasts can be searched more reliably. Furthermore, using the resulting corrections to train the speech recognizer, it implements a mechanism whereby the speech recognition performance is gradually improved. Our experience with this web service showed that user contributions we collected actually improved the performance of PodCastle.
  • 关键词:information retrieval ; speech recognition ; error correction ; wisdom of crowds ; Web 2.0
国家哲学社会科学文献中心版权所有