
Article information

  • Title: Construction of a Test Collection for Spoken Document Retrieval from Lecture Audio Data
  • Authors: Tomoyosi Akiba; Kiyoaki Aikawa; Yoshiaki Itoh
  • Journal: Information and Media Technologies
  • Electronic ISSN: 1881-0896
  • Publication year: 2009
  • Volume: 4
  • Issue: 2
  • Pages: 485-497
  • DOI: 10.11185/imt.4.485
  • Publisher: Information and Media Technologies Editorial Board
  • Abstract: The lecture is one of the most valuable genres of audiovisual data. Though spoken document processing is a promising technology for utilizing lectures in various ways, it is difficult to evaluate because the evaluation requires subjective judgment and/or the verification of large quantities of evaluation data. In this paper, a test collection for the evaluation of spoken lecture retrieval is reported. The test collection consists of the target spoken documents of about 2,700 lectures (604 hours) taken from the Corpus of Spontaneous Japanese (CSJ), 39 retrieval queries, the relevant passages in the target documents for each query, and the automatic transcription of the target speech data. This paper also reports the retrieval performance obtained on the constructed test collection by applying a standard spoken document retrieval (SDR) method, which serves as a baseline for forthcoming SDR studies using the test collection.