首页    期刊浏览 2024年12月05日 星期四
登录注册

文章基本信息

  • 标题:Can automatic speech recognition be satisficing for audio/video search? Keyword-focused analysis of Hebrew automatic and manual transcription
  • 本地全文:下载
  • 作者:Vered Silber-Varod ; Nitza Geri
  • 期刊名称:The Online Journal of Applied Knowledge Management
  • 印刷版ISSN:2325-4688
  • 出版年度:2014
  • 卷号:2
  • 期号:1
  • 页码:104-121
  • 出版社:The International Institute for Applied Knowledge Management
  • 摘要:With massive amounts of academic audio and video content over the web, it is important to assess the performance of state-of-the-art automatic speech recognition (ASR) systems for audio/video navigation through search queries. This paper suggests a novel perspective of the challenges of ASR: instead of minimizing word error rates (WER), focus on keyword recognition. Focusing on keywords may be worthwhile for under-reso urced languages, such as Hebrew, which their ASR systems have not yet reached a satisfactory accuracy level of transcription. We provide an initial Proof of Concept by demonstrating the feasible use of ASR for achieving affordable mass transcription that enables satisficing keyword recognition of a video or an audio lecture via a search engine. A forty-minutes recording set, which includes audio books and academic lectures, is used for measuring the performance of two Hebrew ASR systems, and comparing them to stenographer recordings of the video lectures, while focusing on keyword recognition. Keyness tests show advantage of keyword recognition over key-phrases results, and stenographers' records exceed both engines. Yet, keyword recognition up to 78% was achieved, which suggests that ASR has reached a satisficing accuracy level that enables its use for searching audio/video content on the web
  • 关键词:auto matic speech recognition (ASR); audio/video search; academic video lectures; audio books; manual ; transcription; transcription of under-resourced languages; keyword search
国家哲学社会科学文献中心版权所有