首页    期刊浏览 2024年09月15日 星期日
登录注册

文章基本信息

  • 标题:Detection and Separation of Speech Event Using Audio and Video Information Fusion and Its Application to Robust Speech Interface
  • 本地全文:下载
  • 作者:Futoshi Asano ; Kiyoshi Yamamoto ; Isao Hara
  • 期刊名称:EURASIP Journal on Advances in Signal Processing
  • 印刷版ISSN:1687-6172
  • 电子版ISSN:1687-6180
  • 出版年度:2004
  • 卷号:2004
  • 期号:11
  • 页码:1727-1738
  • DOI:10.1155/S1110865704402303
  • 出版社:Hindawi Publishing Corporation
  • 摘要:

    A method of detecting speech events in a multiple-sound-source condition using audio and video information is proposed. For detecting speech events, sound localization using a microphone array and human tracking by stereo vision is combined by a Bayesian network. From the inference results of the Bayesian network, information on the time and location of speech events can be known. The information on the detected speech events is then utilized in the robust speech interface. A maximum likelihood adaptive beamformer is employed as a preprocessor of the speech recognizer to separate the speech signal from environmental noise. The coefficients of the beamformer are kept updated based on the information of the speech events. The information on the speech events is also used by the speech recognizer for extracting the speech segment.

国家哲学社会科学文献中心版权所有