
Article Information

  • Title: Joint Audio-Visual Tracking Using Particle Filters
  • Authors: Dmitry N. Zotkin; Ramani Duraiswami; Larry S. Davis
  • Journal: EURASIP Journal on Advances in Signal Processing
  • Print ISSN: 1687-6172
  • Electronic ISSN: 1687-6180
  • Publication year: 2002
  • Volume: 2002
  • Issue: 11
  • Pages: 1154-1164
  • DOI: 10.1155/S1110865702206058
  • Publisher: Hindawi Publishing Corporation
  • Abstract:

    It is often advantageous to track objects in a scene using multimodal information when such information is available. We use audio as a complementary modality to video data, which, in comparison to vision, can provide faster localization over a wider field of view. We present a particle-filter-based tracking framework for performing multimodal sensor fusion for tracking people in a videoconferencing environment using multiple cameras and multiple microphone arrays. One advantage of our proposed tracker is its ability to seamlessly handle temporary absence of some measurements (e.g., camera occlusion or silence). Another advantage is the possibility of self-calibration of the joint system to compensate for imprecision in the knowledge of array or camera parameters by treating them as containing an unknown statistical component that can be determined using the particle filter framework during tracking. We implement the algorithm in the context of a videoconferencing and meeting recording system. The system also performs high-level semantic analysis of the scene by keeping participant tracks, recognizing turn-taking events, and recording an annotated transcript of the meeting. Experimental results are presented. Our system operates in real time and is shown to be robust and reliable.
