期刊名称:International Journal of Hybrid Information Technology
印刷版ISSN:1738-9968
出版年度:2008
卷号:1
期号:3
出版社:SERSC
摘要:Audio has a key index in digital videos that can provide useful information for video editing,such as capturing conversations only, clipping only talking people, and so on. In this paper, weare studying about video editing based on audio with a two-channel (stereo) microphone that isstandard equipment on video cameras, where the video content is automatically recorded without acameraman. In order to capture only a talking person on video, a novel voice/non-voice detectionalgorithm using AdaBoost, which can achieve extremely high detection rates in noisy environments,is used. In addition, the sound source direction is estimated by the CSP (Crosspower-SpectrumPhase) method in order to zoom in on the talking person by clipping frames from videos, where atwo-channel (stereo) microphone is used to obtain information about time differences between themicrophones.