首页    期刊浏览 2024年09月29日 星期日
登录注册

文章基本信息

  • 标题:Speaker Change Detection based on Mean Shift
  • 本地全文:下载
  • 作者:Yang, Ji-chen ; He, Qian-hua ; Li, Yan-xiong
  • 期刊名称:Journal of Computers
  • 印刷版ISSN:1796-203X
  • 出版年度:2013
  • 卷号:8
  • 期号:3
  • 页码:638-644
  • DOI:10.4304/jcp.8.3.638-644
  • 语种:English
  • 出版社:Academy Publisher
  • 摘要:To settle out the problem that search of speaker change point (SCP) is blind and exhaustive, mean shift is proposed to seek SCP by estimating the kernel density of speech stream in this paper. It contains three steps: seeking peak points using mean shift firstly, using maximum likelihood ratio (MLR) to compute the MLR value of the peak points secondly, and seeking SCPs from MLR value using the maximum method thirdly. The relationship of MLR and BIC is given then. Compared with those methods of using metric or model, the process of seeking SCP is no longer blind because mean shift always points the direction of maximum increase in the density. The experiments show that the proposed algorithm can arrive a comparable result against to BIC and DISTBIC, while it can save detection time, for a 3-second speech segment , the time using the proposed algorithm is about 60% of DISTBIC and 45% of BIC . Further investigation and improvement about this method is discussed at the end of this paper.
  • 关键词:Speaker change detection;mean shift;kernel density estimation;peak point;maximum likelihood ratio
国家哲学社会科学文献中心版权所有