Article Information

  • Title: A Novel Method for Speech Segmentation Based on Speaker's Characteristics
  • Authors: Behrouz Abdolali; Hossein Sameti
  • Journal: Signal & Image Processing : An International Journal (SIPIJ)
  • Print ISSN: 2229-3922
  • Electronic ISSN: 0976-710X
  • Year of publication: 2012
  • Volume: 3
  • Issue: 2
  • Page: 65
  • Publisher: Academy & Industry Research Collaboration Center (AIRCC)
  • Abstract: Speech segmentation is the process of change point detection that partitions an input audio stream into regions, each of which corresponds to only one audio source or one speaker. One application is in speaker diarization systems. There are several methods for speaker segmentation; however, most speaker diarization systems use BIC-based segmentation. The main goal of this paper is to propose a new speaker segmentation method that is faster than current methods (e.g., BIC) while retaining acceptable accuracy. The proposed method is based on the pitch frequency of the speech. Its accuracy is similar to that of common speaker segmentation methods, but its computational cost is much lower. We show that our method is about 2.4 times faster than the BIC-based method, while the average accuracy of the pitch-based method is slightly higher than that of the BIC-based method.
  • Keywords: Speaker Diarization; Speech Segmentation; Pitch-based Speech Segmentation
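
For readers unfamiliar with the BIC-based baseline named in the abstract, the following is a minimal sketch of a single-Gaussian delta-BIC change point test applied to a one-dimensional feature track, such as a per-frame pitch estimate in Hz. It is an illustration under stated assumptions, not the authors' pitch-based algorithm, which this record does not describe. The function names, the `margin` and `penalty_weight` parameters, and the variance floor are all hypothetical choices made for this example.

```python
import math


def variance(xs):
    """Maximum-likelihood (biased) variance, floored to avoid log(0)."""
    mean = sum(xs) / len(xs)
    var = sum((x - mean) ** 2 for x in xs) / len(xs)
    return max(var, 1e-6)  # the floor value is an assumption for this sketch


def delta_bic(xs, split, penalty_weight=1.0):
    """Delta BIC for modelling xs as two Gaussians split at `split`
    versus one Gaussian over the whole window (1-D features)."""
    n, n1, n2 = len(xs), split, len(xs) - split
    gain = 0.5 * (
        n * math.log(variance(xs))
        - n1 * math.log(variance(xs[:split]))
        - n2 * math.log(variance(xs[split:]))
    )
    # For d = 1 the two-model hypothesis adds one mean and one variance,
    # so the penalty is 0.5 * 2 * log(n), scaled by penalty_weight.
    penalty = penalty_weight * 0.5 * 2 * math.log(n)
    return gain - penalty


def best_change_point(xs, margin=5, penalty_weight=1.0):
    """Return (index, score) of the split with the largest positive
    delta BIC, or None if no split scores above zero."""
    best = None
    for split in range(margin, len(xs) - margin + 1):
        score = delta_bic(xs, split, penalty_weight)
        if score > 0 and (best is None or score > best[1]):
            best = (split, score)
    return best


# Synthetic pitch track (assumed data): 20 frames near 120 Hz, then 20 near 210 Hz.
low = [120 + (i % 3) for i in range(20)]
high = [210 + (i % 3) for i in range(20)]
print(best_change_point(low + high))
```

The exhaustive scan over candidate splits, each requiring variance estimates for both sides, illustrates why a BIC-style search can be computationally costly, which is the cost the abstract says the pitch-based method reduces.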