首页    期刊浏览 2024年11月30日 星期六
登录注册

文章基本信息

  • 标题:SPEAKER IDENTIFICATION USING 2-D DCT, WALSH AND HAAR ON FULL AND BLOCK SPECTROGRAM
  • 本地全文:下载
  • 作者:Dr. H. B. Kekre ; Dr. Tanuja K. Sarode ; Shachi J. Natu
  • 期刊名称:International Journal on Computer Science and Engineering
  • 印刷版ISSN:2229-5631
  • 电子版ISSN:0975-3397
  • 出版年度:2010
  • 卷号:2
  • 期号:5
  • 页码:1733-1740
  • 出版社:Engg Journals Publications
  • 摘要:This paper aims to provide different approaches to text dependent speaker identification using DCT, Walsh and Haar transform along with use of spectrograms. Spectrograms obtained from speech samples are used as image database for the study undertaken. This image database is then subjected to various transforms. Using Euclidean distance as measure of similarity, most appropriate speaker match is obtained and is declared as identified speaker. Each transform is applied to spectrograms in two different ways: on full image and on image blocks. In both the ways, effect of different number of coefficients of transformed image is observed. Haar transform on full image reduces multiplications required by DCT and Walsh by 28 times whereas applying Haar transform on image blocks requires 18 times less mathematical computations as compared to DCT and Walsh on image blocks. Transforms when applied to image blocks, yield better or equal identification rates with reduced computational complexity.
  • 关键词:Speaker identification; Speaker Recognition; Spectrograms; DCT; WALSH; HAAR;; Image Blocks
国家哲学社会科学文献中心版权所有