首页    期刊浏览 2024年07月06日 星期六
登录注册

文章基本信息

  • 标题:Discriminative features based on modified log magnitude spectrum for playback speech detection
  • 本地全文:下载
  • 作者:Jichen Yang ; Longting Xu ; Bo Ren
  • 期刊名称:EURASIP Journal on Audio, Speech, and Music Processing
  • 印刷版ISSN:1687-4714
  • 电子版ISSN:1687-4722
  • 出版年度:2020
  • 卷号:2020
  • 期号:1
  • 页码:1
  • DOI:10.1186/s13636-020-00173-5
  • 出版社:Hindawi Publishing Corporation
  • 摘要:In order to improve the performance of hand-crafted features to detect playback speech, two discriminative features, constant-Q variance-based octave coefficients and constant-Q mean-based octave coefficients, are proposed for playback speech detection in this work. They rely on our findings that variance-based modified log magnitude spectrum and mean-based modified log magnitude spectrum can enhance the discriminative power between genuine speech and playback speech. Then constant-Q variance-based octave coefficients (constant-Q mean-based octave coefficients) can be obtained by combining variance-based modified log magnitude spectrum (mean-based modified log magnitude spectrum), octave segmentation, and discrete cosine transform. Finally, constant-Q variance-based octave coefficients and constant-Q mean-based octave coefficients are evaluated on ASVspoof 2017 corpus version 2.0 and ASVspoof 2019 physical access, respectively. Experimental results show that variance-based modified log magnitude spectrum and mean-based modified log magnitude spectrum can produce discriminative features toward playback speech. Further results on the two databases show that constant-Q variance-based octave coefficients and constant-Q mean-based octave coefficients can perform better than some common features, such as mel frequency cepstral coefficients and constant-Q cepstral coefficients.
  • 关键词:Discriminative feature ; Playback attack detection ; Modified log magnitude spectrum ; Constant-Q variance-based octave coefficients ; Constant-Q mean-based octave coefficients
国家哲学社会科学文献中心版权所有