Article information

  • Title: Chord-aware automatic music transcription based on hierarchical Bayesian integration of acoustic and language models
  • Authors: Yuta Ojima; Eita Nakamura; Katsutoshi Itoyama
  • Journal: APSIPA Transactions on Signal and Information Processing
  • Print ISSN: 2048-7703
  • Electronic ISSN: 2048-7703
  • Year of publication: 2018
  • Volume: 7
  • Pages: 1-14
  • DOI: 10.1017/ATSIP.2018.17
  • Publisher: Cambridge University Press
  • Abstract: This paper describes automatic music transcription with chord estimation for music audio signals. We focus on the fact that concurrent structures of musical notes such as chords form the basis of harmony and are considered for music composition. Since chords and musical notes are deeply linked with each other, we propose joint pitch and chord estimation based on a Bayesian hierarchical model that consists of an acoustic model representing the generative process of a spectrogram and a language model representing the generative process of a piano roll. The acoustic model is formulated as a variant of non-negative matrix factorization that has binary variables indicating a piano roll. The language model is formulated as a hidden Markov model that has chord labels as the latent variables and emits a piano roll. The sequential dependency of a piano roll can be represented in the language model. Both models are integrated through a piano roll in a hierarchical Bayesian manner. All the latent variables and parameters are estimated using Gibbs sampling. The experimental results showed the great potential of the proposed method for unified music transcription and grammar induction.
  • Keywords: Automatic Music Transcription; Chord Estimation; Non-negative Matrix Factorization; Bayesian Inference
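
The two-layer structure described in the abstract can be illustrated with a toy forward sampler. This is only a minimal sketch under stated assumptions, not the authors' implementation: the function name `sample_generative_model`, all dimensions, the sticky transition matrix, the Beta and Gamma priors, and the Poisson observation model are hypothetical choices made for illustration. The record above states only that a chord HMM emits a binary piano roll and that an NMF variant gated by that piano roll generates the spectrogram. The Gibbs sampling inference mentioned in the abstract is not shown.

```python
import numpy as np

def sample_generative_model(n_frames=50, n_pitches=12, n_chords=4,
                            n_bins=64, seed=0):
    """Draw (chords, piano_roll, spectrogram) from a toy two-layer model."""
    rng = np.random.default_rng(seed)

    # Language model: an HMM whose hidden states are chord labels.
    # Assumed transition matrix that favours staying on the same chord.
    trans = np.full((n_chords, n_chords), 0.1 / (n_chords - 1))
    np.fill_diagonal(trans, 0.9)
    # Assumed per-chord probability that each pitch is active.
    emit = rng.beta(1.0, 3.0, size=(n_chords, n_pitches))

    chords = np.empty(n_frames, dtype=int)
    chords[0] = rng.integers(n_chords)
    for t in range(1, n_frames):
        chords[t] = rng.choice(n_chords, p=trans[chords[t - 1]])

    # Binary piano roll S[k, t], emitted by the chord active at frame t.
    active_prob = emit[chords].T                      # (n_pitches, n_frames)
    piano_roll = (rng.random((n_pitches, n_frames)) < active_prob).astype(int)

    # Acoustic model: NMF-like factorisation gated by the piano roll.
    basis = rng.gamma(1.0, 1.0, size=(n_bins, n_pitches))    # spectra W
    gains = rng.gamma(2.0, 1.0, size=(n_pitches, n_frames))  # volumes H
    rate = basis @ (gains * piano_roll)                      # W (H * S)
    spectrogram = rng.poisson(rate)   # assumed Poisson observation noise
    return chords, piano_roll, spectrogram

chords, piano_roll, spectrogram = sample_generative_model()
print(chords.shape, piano_roll.shape, spectrogram.shape)
```

The piano roll is the only variable shared by both layers, which is the sense in which the abstract says the two models are "integrated through a piano roll": a pitch that is switched off in `piano_roll` contributes nothing to the spectrogram, whatever its gain.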