首页    期刊浏览 2025年02月22日 星期六
登录注册

文章基本信息

  • 标题:Anthropomorphic Coding of Speech and Audio: A Model Inversion Approach
  • 本地全文:下载
  • 作者:Christian Feldbauer ; Gernot Kubin ; W. Bastiaan Kleijn
  • 期刊名称:EURASIP Journal on Advances in Signal Processing
  • 印刷版ISSN:1687-6172
  • 电子版ISSN:1687-6180
  • 出版年度:2005
  • 卷号:2005
  • 期号:9
  • 页码:1334-1349
  • DOI:10.1155/ASP.2005.1334
  • 出版社:Hindawi Publishing Corporation
  • 摘要:

    Auditory modeling is a well-established methodology that provides insight into human perception and that facilitates the extraction of signal features that are most relevant to the listener. The aim of this paper is to provide a tutorial on perceptual speech and audio coding using an invertible auditory model. In this approach, the audio signal is converted into an auditory representation using an invertible auditory model. The auditory representation is quantized and coded. Upon decoding, it is then transformed back into the acoustic domain. This transformation converts a complex distortion criterion into a simple one, thus facilitating quantization with low complexity. We briefly review past work on auditory models and describe in more detail the components of our invertible model and its inversion procedure, that is, the method to reconstruct the signal from the output of the auditory model. We summarize attempts to use the auditory representation for low-bit-rate coding. Our approach also allows the exploitation of the inherent redundancy of the human auditory system for the purpose of multiple description (joint source-channel) coding.

国家哲学社会科学文献中心版权所有