首页    期刊浏览 2024年07月09日 星期二
登录注册

文章基本信息

  • 标题:HMM-Based Photo-Realistic Talking Face Synthesis Using Facial Expression Parameter Mapping with Deep Neural Networks
  • 本地全文:下载
  • 作者:Kazuki Sato ; Takashi Nose ; Akinori Ito
  • 期刊名称:Journal of Computer and Communications
  • 印刷版ISSN:2327-5219
  • 电子版ISSN:2327-5227
  • 出版年度:2017
  • 卷号:05
  • 期号:10
  • 页码:50-65
  • DOI:10.4236/jcc.2017.510006
  • 语种:English
  • 出版社:Scientific Research Publishing
  • 摘要:This paper proposes a technique for synthesizing a pixel-based photo-realistic talking face animation using two-step synthesis with HMMs and DNNs. We introduce facial expression parameters as an intermediate representation that has a good correspondence with both of the input contexts and the output pixel data of face images. The sequences of the facial expression parameters are modeled using context-dependent HMMs with static and dynamic features. The mapping from the expression parameters to the target pixel images are trained using DNNs. We examine the required amount of the training data for HMMs and DNNs and compare the performance of the proposed technique with the conventional PCA-based technique through objective and subjective evaluation experiments.
  • 关键词:Visual-Speech Synthesis;Talking Head;Hidden Markov Models (HMMs);Deep Neural Networks (DNNs);Facial Expression Parameter
国家哲学社会科学文献中心版权所有