Abstract: This paper proposes a technique for synthesizing pixel-based photo-realistic talking face animation using a two-step synthesis with HMMs and DNNs. We introduce facial expression parameters as an intermediate representation that corresponds well with both the input contexts and the output pixel data of face images. The sequences of facial expression parameters are modeled using context-dependent HMMs with static and dynamic features. The mapping from the expression parameters to the target pixel images is trained using DNNs. We examine the amount of training data required for the HMMs and DNNs, and compare the performance of the proposed technique with that of a conventional PCA-based technique through objective and subjective evaluation experiments.
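As a rough illustration of the two-step pipeline summarized above, the sketch below wires together the two stages: context labels → facial expression parameter trajectory (the HMM stage) → pixel images (the DNN stage). All names here (`generate_parameter_trajectory`, `ExpressionToPixelDNN`, the frame counts, parameter and image dimensions) are hypothetical placeholders, not the paper's implementation; both the HMM parameter generation and the trained DNN are stubbed out with toy computations so the data flow is runnable end to end.

```python
# Minimal sketch of the two-step synthesis pipeline, assuming NumPy only.
# Both stages are placeholders: the real system would use context-dependent
# HMMs (with static/dynamic features) for stage 1 and a trained DNN for stage 2.
import numpy as np


def generate_parameter_trajectory(context_labels, n_params=30):
    """Stage 1 (stub): context-dependent HMMs would emit a smooth trajectory
    of facial expression parameters; here we fabricate one for illustration."""
    n_frames = 10 * len(context_labels)            # assume 10 frames per context
    t = np.linspace(0.0, 1.0, n_frames)[:, None]   # (T, 1) time axis
    # Fake smooth per-parameter trajectories, shape (T, n_params)
    return np.sin(2 * np.pi * t * np.arange(1, n_params + 1))


class ExpressionToPixelDNN:
    """Stage 2 (stub): a one-hidden-layer network mapping each frame of
    expression parameters to a flattened face image (random weights here)."""

    def __init__(self, n_params=30, n_hidden=256, image_shape=(64, 64)):
        rng = np.random.default_rng(0)
        self.image_shape = image_shape
        self.W1 = rng.normal(0.0, 0.1, (n_params, n_hidden))
        self.W2 = rng.normal(0.0, 0.1, (n_hidden, image_shape[0] * image_shape[1]))

    def __call__(self, params):                    # params: (T, n_params)
        h = np.tanh(params @ self.W1)              # hidden activations
        return (h @ self.W2).reshape(-1, *self.image_shape)  # (T, H, W) frames


contexts = ["sil", "a", "i", "sil"]                # toy context sequence
expr = generate_parameter_trajectory(contexts)     # HMM stage (stubbed)
frames = ExpressionToPixelDNN()(expr)              # DNN stage (stubbed)
print(frames.shape)                                # e.g. (40, 64, 64)
```

The intermediate expression parameters are what decouple the two stages: the HMMs only need to model a low-dimensional, context-aligned representation, while the DNN handles the high-dimensional mapping to pixels.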