Article information

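Title: 音声対話システムにおける音声合成のための対話行為情報を利用した文末音調ラベル推定 (Sentence-final tone label estimation using dialogue-act information for speech synthesis in spoken dialogue systems)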
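Authors: 北条伸克; 井島勇祐; 杉山弘晃 et al.
Journal: 人工知能学会論文誌 (Transactions of the Japanese Society for Artificial Intelligence)
Print ISSN: 1346-0714
Online ISSN: 1346-8030
Publication year: 2020
Volume: 35
Issue: 4
Pages: 1-11
DOI: 10.1527/tjsai.A-JA5
Publisher: The Japanese Society for Artificial Intelligence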
Abstract: This paper proposes a novel sentence-final tone label estimation method using dialogue-act (DA) information for text-to-speech synthesis within a spoken dialogue system. Estimating appropriate sentence-final tone labels is considered essential for communicating the system's exact intentions to users through an utterance. In this paper, we propose to utilize DA features, in addition to the conventional features (morphological information of the utterance text), to estimate the sentence-final tone labels. For this study, we use the speech database with DA tags that we constructed in our previous study. We added sentence-final tone labels to this database so that each utterance has its utterance text, DA, and sentence-final tone label. Based on this database, we build the proposed sentence-final tone estimation model. We evaluated the proposed method by comparing its performance with the conventional method. The evaluation results show that the proposed method outperforms the conventional method in accuracy. We also analyze the estimation results to investigate the efficacy and the difficulties of the proposed method.
Keywords: sentence final tone labels; dialogue-act; speech synthesis; spoken dialogue systems