Article information

Title: 傾聴対話システムのための言語情報と韻律情報に基づく多様な形態の相槌の生成
Authors: 山口貴史; 井上昂治; 吉野幸一郎 et al.
Journal: 人工知能学会論文誌
Print ISSN: 1346-0714
Online ISSN: 1346-8030
Year of publication: 2016
Volume: 31
Issue: 4
Pages: C-G31_1-10
DOI: 10.1527/tjsai.C-G31
Publisher: The Japanese Society for Artificial Intelligence
Abstract: There is growing interest in conversational agents and robots that conduct attentive listening. However, current systems generate the same or a limited set of backchannel forms every time, giving a monotonous impression. This study investigates the generation of a variety of backchannel forms appropriate to the dialogue context, using a corpus of counseling dialogues. First, we annotate all acceptable backchannel form categories, taking into account the permissible variation in backchannels. Second, we analyze how the morphological form of backchannels relates to linguistic features of the preceding utterance, such as the utterance boundary type and the linguistic complexity. Based on this analysis, we train a machine learning model to predict the backchannel form from the linguistic and prosodic features of the preceding context. This model outperformed a baseline that always outputs the same backchannel form and another baseline that generates backchannels randomly. Finally, subjective evaluations by human listeners show that the proposed method generates more natural backchannels and conveys a feeling of understanding and empathy.
Keywords: spoken dialogue system; conversation agent; attentive listening; backchannel