Basic Article Information

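  • Title: 対話破綻検出チャレンジ3における対話破綻検出の評価尺度の選定 (Selection of Evaluation Metrics for Dialogue Breakdown Detection in Dialogue Breakdown Detection Challenge 3)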
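  • Authors: 角森 唯子; 東中 竜一郎; 高橋 哲朗
  • Journal: 人工知能学会論文誌 (Transactions of the Japanese Society for Artificial Intelligence)
  • Print ISSN: 1346-0714
  • Electronic ISSN: 1346-8030
  • Publication year: 2020
  • Volume: 35
  • Issue: 1
  • Pages: 1-10
  • DOI: 10.1527/tjsai.DSI-G
  • Publisher: The Japanese Society for Artificial Intelligence
  • Abstract: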

    Dialogue breakdown detection, the task of detecting whether a system utterance causes a dialogue breakdown in a given dialogue context, has been actively researched in recent years. However, it is currently unclear which evaluation metrics should be used to evaluate dialogue breakdown detectors, which hinders progress in the field. In this paper, we identify appropriate metrics for evaluating the detectors in Dialogue Breakdown Detection Challenge 3. In our approach, we first enumerate possible evaluation metrics and then rank them on the basis of system ranking stability and discriminative power. Using the submitted runs (participants' dialogue breakdown detection results) of Dialogue Breakdown Detection Challenge 3, we experimentally found that RSNOD(NB,PB,B) is an appropriate metric for both English and Japanese, while NMD(NB,PB,B) and MSE(NB,PB,B) were found appropriate specifically for English and Japanese, respectively.

  • Keywords: Chat-oriented Dialogue System; Dialogue Breakdown Detection; Evaluation Metrics
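
The abstract names distribution-based metrics over the labels (NB, PB, B) without defining them. As a rough illustration only, the sketch below compares a detector's predicted label distribution with a gold distribution for one utterance. It assumes that NB, PB, and B stand for "not a breakdown", "possible breakdown", and "breakdown" in that order, that MSE is the mean squared error over the three label probabilities, and that NMD is a normalized match distance computed from cumulative probabilities. None of these definitions is given in the abstract, the function names are hypothetical, and the paper's exact formulations may differ. RSNOD is left out because its definition cannot be inferred from the abstract.

```python
from itertools import accumulate
from typing import Callable, Sequence

# Assumed label order, from "not a breakdown" to "breakdown" (not stated in the abstract).
LABELS = ("NB", "PB", "B")

Distribution = Sequence[float]


def mse(pred: Distribution, gold: Distribution) -> float:
    """Mean squared error between two label distributions (assumed definition)."""
    return sum((p - g) ** 2 for p, g in zip(pred, gold)) / len(gold)


def nmd(pred: Distribution, gold: Distribution) -> float:
    """Normalized match distance (assumed definition).

    Sums the absolute differences between the cumulative distributions,
    then divides by (number of labels - 1) so the result lies in [0, 1].
    Unlike MSE, this takes the order of the labels into account.
    """
    cum_pred = list(accumulate(pred))
    cum_gold = list(accumulate(gold))
    match_distance = sum(abs(p - g) for p, g in zip(cum_pred, cum_gold))
    return match_distance / (len(gold) - 1)


def mean_over_utterances(
    metric: Callable[[Distribution, Distribution], float],
    preds: Sequence[Distribution],
    golds: Sequence[Distribution],
) -> float:
    """Average a per-utterance metric over all evaluated system utterances."""
    scores = [metric(p, g) for p, g in zip(preds, golds)]
    return sum(scores) / len(scores)
```

Under these assumed definitions, both functions return 0 when the predicted and gold distributions are identical, and `nmd` reaches its maximum of 1 when all probability mass sits on opposite ends of the label scale.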