首页    期刊浏览 2024年11月27日 星期三
登录注册

文章基本信息

  • 标题:Training End-to-End Dialogue Systems with the Ubuntu Dialogue Corpus
  • 本地全文:下载
  • 作者:Ryan Lowe ; Nissan Pow ; Iulian Vlad Serban
  • 期刊名称:Dialogue and Discourse
  • 电子版ISSN:2152-9620
  • 出版年度:2017
  • 卷号:8
  • 期号:1
  • 页码:31-65
  • DOI:10.5087/dad.2017.102
  • 出版社:Linguistic Society of America
  • 摘要:In this paper, we construct and train end-to-end neural network-based dialogue systems usingan updated version of the recent Ubuntu Dialogue Corpus, a dataset containing almost 1 million multi-turn dialogues, with a total of over 7 million utterances and 100 million words. This dataset is interesting because of its size, long context lengths, and technical nature; thus, it can be used to train large models directly from data with minimal feature engineering, which can be both time consuming and expensive. We provide baselines  in two different environments: one where models are trained to maximize the log-likelihood of a generated utterance  conditioned on the context of the conversation, and one where models are trained to select the correct next response from a list of candidate responses. These are both evaluated on a recall task that we call Next Utterance Classification (NUC), as well as other generation-specific metrics. Finally, we provide a qualitative error analysis to help determine the most promising directions for future research on the Ubuntu  Dialogue Corpus, and for end-to-end dialogue systems in general.
国家哲学社会科学文献中心版权所有