Article Information

  • Title: The Role of Syntactic Planning in Compositional Image Captioning
  • Authors: Emanuele Bugliarello; Desmond Elliott
  • Journal name: Conference of the European Chapter of the Association for Computational Linguistics (EACL)
  • Publication year: 2021
  • Volume: 2021
  • Pages: 593-607
  • DOI: 10.18653/v1/2021.eacl-main.48
  • Language: English
  • Publisher: ACL Anthology
  • Abstract: Image captioning has focused on generalizing to images drawn from the same distribution as the training set, and not to the more challenging problem of generalizing to different distributions of images. Recently, Nikolaus et al. (2019) introduced a dataset to assess compositional generalization in image captioning, where models are evaluated on their ability to describe images with unseen adjective–noun and noun–verb compositions. In this work, we investigate different methods to improve compositional generalization by planning the syntactic structure of a caption. Our experiments show that jointly modeling tokens and syntactic tags enhances generalization in both RNN- and Transformer-based models, while also improving performance on standard metrics.