首页    期刊浏览 2024年07月09日 星期二
登录注册

文章基本信息

  • 标题:Removing Word-Level Spurious Alignment between Images and Pseudo-Captions in Unsupervised Image Captioning
  • 本地全文:下载
  • 作者:Ukyo Honda ; Yoshitaka Ushiku ; Atsushi Hashimoto
  • 期刊名称:Conference on European Chapter of the Association for Computational Linguistics (EACL)
  • 出版年度:2021
  • 卷号:2021
  • 页码:3692-3702
  • DOI:10.18653/v1/2021.eacl-main.323
  • 语种:English
  • 出版社:ACL Anthology
  • 摘要:Unsupervised image captioning is a challenging task that aims at generating captions without the supervision of image-sentence pairs, but only with images and sentences drawn from different sources and object labels detected from the images. In previous work, pseudo-captions, i.e., sentences that contain the detected object labels, were assigned to a given image. The focus of the previous work was on the alignment of input images and pseudo-captions at the sentence level. However, pseudo-captions contain many words that are irrelevant to a given image. In this work, we investigate the effect of removing mismatched words from image-sentence alignment to determine how they make this task difficult. We propose a simple gating mechanism that is trained to align image features with only the most reliable words in pseudo-captions: the detected object labels. The experimental results show that our proposed method outperforms the previous methods without introducing complex sentence-level learning objectives. Combined with the sentence-level alignment method of previous work, our method further improves its performance. These results confirm the importance of careful alignment in word-level details.
国家哲学社会科学文献中心版权所有