首页    期刊浏览 2024年07月06日 星期六
登录注册

文章基本信息

  • 标题:Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments forMS-COCO
  • 本地全文:下载
  • 作者:Zarana Parekh ; Jason Baldridge ; Daniel Cer
  • 期刊名称:Conference on European Chapter of the Association for Computational Linguistics (EACL)
  • 出版年度:2021
  • 卷号:2021
  • 页码:2855-2870
  • DOI:10.18653/v1/2021.eacl-main.249
  • 语种:English
  • 出版社:ACL Anthology
  • 摘要:By supporting multi-modal retrieval training and evaluation, image captioning datasets have spurred remarkable progress on representation learning. Unfortunately, datasets have limited cross-modal associations: images are not paired with other images, captions are only paired with other captions of the same image, there are no negative associations and there are missing positive cross-modal associations. This undermines research into how inter-modality learning impacts intra-modality tasks. We address this gap with Crisscrossed Captions (CxC), an extension of the MS-COCO dataset with human semantic similarity judgments for 267,095 intra- and inter-modality pairs. We report baseline results on CxC for strong existing unimodal and multimodal models. We also evaluate a multitask dual encoder trained on both image-caption and caption-caption pairs that crucially demonstrates CxC’s value for measuring the influence of intra- and inter-modality learning.
国家哲学社会科学文献中心版权所有