文章基本信息

标题：What Did This Castle Look like before? Exploring Referential Relations in Naturally Occurring Multimodal Texts
本地全文：下载
作者：Ronja Utescher ; Sina Zarrieß
期刊名称：Conference on European Chapter of the Association for Computational Linguistics (EACL)
出版年度：2021
卷号：2021
页码：53-60
语种：English
出版社：ACL Anthology
摘要：Multi-modal texts are abundant and diverse in structure, yet Language & Vision research of these naturally occurring texts has mostly focused on genres that are comparatively light on text, like tweets. In this paper, we discuss the challenges and potential benefits of a L&V framework that explicitly models referential relations, taking Wikipedia articles about buildings as an example. We briefly survey existing related tasks in L&V and propose multi-modal information extraction as a general direction for future research.