文章基本信息

标题：Structured (De)composable Representations Trained with Neural Networks
本地全文：下载
作者：Graham Spinks ; Marie-Francine Moens
期刊名称：Computers
电子版ISSN：2073-431X
出版年度：2020
卷号：9
期号：4
页码：79-101
DOI：10.3390/computers9040079
出版社：MDPI Publishing
摘要：This paper proposes a novel technique for representing templates and instances of concept classes. A template representation refers to the generic representation that captures the characteristics of an entire class. The proposed technique uses end-to-end deep learning to learn structured and composable representations from input images and discrete labels. The obtained representations are based on distance estimates between the distributions given by the class label and those given by contextual information, which are modeled as environments. We prove that the representations have a clear structure allowing decomposing the representation into factors that represent classes and environments. We evaluate our novel technique on classification and retrieval tasks involving different modalities (visual and language data). In various experiments, we show how the representations can be compressed and how different hyperparameters impact performance.
关键词：structured representations; composition; deep learning; multimodal structured representations ; composition ; deep learning ; multimodal