Article Information

  • Title: Deep Multimodal Fusion Autoencoder for Saliency Prediction of RGB-D Images
  • Authors: Kengda Huang; Wujie Zhou; Meixin Fang
  • Journal: Computational Intelligence and Neuroscience
  • Print ISSN: 1687-5265
  • Electronic ISSN: 1687-5273
  • Publication year: 2021
  • Volume: 2021
  • Pages: 1-10
  • DOI: 10.1155/2021/6610997
  • Publisher: Hindawi Publishing Corporation
  • Abstract: In recent years, the prediction of salient regions in RGB-D images has become a focus of research. Saliency prediction for RGB-D images is more challenging than for RGB images alone. In this study, we propose a novel deep multimodal fusion autoencoder for the saliency prediction of RGB-D images. The core trainable autoencoder of the RGB-D saliency prediction model takes two raw modalities (RGB and depth/disparity information) as inputs and their corresponding eye-fixation attributes as labels. The autoencoder comprises four main networks: a color channel network, a disparity channel network, a feature concatenated network, and a feature learning network. The autoencoder can mine the complex relationship between color and disparity cues and make full use of their complementary characteristics. Finally, the saliency map is predicted via a feature combination subnetwork, which combines the deep features extracted from the prior learning and convolutional feature learning subnetworks. We compare the proposed autoencoder with other saliency prediction models on two publicly available benchmark datasets. The results demonstrate that the proposed autoencoder outperforms these models by a significant margin.
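
The data flow through the four networks named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: each network is reduced to a single per-pixel linear map (a 1x1 convolution) with random, untrained weights, the prior learning and feature combination subnetworks are omitted, and all names (`init_params`, `predict_saliency`, the channel widths) are hypothetical choices made for illustration only.

```python
import numpy as np


def conv1x1(x, w):
    """Per-pixel linear map: x has shape (H, W, C_in), w has shape (C_in, C_out)."""
    return x @ w


def relu(x):
    return np.maximum(x, 0.0)


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def init_params(rng, width=8):
    """Random, untrained weights; `width` is an assumed feature size."""
    return {
        "color": rng.normal(size=(3, width)),          # RGB has 3 channels
        "disparity": rng.normal(size=(1, width)),      # disparity has 1 channel
        "learn": rng.normal(size=(2 * width, width)),  # acts on concatenated features
        "readout": rng.normal(size=(width, 1)),        # single-channel saliency map
    }


def predict_saliency(rgb, disparity, params):
    # Color channel network (stand-in: one 1x1 conv + ReLU).
    f_color = relu(conv1x1(rgb, params["color"]))
    # Disparity channel network (same stand-in, separate weights).
    f_disp = relu(conv1x1(disparity, params["disparity"]))
    # Feature concatenated network: join both streams along the channel axis.
    fused = np.concatenate([f_color, f_disp], axis=-1)
    # Feature learning network: mix the fused color and disparity features.
    learned = relu(conv1x1(fused, params["learn"]))
    # Readout to a saliency map with values in [0, 1] (an assumed output head).
    return sigmoid(conv1x1(learned, params["readout"]))[..., 0]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    params = init_params(rng)
    rgb = rng.random((4, 5, 3))        # toy 4x5 RGB image
    disparity = rng.random((4, 5, 1))  # matching toy disparity map
    print(predict_saliency(rgb, disparity, params).shape)
```

In a trained model the eye-fixation maps mentioned in the abstract would serve as targets for this output; the sketch only shows how the two modalities are encoded separately and then fused.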