首页    期刊浏览 2024年07月03日 星期三
登录注册

文章基本信息

  • 标题:Video Captioning Based on Channel Soft Attention and Semantic Reconstructor
  • 本地全文:下载
  • 作者:Zhou Lei ; Yiyong Huang
  • 期刊名称:Future Internet
  • 电子版ISSN:1999-5903
  • 出版年度:2021
  • 卷号:13
  • 期号:2
  • 页码:55
  • DOI:10.3390/fi13020055
  • 出版社:MDPI Publishing
  • 摘要:Video captioning is a popular task which automatically generates a natural-language sentence to describe video content. Previous video captioning works mainly use the encoder–decoder framework and exploit special techniques such as attention mechanisms to improve the quality of generated sentences. In addition, most attention mechanisms focus on global features and spatial features. However, global features are usually fully connected features. Recurrent convolution networks (RCNs) receive 3-dimensional features as input at each time step, but the temporal structure of each channel at each time step has been ignored, which provide temporal relation information of each channel. In this paper, a video captioning model based on channel soft attention and semantic reconstructor is proposed, which considers the global information for each channel. In a video feature map sequence, the same channel of every time step is generated by the same convolutional kernel. We selectively collect the features generated by each convolutional kernel and then input the weighted sum of each channel to RCN at each time step to encode video representation. Furthermore, a semantic reconstructor is proposed to rebuild semantic vectors to ensure the integrity of semantic information in the training process, which takes advantage of both forward (semantic to sentence) and backward (sentence to semantic) flows. Experimental results on popular datasets MSVD and MSR-VTT demonstrate the effectiveness and feasibility of our model.
  • 关键词:video captioning; channel soft attention; semantic reconstructor; recurrent convolution networks video captioning ; channel soft attention ; semantic reconstructor ; recurrent convolution networks
国家哲学社会科学文献中心版权所有