
Article Information

  • Title: Image Captioning Based on Deep Neural Networks
  • Authors: Shuang Liu ; Liang Bai ; Yanli Hu
  • Journal: MATEC Web of Conferences
  • Electronic ISSN: 2261-236X
  • Publication year: 2018
  • Volume: 232
  • DOI: 10.1051/matecconf/201823201052
  • Language: English
  • Publisher: EDP Sciences
  • Abstract: With the development of deep learning, the combination of computer vision and natural language processing has attracted great attention in the past few years. Image captioning is representative of this field: the computer learns to use one or more sentences to convey its understanding of the visual content of an image. Generating a meaningful description of high-level image semantics requires not only recognizing the objects and the scene, but also analyzing the states, attributes, and relationships of these objects. Although image captioning is a complicated and difficult task, many researchers have achieved significant improvements. In this paper, we mainly describe three image captioning methods using deep neural networks: CNN-RNN based, CNN-CNN based, and reinforcement-based frameworks. We then introduce representative work for each of these three methods, describe the evaluation metrics, and summarize the benefits and major challenges.