首页    期刊浏览 2024年10月05日 星期六
登录注册

文章基本信息

  • 标题:A Hybridized Deep Learning Method for Bengali Image Captioning
  • 本地全文:下载
  • 作者:Mayeesha Humaira ; Shimul Paul ; Abidur Rahman Khan Jim
  • 期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
  • 印刷版ISSN:2158-107X
  • 电子版ISSN:2156-5570
  • 出版年度:2021
  • 卷号:12
  • 期号:2
  • 页码:698-707
  • DOI:10.14569/IJACSA.2021.0120287
  • 出版社:Science and Information Society (SAI)
  • 摘要:An omnipresent challenging research topic in com-puter vision is the generation of captions from an input image. Previously, numerous experiments have been conducted on image captioning in English but the generation of the caption from the image in Bengali is still sparse and in need of more refining. Only a few papers till now have worked on image captioning in Bengali. Hence, we proffer a standard strategy for Bengali image caption generation on two different sizes of the Flickr8k dataset and BanglaLekha dataset which is the only publicly available Bengali dataset for image captioning. Afterward, the Bengali captions of our model were compared with Bengali captions generated by other researchers using different architectures. Additionally, we employed a hybrid approach based on InceptionResnetV2 or Xception as Convolution Neural Network and Bidirectional Long Short-Term Memory or Bidirectional Gated Recurrent Unit on two Bengali datasets. Furthermore, a different combination of word embedding was also adapted. Lastly, the performance was evaluated using Bilingual Evaluation Understudy and proved that the proposed model indeed performed better for the Bengali dataset consisting of 4000 images and the BanglaLekha dataset.
  • 关键词:Bengali image captioning; hybrid architecture; In-ceptionResNet; Xception
国家哲学社会科学文献中心版权所有