首页    期刊浏览 2024年09月19日 星期四
登录注册

文章基本信息

  • 标题:A Hybrid Deep Learning Model for Arabic Text Recognition
  • 本地全文:下载
  • 作者:Mohammad Fasha ; Bassam Hammo ; Nadim Obeid
  • 期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
  • 印刷版ISSN:2158-107X
  • 电子版ISSN:2156-5570
  • 出版年度:2020
  • 卷号:11
  • 期号:8
  • DOI:10.14569/IJACSA.2020.0110816
  • 出版社:Science and Information Society (SAI)
  • 摘要:Arabic text recognition is a challenging task because of the cursive nature of Arabic writing system, its joint writing scheme, the large number of ligatures and many other challenges. Deep Learning (DL) models achieved significant progress in numerous domains including computer vision and sequence modelling. This paper presents a model that can recognize Arabic text that was printed using multiple font types including fonts that mimic Arabic handwritten scripts. The proposed model employs a hybrid DL network that can recognize Arabic printed text without the need for character segmentation. The model was tested on a custom dataset comprised of over two million word samples that were generated using (18) different Arabic font types. The objective of the testing process was to assess the model’s capability in recognizing a diverse set of Arabic fonts representing a varied cursive styles. The model achieved good results in recognizing characters and words and it also achieved promising results in recognizing characters when it was tested on unseen data. The prepared model, the custom datasets and the toolkit for generating similar datasets are made publically available, these tools can be used to prepare models for recognizing other font types as well as to further extend and enhance the performance of the proposed model.
  • 关键词:Arabic optical character recognition; deep learning; convolutional neural networks; recurrent neural networks
国家哲学社会科学文献中心版权所有