首页    期刊浏览 2025年05月25日 星期日
登录注册

文章基本信息

  • 标题:Comparative analysis of Tesseract and Google Cloud Vision for Thai vehicle registration certificate
  • 本地全文:下载
  • 作者:Karanrat Thammarak ; Prateep Kongkla ; Yaowarat Sirisathitkul
  • 期刊名称:International Journal of Electrical and Computer Engineering
  • 电子版ISSN:2088-8708
  • 出版年度:2022
  • 卷号:12
  • 期号:2
  • 页码:1849-1858
  • DOI:10.11591/ijece.v12i2.pp1849-1858
  • 语种:English
  • 出版社:Institute of Advanced Engineering and Science (IAES)
  • 摘要:Optical character recognition (OCR) is a technology to digitize a paper-based document to digital form. This research studies the extraction of the characters from a Thai vehicle registration certificate via a Google Cloud Vision API and a Tesseract OCR. The recognition performance of both OCR APIs is also examined. The 84 color image files comprised three image sizes/resolutions and five image characteristics. For suitable image type comparison, the greyscale and binary image are converted from color images. Furthermore, the three pre-processing techniques, sharpening, contrast adjustment, and brightness adjustment, are also applied to enhance the quality of image before applying the two OCR APIs. The recognition performance was evaluated in terms of accuracy and readability. The results showed that the Google Cloud Vision API works well for the Thai vehicle registration certificate with an accuracy of 84.43%, whereas the Tesseract OCR showed an accuracy of 47.02%. The highest accuracy came from the color image with 1024×768 px, 300dpi, and using sharpening and brightness adjustment as pre-processing techniques. In terms of readability, the Google Cloud Vision API has more readability than the Tesseract. The proposed conditions facilitate the possibility of the implementation for Thai vehicle registration certificate recognition system.
  • 关键词:computer vision;Google Cloud Vision API;optical character recognition;tesseract OCR;thai vehicle registration certificate
国家哲学社会科学文献中心版权所有