首页    期刊浏览 2024年09月15日 星期日
登录注册

文章基本信息

  • 标题:Segmentation of Touching Conjunct Consonants in Telugu using Minimum Area Bounding Boxes
  • 本地全文:下载
  • 作者:J. Bharathi ; P. Chandrasekar Reddy
  • 期刊名称:International Journal of Soft Computing & Engineering
  • 电子版ISSN:2231-2307
  • 出版年度:2013
  • 卷号:3
  • 期号:3
  • 页码:260-264
  • 出版社:International Journal of Soft Computing & Engineering
  • 摘要:This paper addresses the problem of segmenting touching characters which are written or printed in the bottom zone. In the segmentation of machine printed Telugu document image, conjunct consonants are more prone to touching due to shape of the characters. It is important to segment them properly to improve the accuracy of the Telugu OCR as otherwise the reconstruction and mapping to editable electronic document is incomplete and often needs lot of tedious manual intervention. It is based on the script level characteristic that the secondary form of consonants are written in smaller size and its bounding box is smaller compared to the primary character. The structural feature of sharp peaks in both left and right side profiles at the touching location of the combined character is used for determining the correct segmentation location. The algorithm is tested on a dataset created from large set of documents. The success rate of 96.39% is achieved.
  • 关键词:Minimum area bounding box; segmentation;side profile peaks; touching conjunct consonants.
国家哲学社会科学文献中心版权所有