首页    期刊浏览 2024年11月26日 星期二
登录注册

文章基本信息

  • 标题:Enhancement and Segmentation of Historical Records
  • 本地全文:下载
  • 作者:Soumya A ; G Hemantha Kumar
  • 期刊名称:Computer Science & Information Technology
  • 电子版ISSN:2231-5403
  • 出版年度:2015
  • 卷号:5
  • 期号:13
  • 页码:95-113
  • DOI:10.5121/csit.2015.51309
  • 出版社:Academy & Industry Research Collaboration Center (AIRCC)
  • 摘要:Document Analysis and Recognition (DAR) aims to extract automatically the information in thedocument and also addresses to human comprehension. The automatic processing of degradedhistorical documents are applications of document image analysis field which is confronted withmany difficulties due to the storage condition and the complexity of the script. The main interestof enhancement of historical documents is to remove undesirable statistics that appear in thebackground and highlight the foreground, so as to enable automatic recognition of documentswith high accuracy. This paper addresses pre-processing and segmentation of ancient scripts,as an initial step to automate the task of an epigraphist in reading and deciphering inscriptions.Pre-processing involves, enhancement of degraded ancient document images which is achievedthrough four different Spatial filtering methods for smoothing or sharpening namely Median,Gaussian blur, Mean and Bilateral filter, with different mask sizes. This is followed bybinarization of the enhanced image to highlight the foreground information, using Otsuthresholding algorithm. In the second phase Segmentation is carried out using Drop Fall andWaterReservoir approaches, to obtain sampled characters, which can be used in later stages ofOCR. The system showed good results when tested on the nearly 150 samples of varyingdegraded epigraphic images and works well giving better enhanced output for, 4x4 mask sizefor Median filter, 2x2 mask size for Gaussian blur, 4x4 mask size for Mean and Bilateral filter.The system can effectively sample characters from enhanced images, giving a segmentation rateof 85%-90% for Drop Fall and 85%-90% for Water Reservoir techniques respectively.
  • 关键词:Document Analysis; Preprocessing; Filters; Segmentation; Drop Fall Technique; Water;Reservoir Technique
国家哲学社会科学文献中心版权所有