期刊名称:International Journal of Innovative Research in Computer and Communication Engineering
印刷版ISSN:2320-9798
电子版ISSN:2320-9801
出版年度:2016
卷号:4
期号:3
页码:3731
DOI:10.15680/IJIRCCE.2016.0403184
出版社:S&S Publications
摘要:Over the past years, many of our ancient historical documents, legal government documents are corroded by some means intentionally or unintentionally. So to recover these important documents, we aregoing to implement various techniques in order to save these documents digitally.Firstly,we need to understand the nature of the documents, on which it is written,and it's condition to apply various processing techniques .The image pre-processing and image enhancement is done by applying Image Processing and Hybrid Binarization methods. In this paper, we are going to mainly focus on text detection and OCR(Optical Character Recognition) technique. In some scenario, If the image is distorted or contain lot of background noise, it leads to unreliable OCR digitization as the accuracy is not provided. Our approach is to recover deformation of the entire image by image enhancement. We are maintaining the data set in order to obtain more accuracy. For irregularimage illumination, we are going to use various adjustment methods such as brightness, contrast, saturation level etc for local adjustment of the image. Following Steps are carried out for image enhancement:1)Local adjustment .2)converting image in grey scale.3)IGT is applied to OCR recognition
关键词:Hybrid Binarization; OCR(Optical Character Recognition); Tesseract; Thresholding; Weiner Filter