出版社:University of Malaya * Faculty of Computer Science and Information Technology
摘要:background and uneven illumination in natural scene images. In this paper, we present a new method based on adaptive histogram analysis for each sliding window over a word of a text line detected by the text detection method. The histogram analysis works on the basis that intensity values of text pixels in each sliding window have uniform color. The method segments the words based on region growing which studies spacing between words and characters. Then we propose to use existing OCRs such as ABBYY and Tesseract (Google) to recognize the text line at word and character levels to validate the binarization results. The method is compared with wellknown global thresholding technique of binarization to show its effectiveness.
关键词:Adaptive histogram analysis; Scene text binarization; Scene text recognition; Region growing; Word segmentation; Global thresholding.