期刊名称:International Journal of Electronics and Computer Science Engineering
电子版ISSN:2277-1956
出版年度:2012
卷号:1
期号:3
页码:1833-1839
出版社:Buldanshahr : IJECSE
摘要:This paper presents a new technique that greatly increases the speed of the connected component labeling algorithm. We propose a system to extract the text from the PDF images. This paper describes the system design based on text extraction method concentrating on text extraction from PDF images by enhancing the traditional connected component labeling as modified connected component labeling that uses a components neighbor scan labeling approach derived from Akmal et al[9]. This method produced good performance in terms of accuracy and speed. The performance of the approach is demonstrated
关键词:Image Processing; Labeling; CCL; Text extraction