期刊名称:International Journal of Advanced Research in Computer Engineering & Technology (IJARCET)
印刷版ISSN:2278-1323
出版年度:2012
卷号:1
期号:4
页码:34-36
出版社:Shri Pannalal Research Institute of Technolgy
摘要:Images and videos on webs and in databases are increasing. It is a pressing task to develop effective methods to manage and retrieve these multimedia resources by their content. Text, which carries high-level semantic information, is a kind of important object that is useful for this task.When a machine generated text is printed against clean backgrounds, it can be converted to a computer readable form (ASCII) using current optical character recognition (OCR) technology. However, text is often printed against shaded or textured backgrounds or is embedded in images. Examples include maps, photographs, advertisements, videos, etc. Current document segmentation and recognition technologies cannot handle these situations well. Our system takes advantage of the distinctive characteristics of text that make it stand out from other image material i.e. text possesses certain frequency and orientation information; text shows spatial cohesion¡ªcharacters of the same text string (a word, or words in the same line) are of similar heights, orientation, and spacing.
关键词:binarization; connected components; filters; text ; reading system