文章基本信息

标题：Efficient Text Extraction Algorithm Using Color Clustering for Language Translation in Mobile Phone
本地全文：下载
作者：Adrián Canedo-Rodríguez ; Jung Hyoun Kim ; Soo-Hyung Kim 等
期刊名称：Journal of Signal and Information Processing
印刷版ISSN：2159-4465
电子版ISSN：2159-4481
出版年度：2012
卷号：3
期号：2
页码：228-237
DOI：10.4236/jsip.2012.32031
出版社：Scientific Research Publishing
摘要：Many Text Extraction methodologies have been proposed, but none of them are suitable to be part of a real system implemented on a device with low computational resources, either because their accuracy is insufficient, or because their performance is too slow. In this sense, we propose a Text Extraction algorithm for the context of language translation of scene text images with mobile phones, which is fast and accurate at the same time. The algorithm uses very efficient computations to calculate the Principal Color Components of a previously quantized image, and decides which ones are the main foreground-background colors, after which it extracts the text in the image. We have compared our algorithm with other algorithms using commercial OCR, achieving accuracy rates more than 12% higher, and performing two times faster. Also, our methodology is more robust against common degradations, such as uneven illumination, or blurring. Thus, we developed a very attractive system to accurately separate foreground and background from scene text images, working over low computational resources devices.
关键词：Text Extraction; Color Quantization; Text Binarization; Language Translation