首页    期刊浏览 2024年11月08日 星期五
登录注册

文章基本信息

  • 标题:Text Extraction From Images
  • 本地全文:下载
  • 作者:Satish Kumar ; Sunil Kumar ; S. Gopinath
  • 期刊名称:International Journal of Advanced Research in Computer Engineering & Technology (IJARCET)
  • 印刷版ISSN:2278-1323
  • 出版年度:2012
  • 卷号:1
  • 期号:4
  • 页码:34-36
  • 出版社:Shri Pannalal Research Institute of Technolgy
  • 摘要:Images and videos on webs and in databases are increasing. It is a pressing task to develop effective methods to manage and retrieve these multimedia resources by their content. Text, which carries high-level semantic information, is a kind of important object that is useful for this task.When a machine generated text is printed against clean backgrounds, it can be converted to a computer readable form (ASCII) using current optical character recognition (OCR) technology. However, text is often printed against shaded or textured backgrounds or is embedded in images. Examples include maps, photographs, advertisements, videos, etc. Current document segmentation and recognition technologies cannot handle these situations well. Our system takes advantage of the distinctive characteristics of text that make it stand out from other image material i.e. text possesses certain frequency and orientation information; text shows spatial cohesion¡ªcharacters of the same text string (a word, or words in the same line) are of similar heights, orientation, and spacing.
  • 关键词:binarization; connected components; filters; text ; reading system
国家哲学社会科学文献中心版权所有