首页    期刊浏览 2024年09月30日 星期一
登录注册

文章基本信息

  • 标题:Combined method for scanned documents images segmentation using sequential extraction of regions
  • 本地全文:下载
  • 作者:Marina Polyakova ; Alesya Ishchenko ; Natalya Volkova
  • 期刊名称:Eastern-European Journal of Enterprise Technologies
  • 印刷版ISSN:1729-3774
  • 电子版ISSN:1729-4061
  • 出版年度:2018
  • 卷号:5
  • 期号:2
  • 页码:6-15
  • DOI:10.15587/1729-4061.2018.142735
  • 语种:English
  • 出版社:PC Technology Center
  • 摘要:We propose a combined method to segment the images of scanned documents, which, in contrast to known methods, implies a preliminary separation of the graphics and photograph regions from the text regions and a background. In this case, an analysis of the connected components is performed, which are different for graph-ics, photographs, and text regions. In order to classify the selected regions into the photograph and graphics regions, a block method is employed. It was established that such a technique for splitting the regions into blocks less affects the quality of segmentation when compared to applying the block method directly to the original im-age. To extract the text regions that are more complex in their shape from the background, the neighborhood of each pixel was processed.To detect the boundaries of illustrations on the images of scanned documents, we applied the bloomberg method. In order to classify into photographs and graphics, it is proposed to split an illustration into blocks of pixels. Each block of pixels is identified with a vector of two features: the mean value of the local gradient magnitude, and the mean value of the function that localizes at the images of scanned documents the linear objects (graphics and text characters). The derived feature vectors were classified using a sup-port vector machine.When extracting the text regions, we applied a low-frequency filtering and a thresholding.The combined method was implemented in practice to segment the test images of scanned newspaper articles from the document da-tabase mediateam at oulu university (finland). It was established that the combined method is characterized by an increase in perfor-mance speed during image segmentation at high quality processing.
  • 关键词:image segmentation;scanned document;block method;graphics;photographic image;text fragment;connected component;bloomberg method
国家哲学社会科学文献中心版权所有