首页    期刊浏览 2024年09月15日 星期日
登录注册

文章基本信息

  • 标题:Analog Document Search Using CRNN and Keyphrase Extraction
  • 本地全文:下载
  • 作者:Lokeshwar S ; Vadiraja Rao M. K ; Sujay Kumar P. S
  • 期刊名称:International Journal of Image, Graphics and Signal Processing
  • 印刷版ISSN:2074-9074
  • 电子版ISSN:2074-9082
  • 出版年度:2021
  • 卷号:13
  • 期号:2
  • 页码:16-24
  • DOI:10.5815/ijigsp.2021.02.02
  • 出版社:MECS Publisher
  • 摘要:There seems to be a peculiar trend in the way information is now used, moving to digital media not just for the newspapers but for books as well. With advances in Optical Character Recognition (OCR), Style Transfer Mapping (STM), and efficient key phrasing, we are now able to digitalize the document to a form that can be read across multiple platforms and searched efficiently. It provides users with the ease of searching for relevant documents without the tedious process of manual searching. We propose a system that uses the CRNN model to detect English characters in the document with high accuracy. We then pair it with a hybrid keyphrasing technique, which uses Positional Rank as its Graph based rank and re-rank the key phrases using the C-Value method. This process allows us to automatically digitize the printed document and summarise it to provide high-quality keyphrases, which can be used to efficiently search and retrieve relevant printed documents.
  • 关键词:Analog document search; CRNN; Keyphrase Extraction; Position Rank
国家哲学社会科学文献中心版权所有