
Basic article information

  • Title: Advanced text documents information retrieval system for search services
  • Authors: Chiranjeevi H S; Manjula K. Shenoy
  • Journal: Cogent Engineering
  • Electronic ISSN: 2331-1916
  • Year of publication: 2020
  • Volume: 7
  • Issue: 1
  • Pages: 1-17
  • DOI: 10.1080/23311916.2020.1856467
  • Publisher: Taylor and Francis Ltd
  • Abstract: Information technology has driven the growth of text document data in many organizations, and organizing such voluminous data is a complex task. Handling text document data is challenging because it involves not only model training but also numerous additional procedures, e.g., data pre-processing, transformation, and dimensionality reduction. In this paper, we describe the system's architecture, the technical challenges, and the novel solution we have built. We propose a Recurrent Convolutional Neural Network (RCNN) based text information retrieval system that efficiently retrieves text documents and information for a user query. The system implements pre-processing using tokenization and stemming, retrieval using TF-IDF (Term Frequency-Inverse Document Frequency), and an RCNN classifier that captures contextual information. A real-time advanced search system is developed on a large MAHE University dataset. The performance of the proposed text document retrieval system is compared with that of existing algorithms, and the efficacy of the method is discussed. The proposed RCNN-based text document information retrieval model performs better in terms of precision, recall, and F-measure. A high-quality and high-performance text document retrieval search system is presented.
  • Keywords: information technology; text documents; search engine; information retrieval; tokenization; recurrent convolutional neural network; retrieval efficiency
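
The abstract names a pipeline of tokenization, stemming, and TF-IDF retrieval, evaluated by precision, recall, and F-measure. The following is a minimal Python sketch of those stages, not the authors' implementation: the suffix-stripping `stem` function is a crude stand-in for a real stemmer, the smoothed IDF formula and cosine ranking are assumptions, all function names are hypothetical, and the RCNN classifier is omitted entirely.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def stem(token):
    """Crude suffix stripping (assumption: stands in for a real stemmer)."""
    for suffix in ("ing", "es", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Tokenize, then stem every token."""
    return [stem(t) for t in tokenize(text)]


def tfidf_vector(tokens, idf):
    """Sparse TF-IDF vector; terms missing from the IDF table are dropped."""
    counts = Counter(tokens)
    total = len(tokens) or 1
    return {t: (c / total) * idf[t] for t, c in counts.items() if t in idf}


def build_index(docs):
    """Return one TF-IDF vector per document plus the IDF table."""
    tokenized = [preprocess(d) for d in docs]
    n = len(docs)
    df = Counter(term for toks in tokenized for term in set(toks))
    # Smoothed IDF (an assumed variant; the paper's formula is not given).
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    return [tfidf_vector(toks, idf) for toks in tokenized], idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(query, docs, top_k=3):
    """Rank documents by cosine similarity; return (doc_index, score) pairs."""
    vectors, idf = build_index(docs)
    q = tfidf_vector(preprocess(query), idf)
    scored = [(cosine(q, v), i) for i, v in enumerate(vectors)]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(i, s) for s, i in scored[:top_k] if s > 0]


def precision_recall_f1(retrieved, relevant):
    """Set-based precision, recall, and F-measure for one query."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    p = hits / len(retrieved) if retrieved else 0.0
    r = hits / len(relevant) if relevant else 0.0
    f = 2 * p * r / (p + r) if p + r else 0.0
    return p, r, f


if __name__ == "__main__":
    # Toy corpus, invented for illustration only.
    corpus = [
        "Admission rules for engineering students",
        "Library opening hours and borrowing rules",
        "Hostel fees and payment deadlines",
    ]
    print(search("library hours", corpus))
```

In this sketch the index is rebuilt on every call to `search` for brevity; a real-time system of the kind the abstract describes would presumably build it once and reuse it.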