首页    期刊浏览 2024年11月27日 星期三
登录注册

文章基本信息

  • 标题:AN ANALYSIS OF TEXT MINING FACTORS ENHANCING THE IDENTIFICATION OF RELEVANT STUDIES
  • 本地全文:下载
  • 作者:MOUAYAD KHASHFEH ; MOAMIN A. MAHMOUD ; MOHD SHARIFUDDIN AHMAD
  • 期刊名称:Journal of Theoretical and Applied Information Technology
  • 印刷版ISSN:1992-8645
  • 电子版ISSN:1817-3195
  • 出版年度:2018
  • 卷号:96
  • 期号:12
  • 出版社:Journal of Theoretical and Applied
  • 摘要:The development of science and the spread of knowledge coincide with growing number of publications, and the volume of online content continue to grow at a rapid rate. For some submitted queries, the search engines may return thousands of documents of questionable relevancy. In this paper, we analyze the literature and identify the text mining factors that influence the identification of relevant studies. Five factors are identified which are Text Typography; Paragraph length; Term Frequency factor; Coordination; and Strict search. Subsequently, we propose an agent based-text mining model that facilitate the identification of relevant studies in big databases. The model consists of four components which are, interface, search process, parsing process, and storage. The interface provides a communication mean between a user and his/her counterpart agent (Personal Agent). In addition, it provides an input tool for user�s search preferences. The second component is the search process that is operated by a pattern matching. The third process is the parsing that is operated by a text mining algorithm. The last part is the storage that is managed by Monitor Agent. The proposed framework would be useful in providing an alternative means of searching highly relevant studies from large databases.
  • 关键词:Text Mining; Agent-based Model; Relevant Studies
国家哲学社会科学文献中心版权所有