首页    期刊浏览 2024年12月03日 星期二
登录注册

文章基本信息

  • 标题:RESEARCH CHALLENGES IN TEXT MINING AND EMPIRICAL RESEARCH DIRECTIONS
  • 本地全文:下载
  • 作者:K Ranjith Reddy ; Dr. Sanjay Chaudhary
  • 期刊名称:Indian Journal of Computer Science and Engineering
  • 印刷版ISSN:2231-3850
  • 电子版ISSN:0976-5166
  • 出版年度:2021
  • 卷号:12
  • 期号:3
  • 页码:752-764
  • 出版社:Engg Journals Publications
  • 摘要:Document categorization is one among the prime successive and fundamental issues in viewpoints of information examination, with applications from data recovery as well as spam sifting to content personalization and etymological communication content measure. Automated text order classification is an especially difficult assignment in present day information investigation, both from an observational and from a hypothetical viewpoint. This issue is of focal interest in numerous web applications, and therefore it has received consideration from specialists in such assorted zones. Quickly streaming surges of text are created by online news, web-based media and perpetual various applications, along with subsequently the need to precisely and adaptively sort them into the sub-streams could be a significant one. The emphasis on exclusively making utilization of delimited resources could be a result of size of particular streams: each time and memory should be held under the influence. The economical analysis of the huge datasets is the one among the most challenges in trendy machine intelligence and data processing applications. In this paper, we extensively surveyed significant developments occurred in this domain over past years. We have listed some significant existing methods, tools, standard datasets for performing text mining and analysis. We also given an argument on the various open challenges involved in this domain along with the problem identification and our possible research directions / objectives to overcome these challenges.
  • 关键词:Text classification; Information retrieval; Machine learning; Feature selection; Language models.
国家哲学社会科学文献中心版权所有