首页    期刊浏览 2025年12月23日 星期二
登录注册

文章基本信息

  • 标题:Genetic algorithm rule based categorization method for textual data mining
  • 本地全文:下载
  • 作者:Afif, M. ; Ghareb, A. ; Saif, A.
  • 期刊名称:Decision Science Letters
  • 印刷版ISSN:1929-5804
  • 电子版ISSN:1929-5812
  • 出版年度:2020
  • 卷号:9
  • 期号:1
  • 页码:37-50
  • DOI:10.5267/j.dsl.2019.8.003
  • 语种:English
  • 出版社:Growing Science Publishing Company
  • 摘要:The rule based categorization approaches such as associative classification have the capability to produce classifiers rival to those learned by traditional categorization approaches such as Naïve Bayes and K-nearest Neighbor. However, the lack of useful discovery and usage of categorization rules are the major challenges of rule based approaches and their performance is declined with large set of rules. Genetic Algorithm (GA) is effective to reduce the high dimensionality and improve categorization performance. However, the usage of GA in most researches is limited in the categorization preprocessing stage and its results is used to simplify the categorization process performed by other categorization algorithms. This paper proposed a hybrid GA rule based categorization method, named genetic algorithm rule based categorization (GARC), to enhance the accuracy of categorization rules and to produce accurate classifier for text mining. The GARC consists of three main stages; namely, search space determination, rule discovery with validation (rule generation), and categorization. The experimental results are carried out on three Arabic text datasets with multiple categories to evaluate the efficiency of GARC. The results show that a promising performance was achieved by using GARC for Arabic text categorization. The GARC achieves the best performance with small feature space in most situations.
  • 关键词:Rule based categorization; Text categorization; Genetic Algorithm; Rule discovery; Categorization rule; Associative classification
国家哲学社会科学文献中心版权所有