文章基本信息

标题：Naïve Bayesian Based on Chi Square to Categorize Arabic Data
作者：Fadi Thabtah ; Mohammad Ali H. Eljinini ; Mannam Zamzeer 等
期刊名称：Communications of the IBIMA
电子版ISSN：1943-7765
出版年度：2009
卷号：2009
出版社：IBIMA Publishing
摘要：Text classification is a supervised technique that uses labelled training data to learn the classification system and then automatically classifies the remaining text using the learned system. This paper investigates Naïve Bayesian algorithm based on Chi Square features selection method. The base of our comparisons are macro F1, macro recall and macro precision evaluation measures. The experimental results compared against different Arabic text categorization data sets provided evidence that feature selection often increases classification accuracy by removing rare terms.
关键词：Text Categorization; Naïve Bayesian; Arabic Text Data; Chi Square.