Journal: International Journal of Computer Science & Information Technology (IJCSIT)
Print ISSN: 0975-4660
Electronic ISSN: 0975-3826
Publication year: 2014
Volume: 6
Issue: 6
Page: 83
Publisher: Academy & Industry Research Collaboration Center (AIRCC)
Abstract: Text categorization is the process of grouping documents into categories based on their contents. This process is important for making information retrieval easier, and it has become more important because of the huge amount of textual information available online. The main problem in text categorization is how to improve classification accuracy. Although Arabic text categorization is a promising new field, little research has been done in it. This paper proposes a new method for Arabic text categorization using vector evaluation. The proposed method uses a corpus of categorized Arabic documents. The weights of the tested document's words are calculated to determine the document's keywords, which are then compared with the keywords of the corpus categories to determine the tested document's best category.
Keywords: Text Categorization; Arabic Text Classification; Information Retrieval; Data Mining; Machine Learning
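
Illustrative note: the abstract describes weighting a document's words, keeping the highest-weighted ones as keywords, and matching them against per-category keywords. The following is a minimal sketch of that general idea only, not the paper's method. The abstract does not give the weighting formula or the comparison rule, so raw term frequency and keyword-overlap counting are assumptions made here, and the function names, the cutoff `k`, and the toy English tokens are all hypothetical.

```python
from collections import Counter


def top_keywords(tokens, k=3):
    """Return the k highest-weighted words of a tokenized document.

    Assumption: a word's weight is its raw frequency in the document.
    """
    counts = Counter(tokens)
    # Sort by descending weight, then alphabetically so ties break deterministically.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return {word for word, _ in ranked[:k]}


def best_category(tokens, category_keywords, k=3):
    """Pick the category whose keyword set overlaps most with the document's keywords.

    Assumption: similarity is the size of the keyword intersection.
    Ties are broken by alphabetical order of the category name.
    """
    doc_keywords = top_keywords(tokens, k)
    scores = {cat: len(doc_keywords & kws) for cat, kws in category_keywords.items()}
    return max(sorted(scores), key=scores.get)


# Hypothetical toy data standing in for a categorized corpus and a test document.
category_keywords = {
    "sports": {"match", "team", "goal"},
    "economy": {"market", "bank", "price"},
}
doc_tokens = ["team", "goal", "team", "market", "goal", "referee"]

print(best_category(doc_tokens, category_keywords))  # sports
```

In practice the category keyword sets would be derived from the categorized corpus rather than written by hand, and Arabic text would need tokenization and normalization before this step.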