
Article Information

  • Title: Classifying Arabic Text Using KNN Classifier
  • Authors: Amer Al-Badarenah; Emad Al-Shawakfa; Khaleel Al-Rababah
  • Journal: International Journal of Advanced Computer Science and Applications (IJACSA)
  • Print ISSN: 2158-107X
  • Online ISSN: 2156-5570
  • Year: 2016
  • Volume: 7
  • Issue: 6
  • DOI: 10.14569/IJACSA.2016.070633
  • Publisher: Science and Information Society (SAI)
  • Abstract: With the tremendous amount of electronic documents available, there is a great need to classify documents automatically. Classification is the task of assigning objects (images, text documents, etc.) to one of several predefined categories. Because the selection of important terms is vital to classifier performance, this paper applies feature-set reduction techniques such as stop-word removal, stemming, and term thresholding. Three term-selection techniques are applied to a corpus of 1000 documents that fall into five categories. A comparison study examines the effect of the full-word, stem, and root term-indexing methods. A K-nearest-neighbors (KNN) classifier is used in this study. Recall, precision, fallout, and error rate are averaged over all folds. The experimental results show the importance of k-fold testing, since it exposes the variation in average recall, precision, fallout, and error rate for each category over the 10 folds.
  • Keywords: categorization; Arabic; KNN; stemming; cross validation
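
The abstract reports recall, precision, fallout, and error rate per category, averaged over the folds. The record does not give the formulas, so the sketch below assumes the usual one-vs-rest definitions from a per-category confusion count. The function names and the category labels in the usage note are hypothetical and are not taken from the paper.

```python
def category_metrics(true_labels, predicted_labels, category):
    """One-vs-rest recall, precision, fallout, and error rate for one category.

    Assumed definitions (not stated in the record):
      recall     = TP / (TP + FN)
      precision  = TP / (TP + FP)
      fallout    = FP / (FP + TN)
      error rate = (FP + FN) / (TP + FP + FN + TN)
    """
    tp = fp = fn = tn = 0
    for truth, guess in zip(true_labels, predicted_labels):
        if guess == category and truth == category:
            tp += 1
        elif guess == category:
            fp += 1
        elif truth == category:
            fn += 1
        else:
            tn += 1
    total = tp + fp + fn + tn
    # Guard each ratio against an empty denominator.
    return {
        "recall": tp / (tp + fn) if tp + fn else 0.0,
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "fallout": fp / (fp + tn) if fp + tn else 0.0,
        "error_rate": (fp + fn) / total if total else 0.0,
    }


def average_over_folds(fold_results, category):
    """Average the per-fold metrics of one category.

    fold_results is a list of (true_labels, predicted_labels) pairs,
    one pair per test fold.
    """
    per_fold = [category_metrics(t, p, category) for t, p in fold_results]
    return {
        name: sum(m[name] for m in per_fold) / len(per_fold)
        for name in per_fold[0]
    }
```

For instance, with two toy folds such as `(["sports", "sports", "economy", "economy"], ["sports", "economy", "economy", "economy"])` and `(["sports", "economy"], ["sports", "sports"])`, calling `average_over_folds(folds, "sports")` gives the mean of each measure across both folds. Printing the per-fold values alongside the mean is one way to see the fold-to-fold variation the abstract refers to.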