首页    期刊浏览 2025年07月23日 星期三
登录注册

文章基本信息

  • 标题:Document Classification of Assamese Text Using Naïve Bayes Approach
  • 本地全文:下载
  • 作者:Moromi Gogoi ; Shikhar Kumar Sarma
  • 期刊名称:International Journal of Computer Trends and Technology
  • 电子版ISSN:2231-2803
  • 出版年度:2015
  • 卷号:30
  • 期号:4
  • 页码:182-186
  • DOI:10.14445/22312803/IJCTT-V30P132
  • 出版社:Seventh Sense Research Group
  • 摘要:Document classification has become an emerging technique in the field of research due to the abundance of documents available in digital form. Document classification can be used to organize data into smaller and meaningful classes. Correctly identifying a document into a particular class is still a huge challenge particularly in Assamese text as very few work has been done in this field . In this paper we have done document classification using Naïve bayes classifier. In regards to the various classifying approaches, Naïve Bayes is potentially good at serving as a document classification model due to its simplicity. The aim of this paper is to highlight the performance of employing Naïve Bayes in document classification. In this paper the document is classified into one of the four classes i.e. sports, politics , law and science. To build and evaluate the classification model, a total 200 documents is split into two datasets, namely training set and testing set, in which 60% of the documents is used as training set whereas the remaining 40% is used as the testing set. The results have been validated using statistical measures of precision , recall and their combination Fmeasure. Results show that Naïve Bayes is a good classifiers.
  • 关键词:Document classification; Naive Bayes
国家哲学社会科学文献中心版权所有