首页    期刊浏览 2024年11月08日 星期五
登录注册

文章基本信息

  • 标题:Document Classification Using Expectation Maximization with Semi Supervised Learning
  • 本地全文:下载
  • 作者:Bhawna Nigam ; Poorvi Ahirwal
  • 期刊名称:International Journal on Soft Computing
  • 电子版ISSN:2229-7103
  • 出版年度:2011
  • 卷号:2
  • 期号:4
  • 出版社:Academy & Industry Research Collaboration Center (AIRCC)
  • 摘要:As the amount of online document increases, the demand for document classification to aid the analysis and management of document is increasing. Text is cheap, but information, in the form of knowing what classes a document belongs to, is expensive. The main purpose of this paper is to explain the expectation maximization technique of data mining to classify the document and to learn how to improve the accuracy while using semi-supervised approach. Expectation maximization algorithm is applied with both supervised and semi-supervised approach. It is found that semi-supervised approach is more accurate and effective. The main advantage of semi supervised approach is “DYNAMICALLY GENERATION OF NEW CLASS”. The algorithm first trains a classifier using the labeled document and probabilistically classifies the unlabeled documents. The car dataset for the evaluation purpose is collected from UCI repository dataset in which some changes have been done from our side.
  • 关键词:Data mining; semi-supervised learning; supervised learning; expectation maximization; document;classification.
国家哲学社会科学文献中心版权所有