首页    期刊浏览 2024年12月01日 星期日
登录注册

文章基本信息

  • 标题:A Novel Semi Supervised Algorithm for Text Classification Using BPNN by Active Search
  • 本地全文:下载
  • 作者:Mahak Motwani ; Aruna Tiwari
  • 期刊名称:International Journal of Computer Science Issues
  • 印刷版ISSN:1694-0784
  • 电子版ISSN:1694-0814
  • 出版年度:2014
  • 卷号:11
  • 期号:3
  • 出版社:IJCSI Press
  • 摘要:Demand of Text Classification is increasing with the evolution of huge amount of text data available in internet, news, institutes , To make an effective text classifier we need large amount of labeled data in the form of training samples, to get labeled data is not only expensive but also time consuming, tedious task, whereas unlabelled data is easily available inexpensive. This paper proposes an algorithm that just makes use of some root words from expert followed by active search. Our algorithm also makes use of a very effective Term weighting method based on relevance factor that is used for feature representation, this text is train by BPNN. The proposed algorithm is compared on test data and on standard data 20 Newsgroup and mini Newsgroup on the basis of micro-average and macro-averaged F1 measure The Experimental results depicts the best micro averaged F1 measure of 0.95 at 2400 epochs for test data, 0.67 for 20 news group and is 0.95 for Mini Newsgroup which are comparable with the well known supervised Text classification.
  • 关键词:Semi Supervised; text classification; Active search; term weighting method; Neural network
国家哲学社会科学文献中心版权所有