首页    期刊浏览 2024年07月18日 星期四
登录注册

文章基本信息

  • 标题:An Efficient Discriminant Analysis Algorithm for Document Classification
  • 本地全文:下载
  • 作者:Wang, Ziqiang ; Sun, Xia
  • 期刊名称:Journal of Software
  • 印刷版ISSN:1796-217X
  • 出版年度:2011
  • 卷号:6
  • 期号:7
  • 页码:1265-1272
  • DOI:10.4304/jsw.6.7.1265-1272
  • 语种:English
  • 出版社:Academy Publisher
  • 摘要:Document categorization has become one of the most important research areas of pattern recognition and data mining due to the exponential growth of documents in the Internet and the emergent need to organize them. The document space is always of very high dimensionality and learning in such a high dimensional space is often impossible due to the curse of dimensionality. To cope with performance and accuracy problems with high dimensionality, a novel dimensionality reduction algorithm called IKDA is proposed in this paper. The proposed IKDA algorithm combines kernel-based learning techniques and direct iterative optimization procedure to deal with the nonlinearity of the document distribution. The proposed algorithm also effectively solves the so-called “small sample size” problem in document classification task. Extensive experimental results on two real world data sets demonstrate the effectiveness and efficiency of the proposed algorithm.
  • 关键词:document classification;kernel discriminant analysis;dimensionality reduction;data mining
国家哲学社会科学文献中心版权所有