首页    期刊浏览 2024年10月06日 星期日
登录注册

文章基本信息

  • 标题:An Improved Swarm Based Approach for Efficient Document Clustering
  • 本地全文:下载
  • 作者:Kanika Khanna ; Madan Lal Yadav
  • 期刊名称:International Journal of Computer Trends and Technology
  • 电子版ISSN:2231-2803
  • 出版年度:2013
  • 卷号:4
  • 期号:6-1
  • 出版社:Seventh Sense Research Group
  • 摘要:Clustering is one basic and important data mining approach used independently as well as the preprocessing stage in many data mining applications. The clustering process basically divides the available dataset into smaller subsets called clusters. These clusters are generally substantially different from one other. In this present work, the clustering is performed on text documents. Text document clustering basically divide the available documents in sub groups based on clustering parameters. The document clustering includes number of basic phenomenon such as document organization, topic extraction and the information retrieval. In this, an improvement clustering approach is defined over the basic clustering approach. The basic clustering approaches that we have to improve in this work are KMeans Clustering and CMeans Clustering. The improvement is here done with the inclusion of PSO (Particle Swarm Optimization).
  • 关键词:Pre-processing; Clustering; Extraction; K-Means; C-Means; PSO
国家哲学社会科学文献中心版权所有