首页    期刊浏览 2024年11月27日 星期三
登录注册

文章基本信息

  • 标题:The Comparison of SOM and K-means for Text Clustering
  • 本地全文:下载
  • 作者:Yiheng Chen ; Bing Qin ; Ting Liu
  • 期刊名称:Computer and Information Science
  • 印刷版ISSN:1913-8989
  • 电子版ISSN:1913-8997
  • 出版年度:2010
  • 卷号:3
  • 期号:2
  • 页码:268
  • DOI:10.5539/cis.v3n2p268
  • 出版社:Canadian Center of Science and Education
  • 摘要:

    SOM and k-means are two classical methods for text clustering. In this paper some experiments have been done to compare their performances. The sample data used is 420 articles which come from different topics. K-means method is simple and easy to implement; the structure of SOM is relatively complex, but the clustering results are more visual and easy to comprehend. The comparison results also show that k-means is sensitive to initiative distribution, whereas the overall clustering performance of SOM is better than that of k-means, and it also performs well for detection of noisy documents and topology preservation, thus make it more suitable for some applications such as navigation of document collection, multi-document summarization and etc. whereas the clustering results of SOM is sensitive to output layer topology.

国家哲学社会科学文献中心版权所有