期刊名称:International Journal of Advanced Research In Computer Science and Software Engineering
印刷版ISSN:2277-6451
电子版ISSN:2277-128X
出版年度:2013
卷号:3
期号:4
出版社:S.S. Mishra
摘要:Document clustering help in organising documents in groups according to their similarity of contents. This paper presents the study of various clustering techniques. In particular K-means [3], Agglomerative Hierarchical Clustering. In addition to the various clustering techniques this paper also discusses about various document-representing techniques in graph .In particular Vector Space Model [2, 18] and Matrix Representation [2]. After studying all these things, we created a new approach of clustering algorithm and also a new representation technique of documents. We compared our results with K-means Algorithm and found that our approach is giving good results
关键词:Document Clustering; K-means; Vector Space Model; Agglomerative Hierarchical Clustering