期刊名称:International Journal of Advanced Research In Computer Science and Software Engineering
印刷版ISSN:2277-6451
电子版ISSN:2277-128X
出版年度:2012
卷号:2
期号:5
出版社:S.S. Mishra
摘要:Cluster analysis is important for analyzing the number of clusters of natural data in several domains. Various clustering methods have been proposed. However, it is very difficult to choose the method best suited to the type of data. Therefore, the objective of this research was to compare the effectiveness of four clustering techniques with multivariate data. The techniques were: K means Clustering algorithm, Fuzzy C Means algorithm, competitive Neural Network and one novel method Quantum Clustering was added to evaluate its relative performance. Such algorithms may employ distinct principles, and lead to different perfor mance and results. The appropriate choice of a clustering method is a significant and often overlooked aspect in extracting information from large-scale datasets. Evidently, such choice may significantly influence the biological interpretation of the data. We present an easy-to-use and intuitive tool that compares some clustering methods within the same framework. It first reduces the dataset's dimensionality using the Singular Value Decomposition (SVD) method, and only then employs various clustering techniques. Besides its simplicity, and its ability to perform well on high dimensional data, it provides visualization tools for evaluating the results. We tested various algorithms on a variety of datasets.