期刊名称:Journal of Artificial Intelligence and Soft Computing Research
电子版ISSN:2083-2567
出版年度:2013
卷号:3
期号:1
页码:51-71
DOI:10.2478/jaiscr-2014-0005
出版社:Walter de Gruyter GmbH
摘要:Clustering is widely used to explore and understand large collections of data. K-means clustering method is one of the most popular approaches due to its ease of use and simplicity to implement. This paper introduces Density-based Split- and -Merge K-means clustering Algorithm (DSMK-means), which is developed to address stability problems of standard K-means clustering algorithm, and to improve the performance of clustering when dealing with datasets that contain clusters with different complex shapes and noise or outliers. Based on a set of many experiments, this paper concluded that developed algorithms “DSMK-means” are more capable of finding high accuracy results compared with other algorithms especially as they can process datasets containing clusters with different shapes, densities, or those with outliers and noise.