首页    期刊浏览 2024年10月05日 星期六
登录注册

文章基本信息

  • 标题:Efficient Density-Based Partitional Clustering Algorithm
  • 本地全文:下载
  • 作者:Zareen Alamgir ; Hina Naveed
  • 期刊名称:COMPUTING AND INFORMATICS
  • 印刷版ISSN:1335-9150
  • 出版年度:2021
  • 卷号:40
  • 期号:6
  • 页码:1322-1344
  • DOI:10.31577/cai_2021_6_1322
  • 语种:English
  • 出版社:COMPUTING AND INFORMATICS
  • 摘要:Clustering is an important data mining technique that helps to detect hidden structures and patterns in the data. K-means algorithm is one of the most popular and widely used partitional clustering algorithms. It is a simple and efficient method but has several shortcomings. One major drawback of traditional K-means is that it selects initial centroids randomly, resulting in low-quality clusters. Various K-means extensions are designed to solve the issue of the random initial centroid. A novel density-based K-means (DK-means) algorithm is recently proposed that uses density-parameters for selecting initial centroids. It outperforms K-means in terms of accuracy at the cost of time. In this research, we present an efficient density-based K-means algorithm (EDK-means) that uses advance data structures and significantly reduces the DK-means algorithm's execution time. Furthermore, we rigorously evaluated the performance of density-based K-means on different challenging real-world datasets and compared it with traditional K-means. The experimental results are promising and show that density-based K-means outperforms K-means. It converges more rapidly than basic K-means, and it works well for the datasets with different cluster sizes.
  • 关键词:Clustering;K-means;density-based K-means;EDK-means;partitional clustering
国家哲学社会科学文献中心版权所有