首页    期刊浏览 2025年01月05日 星期日
登录注册

文章基本信息

  • 标题:LeaDen-Stream: A Leader Density-Based Clustering Algorithm over Evolving Data Stream
  • 本地全文:下载
  • 作者:Amineh Amini ; Teh Ying Wah
  • 期刊名称:Journal of Computer and Communications
  • 印刷版ISSN:2327-5219
  • 电子版ISSN:2327-5227
  • 出版年度:2013
  • 卷号:01
  • 期号:05
  • 页码:26-31
  • DOI:10.4236/jcc.2013.15005
  • 语种:English
  • 出版社:Scientific Research Publishing
  • 摘要:Clustering evolving data streams is important to be performed in a limited time with a reasonable quality. The existing micro clustering based methods do not consider the distribution of data points inside the micro cluster. We propose LeaDen-Stream (Leader Density-based clustering algorithm over evolving data Stream), a density-based clustering algorithm using leader clustering. The algorithm is based on a two-phase clustering. The online phase selects the proper mini-micro or micro-cluster leaders based on the distribution of data points in the micro clusters. Then, the leader centers are sent to the offline phase to form final clusters. In LeaDen-Stream, by carefully choosing between two kinds of micro leaders, we decrease time complexity of the clustering while maintaining the cluster quality. A pruning strategy is also used to filter out real data from noise by introducing dense and sparse mini-micro and micro-cluster leaders. Our performance study over a number of real and synthetic data sets demonstrates the effectiveness and efficiency of our method.
  • 关键词:Evolving Data Streams; Density-Based Clustering; Micro Cluster; Mini-Micro Cluster
国家哲学社会科学文献中心版权所有