首页    期刊浏览 2025年06月13日 星期五
登录注册

文章基本信息

  • 标题:Design analysis and implementation of efficient parameter free algorithm for high quality homogeneous clusters in data mining applications
  • 作者:Prasad S.Halgaonkar ; Vijay M.Wadhai ; A.D.Potgantwar
  • 期刊名称:International Journal of Computer Science and Network Security
  • 印刷版ISSN:1738-7906
  • 出版年度:2010
  • 卷号:10
  • 期号:2
  • 页码:246-253
  • 出版社:International Journal of Computer Science and Network Security
  • 摘要:A new algorithm for clustering high-dimensional categorical data is proposed and implemented by us. Our algorithm is parameter-free, fully-automatic and is based on a two-phase iterative procedure. In the first phase, cluster assignments are given, and a new cluster is added to the partition by identifying and splitting a low-quality cluster. Second phase attempts to optimize clusters. This algorithm is parametric to cluster quality in terms of homogeneity. We show how a suitable notion of cluster homogeneity can be defined in the context of high-dimensional categorical data, from which an effective instance of the proposed clustering scheme immediately follows. Our experiments carried out on real data shows that the devised algorithm achieves optimal results in terms of compactness and separation.
  • 关键词:Clustering; high-dimensional categorical data; information search and retrieval
Loading...
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有