首页    期刊浏览 2024年11月29日 星期五
登录注册

文章基本信息

  • 标题:An Improved Correlation-Based Algorithm with Discretization for Attribute Reduction in Data Clustering
  • 作者:S Kannan ; Dr Ramaraj
  • 期刊名称:Data Science Journal
  • 电子版ISSN:1683-1470
  • 出版年度:2015
  • 卷号:8
  • DOI:10.2481/dsj.007-044
  • 语种:English
  • 出版社:Ubiquity Press
  • 摘要:Attribute reduction aims to reduce the dimensionality of large scale data without losing useful information and is an important topic of knowledge discovery, data clustering, and classification. In this paper, we aim to solve the current problem that a continuous attribute in a clustering or classification algorithm must be made discrete. We propose a new algorithm of data reduction based on a correlation model with data discretization. It deals with selection of continuous attributes from a very large set of attributes. The proposed algorithm is an extended version of the Fast Correlation-based filter algorithm and is named FCBF + . The FCBF + algorithm performs the discretization of continuous attributes in an efficient manner. Then it selects the relevant attributes from a very large set of attributes. Performance evaluation is done on clustering accuracy for all the features, and a reduced set of features is obtained using FCBF + . It is found that the proposed FCBF + algorithm improves the clustering accuracy of various clustering algorithms.
  • 关键词:Clustering; Attribute reduction; Data discretization; Correlation-based model; Knowledge discovery; Data mining
Loading...
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有