
Article Information

  • Title: Discovering Knowledge Patterns from Integration of Clustering and Classification Techniques
  • Authors: Raman Pathrey; Yogesh Kumar; Nitin
  • Journal: International Journal of Advanced Research In Computer Science and Software Engineering
  • Print ISSN: 2277-6451
  • Electronic ISSN: 2277-128X
  • Publication year: 2013
  • Volume: 3
  • Issue: 4
  • Publisher: S.S. Mishra
  • Abstract: Data mining is essentially the discovery of valuable information and patterns from large volumes of available data. Two indispensable techniques of data mining are clustering and classification: classification uses a set of pre-classified examples to build a model that can classify the wider population of records, while clustering divides the data into groups of similar objects. In this paper we propose a new method for data classification that integrates these two techniques. A comparative study is then carried out between simple classification and the proposed integrated clustering-classification technique. Four popular data mining tools were used for both techniques, with six different classifiers and one clusterer for all sets. Across all the tools used, the integrated clustering-classification technique was better than the simple classification technique, and this result was consistent for all six classifiers. For both techniques, the best classifier was found to be SVM. Of the four tools used, WEKA was found to be the best in terms of flexibility of algorithms. All comparisons were made on the percentage accuracy of each classifier used.
  • Keywords: data mining; classification; integrated clustering-classification; data mining tools; Pima Indians Diabetes dataset; hybrid