首页    期刊浏览 2024年09月16日 星期一
登录注册

文章基本信息

  • 标题:Batch -Incremental Classification of Stream Data Using Storage
  • 作者:Parita Ponkiya ; Rohit Srivastava
  • 期刊名称:International Journal of Computer Science and Network Security
  • 印刷版ISSN:1738-7906
  • 出版年度:2015
  • 卷号:15
  • 期号:4
  • 页码:95-99
  • 出版社:International Journal of Computer Science and Network Security
  • 摘要:Data mining is a technique that is used to extract useful knowledge from large amount of data. And classification is most important task of data mining. Now a day��s in real world stream data is most important source of knowledge. Stream data is data that continuously arrives over the time i.e. growth of data is increasing faster and faster. Traditional classification algorithms are not suitable for such data. Continuous growth of the data makes previously constructed classification tree outdated and it is to be reconstructed from the scratch, which is very time consuming. Another major issue is the data-type, as each of them is to be treated separately, among which the continuous data produces major challenge in the tree building task, needs to be discretized. Out of many classifications algorithms, ID3 is a famous tree based classification algorithm which deals with only categorical data and uses information gain for attribute selection. In this paper the tree based Batch incremental classification algorithm is proposed for stream data that outputs tree same as ID3. It uses CAIM based discretization for continuous attributes and various attribute selection criterions along with storage structure for the strategic information of every node and the historical data to rebuild decision tree. CAIR, CAIU, CAIM criterions are used as attribute selection criterions and comparison is also provided between these attribute selection measures
  • 关键词:Classification; CAIR; CAIM; CAIU; Information Gain; Batch Incremental Classification
Loading...
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有