期刊名称:Journal of Theoretical and Applied Information Technology
印刷版ISSN:1992-8645
电子版ISSN:1817-3195
出版年度:2014
卷号:67
期号:3
出版社:Journal of Theoretical and Applied
摘要:Classification and prediction in temporal data is one of important tasks of data mining which are used in various fields such as financial data forecasting, meteorological data prediction, geographic data processing, and biomedical data analysis. A number of methods have been developed based on learning models such as decision tree, neural network or using other data mining task such as clustering, association rule and sequence pattern mining. One of the classification algorithms based on sequential mining is Classify-by-Sequence (CBS) algorithm. CBS algorithm is a combination of sequential pattern mining with probabilistic induction. However, this algorithm has a drawback regarding its computational time which is quite inefficient. This paper proposes an improvement of CBS algorithm to solve the computational time�s inefficiency. The enhancement is conducted by improving the pruning part using FGSP and FEAT algorithm. These algorithms produce classifier that will use to classify testing data. The quality of classifier is then measured using 〖covacc〗_t parameter which is the combination of coverage and accuracy. To achieve the purpose of this research, we use meteorological data to classify rain or dry season. The simulation result show that the computational time of modified CBS algorithm with FGSP algorithm is faster than the modification using FEAT algorithm with the same level of accuracy.
关键词:Temporal Data; Classify by Sequence; Pruning Strategy; FEAT; FGSP.