文章基本信息

标题：A Benchmark to Select Data Mining Based Classification Algorithms for Business Intelligence and Decision Support Systems
本地全文：下载
作者：Pardeep Kumar ; Nitin ; Vivek Kumar Sehgal 等
期刊名称：International Journal of Data Mining & Knowledge Management Process
印刷版ISSN：2231-007X
电子版ISSN：2230-9608
出版年度：2012
卷号：2
期号：5
出版社：Academy & Industry Research Collaboration Center (AIRCC)
摘要：In today’s business scenario, we percept major changes in how managers use computerized support in making decisions. As more number of decision-makers use computerized support in decision making, decision support systems (DSS) is developing from its starting as a personal support tool and is becoming the common resource in an organization. DSS serve the management, operations, and planning levels of an organization and help to make decisions, which may be rapidly changing and not easily specified in advance. Data mining has a vital role to extract important information to help in decision making of a decision support system. It has been the active field of research in the last two-three decades. Integration of data mining and decision support systems (DSS) can lead to the improved performance and can enable the tackling of new types of problems. Artificial Intelligence methods are improving the quality of decision support, and have become embedded in many applications ranges from ant locking automobile brakes to these days interactive search engines. It provides various machine learning techniques to support data mining. The classification is one of the main and valuable tasks of data mining. Several types of classification algorithms have been suggested, tested and compared to determine the future trends based on unseen data. There has been no single algorithm found to be superior over all others for all data sets. Various issues such as predictive accuracy, training time to build the model, robustness and scalability must be considered and can have tradeoffs, further complex the quest for an overall superior method. The objective of this paper is to compare various classification algorithms that have been frequently used in data mining for decision support systems. Three decision trees based algorithms, one artificial neural network, one statistical, one support vector machines with and without adaboost and one clustering algorithm are tested and compared on four datasets from different domains in terms of predictive accuracy, error rate, classification index, comprehensibility and training time. Experimental results demonstrate that Genetic Algorithm (GA) and support vector machines based algorithms are better in terms of predictive accuracy. Former shows highest comprehensibility but is slower than later. From the decision tree based algorithms, QUEST produces trees with lesser breadth and depth showing more comprehensibility. This research work shows that GA based algorithm is more powerful algorithm and shall be the first choice of organizations for their decision support systems. SVM without adaboost shall be the first choice in context of speed and predictive accuracy. Adaboost improves the accuracy of SVM but on the cost of large training time.
关键词：Artificial Intelligence; Decision Support System; Data Mining; KDD; Classification Algorithms; Predictive;Accuracy; Comprehensibility; Genetic Algorithm