期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
印刷版ISSN:2158-107X
电子版ISSN:2156-5570
出版年度:2019
卷号:10
期号:7
页码:651-657
DOI:10.14569/IJACSA.2019.0100785
出版社:Science and Information Society (SAI)
摘要:Gaining the most benefits out of a certain data set is a difficult task because it requires an in-depth investigation into its different features and their corresponding values. This task is usually achieved by presenting data in a visual format to reveal hidden patterns. In this study, several visualization techniques are applied to a bank’s direct marketing data set. The data set obtained from the UCI machine learning repository website is imbalanced. Thus, some oversampling methods are used to enhance the accuracy of the prediction of a client’s subscription to a term deposit. Visualization efficiency is tested with the oversampling techniques’ influence on multiple classifier performance. Results show that the agglomerative hierarchical clustering technique outperforms other oversampling techniques and the Naive Bayes classifier gave the best prediction results.
关键词:Bank direct marketing; prediction; visualization; oversampling; Naive Bayes