期刊名称:International Journal of Intelligent Systems and Applications
印刷版ISSN:2074-904X
电子版ISSN:2074-9058
出版年度:2019
卷号:11
期号:12
页码:11-19
DOI:10.5815/ijisa.2019.12.02
出版社:MECS Publisher
摘要:The use of technology is at its peak. Many companies try to reduce the work and get an efficient result in a specific amount of time. But a large amount of data is being processed each day that is being stored and turned into large datasets. To get useful information, the dataset needs to be analyzed so that one can extract knowledge by training the machine. Thus, it is important to analyze and extract knowledge from a large dataset. In this paper, we have used two popular classification techniques- Decision tree and Naive Bayes to compare the performance of the classification of our data set. We have taken student performance dataset that has 480 observations. We have classified these students into different groups and then calculated the accuracy of our classification by using the R language. Decision tree uses a divide and conquer method including some rules that makes it easy for humans to understand. The Naive Bayes theorem includes an assumption that the pair of features being classified are independent. It is based on the Bayes theorem..
关键词:Decision tree classification;Data mining;R language;Supervised learning;Naive Bayes classification