Journal: IAENG International Journal of Computer Science
Print ISSN: 1819-656X
Electronic ISSN: 1819-9224
Year of publication: 2019
Volume: 46
Issue: 1
Pages: 46-53
Publisher: IAENG - International Association of Engineers
Abstract: A classification model based on the naïve Bayes algorithm is proposed to classify spam messages more effectively. Spam message classification models based on the naïve Bayes algorithm are constructed for both multi-classification and multi-two-classification, through steps involving text preprocessing based on regular expressions and feature extraction based on Jieba segmentation and the TF-IDF (term frequency–inverse document frequency) algorithm. By further comparing classification performance against the support vector machine and random forest algorithms, the naïve Bayes algorithm based on multi-two-classification is shown to perform best among the models compared.
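
The pipeline named in the abstract (regular-expression preprocessing, segmentation, TF-IDF features, naïve Bayes) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that "multi-two-classification" means training one binary "class vs. rest" naïve Bayes model per category and picking the category with the highest log-odds. The function names, the smoothed IDF formula, the whitespace tokenizer (a stand-in for Jieba so the sketch needs only the standard library), and the toy messages and labels are all hypothetical.

```python
import math
import re
from collections import Counter

def tokenize(text):
    # Regex preprocessing: drop URLs, then digits and punctuation, then lowercase.
    cleaned = re.sub(r"https?://\S+", " ", text)
    cleaned = re.sub(r"[^\w\s]|\d", " ", cleaned).lower()
    # Assumption: whitespace split stands in for Jieba segmentation.
    # For Chinese text, jieba.lcut(cleaned) would be used here instead.
    return cleaned.split()

def fit_idf(token_lists):
    # Smoothed inverse document frequency (one common variant, chosen here).
    n = len(token_lists)
    df = Counter(t for tokens in token_lists for t in set(tokens))
    return {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}

def tfidf(tokens, idf):
    # Terms unseen during training are ignored.
    counts = Counter(t for t in tokens if t in idf)
    total = sum(counts.values()) or 1
    return {t: (c / total) * idf[t] for t, c in counts.items()}

class BinaryNB:
    """Multinomial naive Bayes for a single 'class vs. rest' decision."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # Laplace smoothing constant

    def fit(self, vectors, is_target, vocab):
        self.log_prior, self.log_like = {}, {}
        for side in (True, False):
            docs = [v for v, y in zip(vectors, is_target) if y == side]
            self.log_prior[side] = math.log(len(docs) / len(vectors))
            totals = Counter()
            for v in docs:
                totals.update(v)  # accumulate TF-IDF weight per term
            denom = sum(totals.values()) + self.alpha * len(vocab)
            self.log_like[side] = {
                t: math.log((totals[t] + self.alpha) / denom) for t in vocab
            }
        return self

    def log_odds(self, vector):
        def score(side):
            return self.log_prior[side] + sum(
                w * self.log_like[side][t] for t, w in vector.items()
            )
        return score(True) - score(False)

def train_one_vs_rest(texts, labels):
    token_lists = [tokenize(t) for t in texts]
    idf = fit_idf(token_lists)
    vectors = [tfidf(tokens, idf) for tokens in token_lists]
    models = {
        c: BinaryNB().fit(vectors, [y == c for y in labels], set(idf))
        for c in sorted(set(labels))
    }
    return idf, models

def predict(text, idf, models):
    vec = tfidf(tokenize(text), idf)
    # Pick the category whose binary model is most confident.
    return max(models, key=lambda c: models[c].log_odds(vec))

# Hypothetical toy data, not taken from the paper.
texts = [
    "win a free prize now", "free cash prize click http://x.example",
    "cheap loan offer apply today", "low rate loan offer",
    "meeting at noon tomorrow", "see you at lunch tomorrow",
]
labels = ["promo", "promo", "loan", "loan", "ham", "ham"]
idf, models = train_one_vs_rest(texts, labels)
print(predict("free prize inside", idf, models))
```

With real data, the same structure would apply after swapping in Jieba for tokenization; the comparison against support vector machine and random forest classifiers mentioned in the abstract is not reproduced here.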