文章基本信息

标题：Voting-based Classification for E-mail Spam Detection
本地全文：下载
作者：Bashar Awad Al-Shboul ; Heba Hakh ; Hossam Faris 等
期刊名称：Journal of ICT Research and Applications
印刷版ISSN：2337-5787
电子版ISSN：2338-5499
出版年度：2016
卷号：10
期号：1
页码：29-42
语种：English
出版社：Institut Teknologi Bandung
其他摘要：The problem of spam e-mail has gained a tremendous amount of attention. Although entities tend to use e-mail spam filter applications to filter out received spam e-mails, marketing companies still tend to send unsolicited e-mails in bulk and users still receive a reasonable amount of spam e-mail despite those filtering applications. This work proposes a new method for classifying e-mails into spam and non-spam. First, several e-mail content features are extracted and then those features are used for classifying each e-mail individually. The classification results of three different classifiers (i.e. Decision Trees, Random Forests and k-Nearest Neighbor) are combined in various voting schemes (i.e. majority vote, average probability, product of probabilities, minimum probability and maximum probability) for making the final decision. To validate our method, two different spam e-mail collections were used.