Abstract: With the growing use of the Internet and social networks, spammers cause security problems and numerous other challenges for these services. Spammer detection has attracted much attention in recent years, and researchers have proposed several strategies for detecting spammers and limiting their activities. However, many challenges and open questions remain in this area and require further research. This study proposes a graph-analysis-based method for spammer detection that analyzes spammers' behavior and their relations with other users, and it provides a solution to facilitate the detection process. By combining graph analysis with behavior analysis, the paper aims to increase diagnostic accuracy and detection rate with the help of appropriate classification algorithms and the most effective features. Two scenarios were used to achieve higher accuracy and fewer false positives. The first scenario used the entire dataset to build and evaluate the model. The results showed that, despite its high precision, this approach is not appropriate because of its high level of false positives. In the second scenario, the ratio of normal users to spammers was set to 2 to 1, which led to satisfactory results. After reviewing the confusion matrices and false positives of the different algorithms, the Logistic algorithm was chosen as an appropriate algorithm that meets the objective of this study.
Keywords: Spam; spammers; spam detection; user behavior analysis; graph analysis.
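
The second scenario described in the abstract (a 2-to-1 ratio of normal users to spammers, judged by the confusion matrix and false positives) can be illustrated with a minimal sketch. The abstract does not say how the ratio was obtained, so random undersampling of normal users is an assumption here, as are the function names `undersample`, `confusion_counts`, and `false_positive_rate` and the label encoding (1 for spammer, 0 for normal user). It is not the authors' implementation.

```python
import random


def undersample(labels, ratio=2, seed=0):
    """Keep every spammer (label 1) and at most ratio * n_spammers
    randomly chosen normal users (label 0). Returns sorted indices.
    Assumption: random undersampling; the paper may have used another scheme."""
    rng = random.Random(seed)
    spam = [i for i, y in enumerate(labels) if y == 1]
    normal = [i for i, y in enumerate(labels) if y == 0]
    keep = rng.sample(normal, min(len(normal), ratio * len(spam)))
    return sorted(spam + keep)


def confusion_counts(y_true, y_pred):
    """Return (tp, fp, tn, fn) with spammer (1) as the positive class."""
    tp = fp = tn = fn = 0
    for t, p in zip(y_true, y_pred):
        if t == 1 and p == 1:
            tp += 1
        elif t == 0 and p == 1:
            fp += 1
        elif t == 0 and p == 0:
            tn += 1
        else:
            fn += 1
    return tp, fp, tn, fn


def false_positive_rate(y_true, y_pred):
    """Share of normal users wrongly flagged as spammers: fp / (fp + tn)."""
    _, fp, tn, _ = confusion_counts(y_true, y_pred)
    return fp / (fp + tn) if (fp + tn) else 0.0
```

For example, with 3 spammers and 20 normal users, `undersample` keeps all 3 spammers plus 6 normal users; any classifier's predictions on held-out data can then be passed to `false_positive_rate` to compare algorithms on the criterion the abstract emphasizes.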