
Article Information

  • Title: Classification of instagram fake users using supervised machine learning algorithms
  • Authors: Kristo Radion Purba; David Asirvatham; Raja Kumar Murugesan
  • Journal: International Journal of Electrical and Computer Engineering
  • Electronic ISSN: 2088-8708
  • Publication year: 2020
  • Volume: 10
  • Issue: 3
  • Pages: 2763-2772
  • DOI: 10.11591/ijece.v10i3.pp2763-2772
  • Publisher: Institute of Advanced Engineering and Science (IAES)
  • Abstract: On Instagram, the number of followers is a common indicator of success, so follower-selling services have become a large part of the market. Influencers are bombarded with fake followers, which causes business owners to pay more than they should for a brand endorsement. Identifying fake followers is therefore important for determining the authenticity of an influencer. This research aims to identify the behavior of fake users and proposes supervised machine learning models to classify authentic and fake users. The dataset contains fake users bought from various sources, as well as authentic users. Seventeen features are used, drawn from the following sources: 6 metadata, 3 media info, 2 engagement, 2 media tags, and 4 media similarity. Five machine learning algorithms are tested. Three classification approaches are proposed: classification into 2 classes, classification into 4 classes, and classification with metadata. The random forest algorithm produces the highest accuracy for both the 2-class (authentic, fake) and 4-class (authentic, active fake user, inactive fake user, spammer) classification, with accuracy up to 91.76%. The results also show that five metadata variables, i.e. number of posts, followers, biography length, following, and link availability, are the strongest predictors of user class. Additionally, descriptive statistics reveal noticeable differences between fake and authentic users.
  • Keywords: Social media; Machine learning; Paid follower; Fake user; Classification algorithm
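
The feature breakdown and the two labelling schemes described in the abstract can be illustrated with a minimal Python sketch. It encodes only what the abstract states: the size of each feature group and the class names. The names FEATURE_GROUPS, FOUR_CLASSES, to_two_class, and accuracy are hypothetical and are not taken from the paper. Treating every non-authentic class as "fake" in the 2-class scheme is an assumption, since the abstract does not spell out that mapping.

```python
# Feature-group sizes as stated in the abstract. The abstract does not list
# every individual feature, so only the counts are encoded here.
FEATURE_GROUPS = {
    "metadata": 6,
    "media_info": 3,
    "engagement": 2,
    "media_tags": 2,
    "media_similarity": 4,
}

# Class names of the 4-class scheme, as given in the abstract.
FOUR_CLASSES = ("authentic", "active fake user", "inactive fake user", "spammer")


def to_two_class(label):
    """Collapse a 4-class label to the 2-class scheme (authentic vs. fake).

    Assumption: every non-authentic class counts as "fake".
    """
    if label not in FOUR_CLASSES:
        raise ValueError(f"unknown label: {label!r}")
    return "authentic" if label == "authentic" else "fake"


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("label lists must be non-empty and of equal length")
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


# The group sizes should add up to the 17 features the abstract reports.
total_features = sum(FEATURE_GROUPS.values())
```

With these definitions, total_features is the sum of the five group sizes, and accuracy can score either labelling scheme by first passing both the true and the predicted labels through to_two_class.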