首页    期刊浏览 2025年02月22日 星期六
登录注册

文章基本信息

  • 标题:Hoax Classification With Term Frequency – Inverse Document Frequency Using Non-Linear SVM and Naïve Bayes
  • 本地全文:下载
  • 作者:Ayundyah Kesumawati ; Achmad Kurniansyah Thalib
  • 期刊名称:International Journal of Advances in Soft Computing and Its Applications
  • 印刷版ISSN:2074-8523
  • 出版年度:2018
  • 卷号:10
  • 期号:3
  • 出版社:International Center for Scientific Research and Studies
  • 摘要:In recent years, there are crucial issues in the modern society that gain information on the internet. Spreading the news very easily but can lead to very difficult to filtering the information. The flow of information that provides broad benefits to society, can even enter into the psychology and social for the integrity of the Nation. Information that is easily obtained is extremely dangerous in terms of validity and is not uncommonly a hoax. The dataset that used in this research was gained from news website detik.com and turnbackhoax.id. in this research will provide the comparing of two methods there are Naïve Bayes Classifier (NBC) and Support Vector Machine (SVM) with Radial Basis Function. This research using the Term Frequency – Inverse Document Frequency Weighting (TF-IDFW) that separated each word to make it easy to analyze the text classification. The results obtained for accuracy NBC with training data of 1.480 and test data of 369 is 85.09% and for SVM obtained an accuracy of 83.74%. In addition, the merging of information with text mining, the keyword for the news category is "Price", followed by "KPK", "Stock", "Indonesia", "DPR", and "Police". For the hoax category, the most words are the word "Price", followed by "KPK", "Stock", "Indonesia", "DPR", and "Police".
  • 关键词:News; Hoax; TF-IDFW; NBC; Text Mining; SVM
国家哲学社会科学文献中心版权所有