首页    期刊浏览 2024年11月30日 星期六
登录注册

文章基本信息

  • 标题:Data Mining Technology Application in False Text Information Recognition
  • 本地全文:下载
  • 作者:Jie Wan ; Xue Cao ; Kun Yao
  • 期刊名称:Mobile Information Systems
  • 印刷版ISSN:1574-017X
  • 出版年度:2021
  • 卷号:2021
  • 页码:1-13
  • DOI:10.1155/2021/4206424
  • 出版社:Hindawi Publishing Corporation
  • 摘要:False information on the Internet is being heralded as serious social harm to our society. To recognize false text information, in this paper, an effective method for mining text features is proposed in the field of false drug advertisements. Firstly, the data of false drug advertisements and real drug advertisements were collected from the official websites to build a database of false and real drug advertisements. Secondly, by performing feature extraction on the text of drug advertisements, this work built a characteristic matrix based on the effective features and assigned positive or negative labels to the feature vector of the matrix according to whether it is a fake medical advertisement or not. Thirdly, this study trained and tested several different classifiers, selected the classification model with the best performance in identifying false drug advertisements, and found the key characteristics that can determine the classification. Finally, the model with the best performance was used to predict new false drug advertisements collected from Sina Weibo. In the case of identifying false drug advertisements, the classification effect of the support vector machine (SVM) classifier established on the feature set after feature selection was the most effective. The findings of this study can provide an effective method for the government to identify and combat false advertisements. This study has a certain reference significance in demonstrating the use of text data mining technology to identify and detect information fraud behavior.
国家哲学社会科学文献中心版权所有