首页    期刊浏览 2024年10月06日 星期日
登录注册

文章基本信息

  • 标题:Applying Stylometric Analysis Techniques to Counter Anonymity in Cyberspace
  • 本地全文:下载
  • 作者:Sun, Jianwen ; Yang, Zongkai ; Liu, Sanya
  • 期刊名称:Journal of Networks
  • 印刷版ISSN:1796-2056
  • 出版年度:2012
  • 卷号:7
  • 期号:2
  • 页码:259-266
  • DOI:10.4304/jnw.7.2.259-266
  • 语种:English
  • 出版社:Academy Publisher
  • 摘要:Due to the ubiquitous nature and anonymity abuses in cyberspace, it’s difficult to make criminal identity tracing in cybercrime investigation. Writeprint identification offers a valuable tool to counter anonymity by applying stylometric analysis technique to help identify individuals based on textual traces. In this study, a framework for online writeprint identification is proposed. Variable length character n-gram is used to represent the author’s writing style. The technique of IG seeded GA based feature selection for Ensemble (IGAE) is also developed to build an identification model based on individual author level features. Several specific components for dealing with the individual feature set are integrated to improve the performance. The proposed feature and technique are evaluated on a real world data set encompassing reviews posted by 50 Amazon customers. The experimental results show the effectiveness of the proposed framework, with accuracy over 94% for 20 authors and over 80% for 50 ones. Compared with the baseline technique (Support Vector Machine), a higher performance is achieved by using IGAE, resulting in a 2% and 8% improvement over SVM for 20 and 50 authors respectively. Moreover, it has been shown that IGAE is more scalable in terms of the number of authors, than author group level based methods.
  • 关键词:stylometric analysis; writeprint identification; character n-gram; ensemble learning; genetic algorithm
国家哲学社会科学文献中心版权所有