Article Information

  • Title: Author Gender Prediction in an Email Stream Using Neural Networks
  • Authors: William Deitrick; Zachary Miller; Benjamin Valyou
  • Journal: Journal of Intelligent Learning Systems and Applications
  • Print ISSN: 2150-8402
  • Electronic ISSN: 2150-8410
  • Publication year: 2012
  • Volume: 4
  • Issue: 3
  • Pages: 169-175
  • DOI: 10.4236/jilsa.2012.43017
  • Publisher: Scientific Research Publishing
  • Abstract: With the rapid growth of the Internet in recent years, the ability to analyze and identify its users has become increasingly important. Authorship analysis provides a means to glean information about the author of a document originating from the Internet or elsewhere, including but not limited to the author’s gender. There are well-known linguistic differences between the writing of men and women, and these differences can be effectively used to predict the gender of a document’s author. Capitalizing on these linguistic nuances, this study uses a set of stylometric features and a set of word count features to facilitate automatic gender discrimination on emails from the popular Enron email dataset. These features are used in conjunction with the Modified Balanced Winnow Neural Network proposed by Carvalho and Cohen, an improvement on the original Balanced Winnow created by Littlestone. Experiments with the Modified Balanced Winnow show that it is effectively able to discriminate gender using both stylometric and word count features, with the word count features providing superior results.
  • Keywords: 1-Gram Word Counts; Balanced Winnow; Enron Email; Gender Prediction; Neural Network; Stream Mining; Stylometric Features
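
The abstract names two ingredients: 1-gram word count features and a Balanced Winnow style online learner. The sketch below illustrates the general idea only and is not the paper's implementation. It uses the basic multiplicative Balanced Winnow update with an optional margin. The exact update rule of the Modified Balanced Winnow, the parameter values (`alpha`, `beta`, `theta`, the initial weights 2.0 and 1.0), the L1 normalization, the class and function names, and the two toy training texts are all assumptions made for illustration. The labels are simply +1 and -1.

```python
from collections import Counter


def word_count_features(text):
    """1-gram word counts, L1-normalized so the values sum to 1 (an assumption)."""
    counts = Counter(text.lower().split())
    total = sum(counts.values()) or 1  # avoid division by zero on empty text
    return {word: n / total for word, n in counts.items()}


class BalancedWinnowSketch:
    """Simplified Balanced Winnow with a margin; all parameter values are illustrative."""

    def __init__(self, alpha=1.5, beta=0.5, theta=1.0, margin=0.0):
        self.alpha = alpha    # promotion factor, > 1
        self.beta = beta      # demotion factor, between 0 and 1
        self.theta = theta    # decision threshold
        self.margin = margin  # update whenever label * score <= margin
        self.pos = {}         # positive weight per feature
        self.neg = {}         # negative weight per feature

    def score(self, feats):
        # Unseen features fall back to the assumed initial weights 2.0 and 1.0.
        s = sum((self.pos.get(f, 2.0) - self.neg.get(f, 1.0)) * x
                for f, x in feats.items())
        return s - self.theta

    def predict(self, feats):
        return 1 if self.score(feats) > 0 else -1

    def update(self, feats, label):
        """One online step: adjust weights only if the margin is violated."""
        if label * self.score(feats) > self.margin:
            return
        up, down = (self.pos, self.neg) if label > 0 else (self.neg, self.pos)
        for f in feats:
            p = self.pos.setdefault(f, 2.0)
            n = self.neg.setdefault(f, 1.0)
            # Promote the weights on the side of the true label, demote the other side.
            if up is self.pos:
                self.pos[f], self.neg[f] = p * self.alpha, n * self.beta
            else:
                self.pos[f], self.neg[f] = p * self.beta, n * self.alpha


# Hypothetical toy stream (not Enron data): one pass, one example at a time.
stream = [("schedule review attached", 1), ("game night tickets", -1)]
model = BalancedWinnowSketch()
for text, label in stream:
    model.update(word_count_features(text), label)

pred_a = model.predict(word_count_features("review schedule"))
pred_b = model.predict(word_count_features("game tickets"))
```

Because each example is seen once and only the weights of its active features change, a learner of this kind suits the stream setting mentioned in the keywords. Stylometric features could be fed through the same `update` call as additional named entries in the feature dictionary.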