首页    期刊浏览 2024年10月07日 星期一
登录注册

文章基本信息

  • 标题:Gamma-Poisson Distribution Model for Text Categorization
  • 本地全文:下载
  • 作者:Hiroshi Ogura ; Hiromi Amano ; Masato Kondo
  • 期刊名称:ISRN Artificial Intelligence
  • 印刷版ISSN:2090-7435
  • 电子版ISSN:2090-7443
  • 出版年度:2013
  • 卷号:2013
  • DOI:10.1155/2013/829630
  • 出版社:Hindawi Publishing Corporation
  • 摘要:We introduce a new model for describing word frequency distributions in documents for automatic text classification tasks. In the model, the gamma-Poisson probability distribution is used to achieve better text modeling. The framework of the modeling and its application to text categorization are demonstrated with practical techniques for parameter estimation and vector normalization. To investigate the efficiency of our model, text categorization experiments were performed on 20 Newsgroups, Reuters-21578, Industry Sector, and TechTC-100 datasets. The results show that the model allows performance comparable to that of the support vector machine and clearly exceeding that of the multinomial model and the Dirichlet-multinomial model. The time complexity of the proposed classifier and its advantage in practical applications are also discussed.
国家哲学社会科学文献中心版权所有