首页    期刊浏览 2024年12月01日 星期日
登录注册

文章基本信息

  • 标题:A Novel Text Representation Model to Categorize Text Documents using Convolution Neural Network
  • 本地全文:下载
  • 作者:M. B. Revanasiddappa ; B. S. Harish
  • 期刊名称:International Journal of Intelligent Systems and Applications
  • 印刷版ISSN:2074-904X
  • 电子版ISSN:2074-9058
  • 出版年度:2019
  • 卷号:11
  • 期号:5
  • 页码:36-45
  • DOI:10.5815/ijisa.2019.05.05
  • 出版社:MECS Publisher
  • 摘要:This paper presents a novel text representation model called Convolution Term Model (CTM) for effective text categorization. In the process of text categorization, representation plays a very primary role. The proposed CTM is based on Convolution Neural Network (CNN). The main advantage of proposed text representation model is that, it preserves semantic relationship and minimizes the feature extraction burden. In proposed model, initially convolution filter is applied on word embedding matrix. Since, the resultant CTM matrix is higher dimension, feature selection methods are applied to reduce the CTM feature space. Further, selected CTM features are fed into classifier to categorize text document. To discover the effectiveness of the proposed model, extensive experimentations are carried out on four standard benchmark datasets viz., 20-NewsGroups, Reuter-21758, Vehicle Wikipedia and 4 University datasets using five different classifiers. Accuracy is used to assess the performance of classifiers. The proposed model shows impressive results with all classifiers.
  • 关键词:Text Documents;Convolution Neural Network;Representation;Feature Selection;Categorization
国家哲学社会科学文献中心版权所有