首页    期刊浏览 2024年10月06日 星期日
登录注册

文章基本信息

  • 标题:Semantic Document Classification based on Strategies of Semantic Similarity Computation and Correlation Analysis
  • 本地全文:下载
  • 作者:Shuo Yang ; Ran Wei ; Hengliang Tan
  • 期刊名称:Computer Science & Information Technology
  • 电子版ISSN:2231-5403
  • 出版年度:2019
  • 卷号:9
  • 期号:13
  • 页码:1-17
  • DOI:10.5121/csit.2019.91301
  • 出版社:Academy & Industry Research Collaboration Center (AIRCC)
  • 摘要:Document (text) classification is a common method in e-business, facilitating users in the tasks such as document collection, analysis, categorization and storage. Semantic analysis can help to improve the performance of document classification. Though having been considered when designing previous methods for automatic document classification, more focus should be given to semantics with the increase number of content-rich electronic documents, forum posts or blogs online, which can reduce human workload by a great margin. This paper proposes a novel semantic document classification approach aiming to resolve two types of semantic problems: (1) polysemy problem, by using a novel semantic similarity computing strategy (SSC) and (2) synonym problem, by proposing a novel strong correlation analysis method (SCM). Experiments show that our strategies can help to improve the performance of the baseline methods.
  • 关键词:semantic document classification; semantic similarity; semantic embedding; correlation; analysis; machine learning
国家哲学社会科学文献中心版权所有