首页    期刊浏览 2024年07月06日 星期六
登录注册

文章基本信息

  • 标题:Text Clustering using Semantic Terms
  • 本地全文:下载
  • 作者:Sun Park ; Seong Ro Lee
  • 期刊名称:International Journal of Hybrid Information Technology
  • 印刷版ISSN:1738-9968
  • 出版年度:2012
  • 卷号:5
  • 期号:2
  • 出版社:SERSC
  • 摘要:In traditional text clustering, documents appear terms frequency without considering the semantic information of each document (i.e., vector model). The property of vector model may be incorrectly classified documents into different clusters when documents of same cluster lack the shared terms. Recently, to overcome this problem uses knowledge based approaches. However, these approaches have an influence of structure of document set and a cost problem of constructing ontology. In this paper, we propose a text clustering method using semantic terms for clustering label and term weights. The semantic terms of clustering label can well express the internal structure of document clusters using non-negative matrix factorization (NMF). It can also improve the quality of text clustering which uses the term weights by WordNet. The experimental results demonstrate that the proposed method achieves better performance than other text clustering methods
  • 关键词:document clustering; NMF; semantic terms; term weight; WordNet
国家哲学社会科学文献中心版权所有