首页    期刊浏览 2024年07月05日 星期五
登录注册

文章基本信息

  • 标题:Cluster Analysis Based on Contextual Features Extraction for Conversational Corpus
  • 本地全文:下载
  • 作者:Qi Chen 1,3 , Yue Chen 2,3 , Minghu Jiang
  • 期刊名称:Journal of Computer and Communications
  • 印刷版ISSN:2327-5219
  • 电子版ISSN:2327-5227
  • 出版年度:2015
  • 卷号:03
  • 期号:05
  • 页码:33-37
  • DOI:10.4236/jcc.2015.35004
  • 语种:English
  • 出版社:Scientific Research Publishing
  • 摘要:Cluster analysis related to computational linguistics seldom concerned with Pragmatics level. Features of corpus on Pragmatics level related to specific situations, including backgrounds, titles and habits. To improve the accuracy of clustering for conversations collected from international students in Tsinghua University, it required contextual features. Here, we collected four-hundred conversations as a corpus and built it to Vector Space Model. With the Oxford-Duden Dictionary and other methods we modified the model and concluded into three groups. We testified our hypothesis through self-organizing map neural network. The result suggested that the modified model had a better outcome.
  • 关键词:Conversational Corpus; Contextual Features; VSM; SOM
国家哲学社会科学文献中心版权所有