
Article Information

  • Title: A Graph-based Approach to Text Genre Analysis
  • Authors: Ahmed Ragab Nabhan; Khaled Shaalan
  • Journal: Computación y Sistemas
  • Print ISSN: 1405-5546
  • Year of publication: 2016
  • Volume: 20
  • Issue: 3
  • Pages: 527-539
  • Language: English
  • Publisher: Instituto Politécnico Nacional
  • Abstract: Genre characterization can be achieved by a variety of methods that employ lexical, syntactic, and presentation features of text to highlight key domain differences and stylistic preferences. However, these traditional methods cannot uncover some important macro-structural features that are embedded in text. Representing text as a word graph enables effective frameworks for analyzing and identifying the key topological features that characterize text genres. In this study, we investigated graph features such as clustering coefficients, centralization, diameter, and average path lengths for eight text genres. The findings indicated key patterns that vary from one genre to another according to the stylistic differences in text. Furthermore, evidence of subgenres was found through some graph features such as the number of connected components and node heterogeneity.
  • Keywords: Word graphs; genres analysis; topological features.
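
To make the graph features named in the abstract concrete, the following is a minimal sketch of how a word graph might be built and measured. It is not the authors' implementation: the abstract does not say how the graph is constructed, so the sketch assumes an undirected graph that links each word to the next word in the same sentence. The function names (`build_word_graph`, `graph_features`), the tokenization, and the choice of Freeman degree centralization are all illustrative assumptions.

```python
import re
from collections import defaultdict, deque
from itertools import combinations


def build_word_graph(text):
    """Undirected graph linking adjacent words within a sentence (assumed construction)."""
    graph = defaultdict(set)
    for sentence in re.split(r"[.!?]", text.lower()):
        words = re.findall(r"[a-z]+", sentence)
        for w in words:
            graph[w]  # touching the key registers words that have no neighbours
        for a, b in zip(words, words[1:]):
            if a != b:  # ignore self-loops from repeated words
                graph[a].add(b)
                graph[b].add(a)
    return dict(graph)


def clustering_coefficient(graph, node):
    """Fraction of a node's neighbour pairs that are themselves linked."""
    nbrs = graph[node]
    k = len(nbrs)
    if k < 2:
        return 0.0
    links = sum(1 for u, v in combinations(nbrs, 2) if v in graph[u])
    return 2.0 * links / (k * (k - 1))


def bfs_distances(graph, source):
    """Shortest-path lengths (in edges) from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in graph[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def connected_components(graph):
    """List of node sets, one per connected component."""
    seen, components = set(), []
    for node in graph:
        if node not in seen:
            comp = set(bfs_distances(graph, node))
            seen |= comp
            components.append(comp)
    return components


def graph_features(graph):
    """Compute a few topological features of a non-empty word graph."""
    if not graph:
        raise ValueError("graph has no nodes")
    comps = connected_components(graph)
    largest = max(comps, key=len)
    # Path-based measures are only defined inside one component,
    # so they are taken over the largest component.
    lengths = [d for s in largest
               for t, d in bfs_distances(graph, s).items() if t != s]
    degrees = [len(nbrs) for nbrs in graph.values()]
    n = len(graph)
    # Freeman degree centralization, normalised by the star-graph maximum.
    if n > 2:
        centralization = sum(max(degrees) - d for d in degrees) / ((n - 1) * (n - 2))
    else:
        centralization = 0.0
    return {
        "components": len(comps),
        "avg_clustering": sum(clustering_coefficient(graph, v) for v in graph) / n,
        "diameter": max(lengths, default=0),
        "avg_path_length": sum(lengths) / len(lengths) if lengths else 0.0,
        "centralization": centralization,
    }


if __name__ == "__main__":
    sample = "Graphs describe text. Text genres differ in graph structure."
    for name, value in graph_features(build_word_graph(sample)).items():
        print(name, value)
```

Comparing genres would then mean running `graph_features` on a corpus sample for each genre and comparing the resulting values. Because edges never cross sentence boundaries in this sketch, a text whose sentences share no vocabulary yields more than one connected component.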