首页    期刊浏览 2024年11月29日 星期五
登录注册

文章基本信息

  • 标题:Text Analysis Using Different Graph-Based Representations
  • 本地全文:下载
  • 作者:Esteban Castillo ; Ofelia Cervantes ; Darnes Vilariño
  • 期刊名称:Computación y Sistemas
  • 印刷版ISSN:1405-5546
  • 出版年度:2017
  • 卷号:21
  • 期号:4
  • 页码:581-599
  • 语种:English
  • 出版社:Instituto Politécnico Nacional
  • 其他摘要:This paper presents an overview of different graph-based representations proposed to solve text classification tasks. The core of this manuscript is to highlight the importance of enriched/non-enriched co-occurrence graphs as an alternative to traditional features representation models like vector representa- tion, where most of the time these models can not map all the richness of text documents that comes from the web (social media, blogs, personal web pages, news, etc). For each text classification task the type of graph created as well as the benefits of using it are presented and discussed. In specific, the type of features/patterns extracted, the implemented classification/similarity methods and the results obtained in datasets are explained. The theoretical and practical implications of using co-occurrence graphs are also discussed, pointing out the contributions and challenges of modeling text document as graphs.
  • 其他关键词:Text modeling; graph-based representation; co-occurrence graphs; text classification; feature-vector approach; graph similarity approach.
国家哲学社会科学文献中心版权所有