Basic Article Information

  • Title: Text Summarization of Multi-Aspect Comments in Social Networks in Persian Language
  • Authors: Hossein Shahverdian; Hassan Saneifar
  • Journal: International Journal of Advanced Computer Science and Applications (IJACSA)
  • Print ISSN: 2158-107X
  • Electronic ISSN: 2156-5570
  • Publication year: 2017
  • Volume: 8
  • Issue: 12
  • DOI: 10.14569/IJACSA.2017.081248
  • Publisher: Science and Information Society (SAI)
  • Abstract: Nowadays, there is an increasingly large amount of user-generated comments on the web. User-generated comments usually contain useful and essential information reflecting the public's or customers' opinions. Since the information in the comments can be used for decision making, production or service improvement, and achieving user satisfaction, systematic analysis of these comments is essential in many domains, including e-commerce, production, and social network analysis. However, analyzing a large volume of comments is a difficult and time-consuming task. Therefore, there is a growing need for a system that can convert this massive volume of comments into a useful and efficient summary. Text summarization lets readers cover more sources in less time and obtain richer information. Although numerous studies have been conducted in the field of multi-document summarization, few have focused on user-generated comments in the Persian language. In this paper, we propose a novel approach to summarizing large volumes of comments in Persian that comes reasonably close to human summarization. Our approach is based on semantic and lexical similarities and uses graph-based summarization. We also propose a clustering step to deal with multiple aspects (subjects) in a corpus of comments. According to the experiments, the summaries extracted by the proposed approach reached an average score of 8.75 out of 10, which improves on the state-of-the-art summarizer's score by about 14 percent.
  • Keywords: Text mining; comments analysis; summarization; graph summarization; Persian language