首页    期刊浏览 2024年12月02日 星期一
登录注册

文章基本信息

  • 标题:A Novel Method of Significant Words Identification in Text Summarization
  • 本地全文:下载
  • 作者:Kiabod, Maryam ; Dehkordi, Mohammad Naderi ; Sharafi, Sayed Mehran
  • 期刊名称:Journal of Emerging Technologies in Web Intelligence
  • 印刷版ISSN:1798-0461
  • 出版年度:2012
  • 卷号:4
  • 期号:3
  • 页码:252-258
  • DOI:10.4304/jetwi.4.3.252-258
  • 语种:English
  • 出版社:Academy Publisher
  • 摘要:Text summarization is a process that reduces the size of the text document and extracts significant sentences from a text document. We present a novel technique for text summarization. The originality of technique lies on exploiting local and global properties of words and identifying significant words. The local property of word can be considered as the sum of normalized term frequency multiplied by its weight and normalized number of sentences containing that word multiplied by its weight. If local score of a word is less than local score threshold, we remove that word. Global property can be thought of as maximum semantic similarity between a word and title words. Also we introduce an iterative algorithm to identify significant words. This algorithm converges to the fixed number of significant words after some iterations and the number of iterations strongly depends on the text document. We used a two-layered backpropagation neural network with three neurons in the hidden layer to calculate weights. The results show that this technique has better performance than MS-word 2007, baseline and Gistsumm summarizers.
  • 关键词:Significant Words;Text Summarization;Pruning Algorithm
国家哲学社会科学文献中心版权所有