首页    期刊浏览 2024年10月06日 星期日
登录注册

文章基本信息

  • 标题:A Method for Extracting Important Segments from Documents Using Support Vector Machines Toward Automatic Text Summarization
  • 本地全文:下载
  • 作者:Daisuke Suzuki ; Akira Utsumi
  • 期刊名称:人工知能学会論文誌
  • 印刷版ISSN:1346-0714
  • 电子版ISSN:1346-8030
  • 出版年度:2006
  • 卷号:21
  • 期号:4
  • 页码:330-339
  • DOI:10.1527/tjsai.21.330
  • 出版社:The Japanese Society for Artificial Intelligence
  • 摘要:In this paper we propose an extraction-based method for automatic summarization. The proposed method consists of two processes: important segment extraction and sentence compaction. The process of important segment extraction classifies each segment in a document as important or not by Support Vector Machines (SVMs). The process of sentence compaction then determines grammatically appropriate portions of a sentence for a summary according to its dependency structure and the classification result by SVMs. To test the performance of our method, we conducted an evaluation experiment using the Text Summarization Challenge (TSC-1) corpus of human-prepared summaries. The result was that our method achieved better performance than a segment-extraction-only method and the Lead method, especially for sentences only a part of which was included in human summaries. Further analysis of the experimental results suggests that a hybrid method that integrates sentence extraction with segment extraction may generate better summaries.
  • 关键词:segment extraction ; dependency structure ; automatic summarization ; Support Vector Machines
国家哲学社会科学文献中心版权所有