
Article Information

  • Title: Unsupervised Extractive Summarization using Pointwise Mutual Information
  • Authors: Vishakh Padmakumar; He He
  • Venue: Conference of the European Chapter of the Association for Computational Linguistics (EACL)
  • Year: 2021
  • Volume: 2021
  • Pages: 2505-2512
  • DOI: 10.18653/v1/2021.eacl-main.213
  • Language: English
  • Publisher: ACL Anthology
  • Abstract: Unsupervised approaches to extractive summarization usually rely on a notion of sentence importance defined by the semantic similarity between a sentence and the document. We propose new metrics of relevance and redundancy using pointwise mutual information (PMI) between sentences, which can be easily computed by a pre-trained language model. Intuitively, a relevant sentence allows readers to infer the document content (high PMI with the document), and a redundant sentence can be inferred from the summary (high PMI with the summary). We then develop a greedy sentence selection algorithm to maximize relevance and minimize redundancy of extracted sentences. We show that our method outperforms similarity-based methods on datasets in a range of domains including news, medical journal articles, and personal anecdotes.
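
Illustrative sketch (not part of the original record): the abstract describes a greedy selection that trades relevance against redundancy, both measured by PMI. The Python below is a minimal sketch of that general idea and should not be read as the authors' implementation. It assumes the standard definition PMI(x; y) = log p(y | x) - log p(y), and it assumes a simple combined score of relevance minus a weighted sum of redundancy with already selected sentences. The names `pmi`, `greedy_select`, and the weight `lam` are hypothetical, and the scores are supplied as plain numbers rather than computed by a language model.

```python
import math
from typing import List, Sequence


def pmi(log_p_y_given_x: float, log_p_y: float) -> float:
    """Pointwise mutual information from two log-probabilities.

    Standard definition: PMI(x; y) = log p(y | x) - log p(y).
    In practice both terms would come from a pre-trained language model;
    here they are plain inputs (an assumption of this sketch).
    """
    return log_p_y_given_x - log_p_y


def greedy_select(
    relevance: Sequence[float],
    redundancy: Sequence[Sequence[float]],
    k: int,
    lam: float = 1.0,
) -> List[int]:
    """Greedily pick up to k sentence indices.

    relevance[i]     : assumed PMI-style score of sentence i with the document.
    redundancy[i][j] : assumed PMI-style score between sentences i and j.
    lam              : hypothetical weight on the redundancy penalty.
    """
    selected: List[int] = []
    candidates = list(range(len(relevance)))
    while candidates and len(selected) < k:
        def score(i: int) -> float:
            # Penalise sentences that the current summary already predicts.
            penalty = sum(redundancy[i][j] for j in selected)
            return relevance[i] - lam * penalty

        best = max(candidates, key=score)  # ties resolve to the lowest index
        selected.append(best)
        candidates.remove(best)
    return selected


# Toy scores: sentences 0 and 1 are both relevant but nearly interchangeable.
toy_relevance = [3.0, 2.9, 1.0]
toy_redundancy = [
    [0.0, 2.5, 0.0],
    [2.5, 0.0, 0.0],
    [0.0, 0.0, 0.0],
]
summary_indices = greedy_select(toy_relevance, toy_redundancy, k=2)
```

With the toy scores above, the redundancy penalty steers the second pick away from sentence 1 and toward sentence 2; setting `lam=0.0` removes the penalty and the two most relevant sentences are chosen instead.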