首页    期刊浏览 2024年09月15日 星期日
登录注册

文章基本信息

  • 标题:Similarity-Based Estimation for Document Summarization using Fuzzy Sets
  • 本地全文:下载
  • 作者:Miss Masrah Azrifah Azmi Murad ; Mr. Trevor Martin
  • 期刊名称:International Journal of Computer Science and Security (IJCSS)
  • 电子版ISSN:1985-1553
  • 出版年度:2007
  • 卷号:1
  • 期号:4
  • 页码:1-12
  • 出版社:Computer Science Journals
  • 摘要:Information is increasing every day and thousands of documents are produced and made available in the Internet. The amount of information available in documents exceeds our capacity to read them. We need access to the right information without having to go through the whole document. Therefore, documents need to be compressed and produce an overview so that these documents can be utilized effectively. Thus, we propose a similarity model with topic similarity using fuzzy sets and probability theories to extract the most representative sentences. Sentences with high weights are extracted to form a summary. On average, our model (known as MySum) produces summaries that are 60% similar to the manually created summaries, while tf.isf algorithm produces summaries that are 30% similar. Two human summarizers, named P1 and P2, produce summaries that are 70% similar to each other using similar sets of documents obtained from TREC.
  • 关键词:fuzzy sets; mass assignment; asymmetric word similarity; topic similarity; summarization
国家哲学社会科学文献中心版权所有