首页    期刊浏览 2024年12月01日 星期日
登录注册

文章基本信息

  • 标题:Document Summarization and Classification using Concept and Context Similarity Analysis
  • 本地全文:下载
  • 作者:J.Arun ; C. Gunavathi M.E
  • 期刊名称:International Journal of Innovative Research in Science, Engineering and Technology
  • 印刷版ISSN:2347-6710
  • 电子版ISSN:2319-8753
  • 出版年度:2014
  • 期号:ICETS
  • 页码:1256
  • 出版社:S&S Publications
  • 摘要:“Document summarization andclassification using concept and context similarityanalysis’’ deals with an information retrieval task,which aims at extracting a condensed version of theoriginal document. A document summary is usefulsince it can give an overview of the original documentin a shorter period of time. The main goal of asummary is to present the main ideas in adocument/set of documents in a short and readableparagraph. Classification is a data mining functionthat assigns items in a collection to target categoriesof the documents. Context sensitive documentindexing model based on the Bernoulli model ofrandomness is used for document summarizationprocess. The lexical association between terms is usedto produce a context sensitive weight to the documentterms. The context sensitive indexing weights areused to compute the sentence similarity matrix and asa result, the sentences are presented in such a waythat the most informative sentences appear on the topof the summary, making a positive impact on thequality of the summary
  • 关键词:Document indexing; Lexical;association; Bernoulli model of randomness
国家哲学社会科学文献中心版权所有