期刊名称:International Journal of Innovative Research in Science, Engineering and Technology
印刷版ISSN:2347-6710
电子版ISSN:2319-8753
出版年度:2014
期号:ICETS
页码:1256
出版社:S&S Publications
摘要:“Document summarization andclassification using concept and context similarityanalysis’’ deals with an information retrieval task,which aims at extracting a condensed version of theoriginal document. A document summary is usefulsince it can give an overview of the original documentin a shorter period of time. The main goal of asummary is to present the main ideas in adocument/set of documents in a short and readableparagraph. Classification is a data mining functionthat assigns items in a collection to target categoriesof the documents. Context sensitive documentindexing model based on the Bernoulli model ofrandomness is used for document summarizationprocess. The lexical association between terms is usedto produce a context sensitive weight to the documentterms. The context sensitive indexing weights areused to compute the sentence similarity matrix and asa result, the sentences are presented in such a waythat the most informative sentences appear on the topof the summary, making a positive impact on thequality of the summary
关键词:Document indexing; Lexical;association; Bernoulli model of randomness