出版社:Academy & Industry Research Collaboration Center (AIRCC)
摘要:In this paper, we propose a new approach based on the detection of opinion by theSentiWordNet for the production of text summarization by using the scoring extractiontechnique adapted to detecting of opinion. The texts are decomposed into sentences thenrepresented by a vector of scores of opinion of this sentences. The summary will be done byelimination of sentences whose opinion is different from the original text. This difference isexpressed by a threshold opinion. The following hypothesis: "textual units that do not share thesame opinion of the text are ideas used for the development or comparison and their absenceshave no vocation to reach the semantics of the abstract" Has been verified by the statisticalmeasure of Chi_2 which we used it to calculate a dependence between the unit textual and thetext. Finally we found an opinion threshold interval which generate the optimal assessments.
关键词:Automatic Summary Extraction; Text Mining; Evaluation; Automatic Language Processing; FMeasure;correlation; ROUGE-SU (2); SentiWordNet; Opinion Mining.