Journal: Bonfring International Journal of Software Engineering and Soft Computing
Print ISSN: 2250-1045
Electronic ISSN: 2277-5099
Year of publication: 2018
Volume: 8
Issue: 2
Pages: 01-06
DOI: 10.9756/BIJSESC.8387
Language: English
Publisher: Bonfring
Abstract: This paper describes query facets, a specific type of summary that describes the main topic of a given text. Existing summarization algorithms are classified into different categories in terms of their summary construction methods (abstractive or extractive), the number of sources for the summary (single document or multiple documents), the types of information in the summary (indicative or informative), and the relationship between summary and query (generic or query-based). QD Miner aims to offer the possibility of finding the main points of multiple documents and thus save users' time on reading whole documents. The difference is that most existing summarization systems dedicate themselves to generating summaries using sentences extracted from documents. In addition, QD Miner returns multiple groups of semantically related items, while those systems return a flat list of sentences. Adding these lists may improve both the accuracy and the recall of query facets. Part-of-speech information can be used to check the homogeneity of lists and improve the quality of query facets. Side information cannot simply be incorporated into the mining process, because it can either improve the quality of the representation for the mining process or add noise to it. Therefore, a principled way of performing the mining process is required, so as to maximize the advantages of using this side information. This paper proposes an algorithm that combines classical partitioning algorithms with probabilistic models in order to create an effective clustering approach.
Keywords: Data Mining; Classification; TF-IDF; K-Means Clustering; Statistical Mean Validation.