期刊名称:International Journal of Engineering and Computer Science
印刷版ISSN:2319-7242
出版年度:2015
卷号:4
期号:7
页码:13086-13090
出版社:IJECS
摘要:In many text mining applications, the side-information contained within the text document will contribute to enhance the overallclustering process. The proposed algorithm performs clustering of data along with the side information, by combining classical partitioningalgorithms with probabilistic models to boost the efficacy of the clustering approach. The clusters generated will be used as a training modelto solve the classification problem. The proposed work will also make use of a similarity based ontology algorithm, by incorporating twoshared word spaces, to perk up the clustering approach. This will boost the amount of knowledge gained from text documents by includingontology with side-information.
关键词:Clustering; Data Mining; Ontology; Side-Information