期刊名称:International Journal of Electronics, Communication and Soft Computing Science and Engineering
印刷版ISSN:2277-9477
出版年度:2015
卷号:4
期号:Special 2
出版社:IJECSCSE
摘要:To find the appropriate nu mber of clusters and to partitioned the documents is crucial in document clustering. In this paper we will focus on various clustering techniques and our proposed system is to discover the cluster structure without giving the total nu mber of clusters as input. Document features or even we can say that the various attributes will be with no human interference separated into two groups, in particular, discriminative words and nondiscriminative words, and contribute differently to document clustering. There is variational inference algorithm in which we infer the document collection structure and words at the same time partition of document. our proposed approach for the semisupervised document clustering. Semi-supervised clustering lies between both automatic categorization and auto-organization. Here the supervisor need not specifies a set of classes, but only to provide a set of texts grouped by the criteria to be used to generate Clusters.