期刊名称:Oriental Journal of Computer Science and Technology
印刷版ISSN:0974-6471
出版年度:2011
卷号:4
期号:2
页码:371-377
出版社:Oriental Scientific Publishing Company
摘要:This paper presents a technique to improve the quality of Document Clustering based on Word Set Concept. The proposed Technique WDC (word set based document clustering), a clustering algorithm work with to obtain clustering of comparable quality significantly more efficiently more than the state of the art text clustering algorithm. The proposed WDC algorithms utilize the semantic relation ship between words to create concepts. The Word sets based Document Clustering (WDC) obtains clustering of comparable quality significantly more efficiently than state-of-art approach is efficient and give more accurate clustering result than the other methods.
关键词:Document clustering; Frequent concept; Word set.