期刊名称:International Journal of Advanced Research In Computer Science and Software Engineering
印刷版ISSN:2277-6451
电子版ISSN:2277-128X
出版年度:2013
卷号:3
期号:7
出版社:S.S. Mishra
摘要:Document clustering is an e.ective tool to manage information overload. By grouping similar documents together, we enable a human observer to quickly browse large document collections[18], make it possible to easily grasp the distinct topics and subtopics (concept hierarchies) in them, allow search engines to e.ciently query large document collections [16] among many other applications. Hence, it has been widely studied as a part of the broad literature of data clustering. One such survey of existing clustering literature can be found in Jain et. al[19s].