期刊名称:Indian Journal of Computer Science and Engineering
印刷版ISSN:2231-3850
电子版ISSN:0976-5166
出版年度:2010
卷号:1
期号:2
页码:82-85
出版社:Engg Journals Publications
摘要:Text mining is powerful tool to find useful and needed information from huge data set. For context based text mining, keyphrases are used. Keyphrases provide brief summary about the contents of documents. In document clustering, number of total cluster is not known in advance. In K-means, if prespecified number of clusters modified, the precision of each result is also modified. Therefore Kea ,is algorithm for automatically extracting keyphrases from text is used. In this kea algorithm, number of clusters is automatically determined by using extracted keyphrases. Kea-means clustering algorithm provide easy and efficient way to extract test document from large quantity of resources. Keyphrase play important role in text indexing, summarization and categorization. Keyphrases are selected manually. Assigning keyphrases manually is tedious process that requires knowledge of subject. Therefore automatic extraction techniques are most useful.