期刊名称:International Journal of Engineering and Computer Science
印刷版ISSN:2319-7242
出版年度:2014
卷号:3
期号:10
页码:9002-9006
出版社:IJECS
摘要:Many text mining applications contains side-information along with the text documents. Many web documents consistof meta-data with them which correspond to various different kinds of attributes such as the origin or other information related tothe origin of the document. Data such as location, possession or even temporal information may prove to be informative formining purposes in other cases. Such side-information may contain a huge amount of information. This huge amount ofinformation may be used for performing clustering.However, it may be difficult to compute the importance of this side-information, especially when some of the informationfrom it is noisy. When the information is noisy it can be a risky approach for performing mining process along with the sideinformation, because it can actually worsen the quality of mining process. This is why we need a principled way for performingthe mining process, so that the advantages from using this side information can be maximized. We will do mining and clusteringusing the side information and iterative clustering and clusters will be formed. From these clusters we will search the desiredkeyword using user behavior, localization, personalization