首页    期刊浏览 2024年10月07日 星期一
登录注册

文章基本信息

  • 标题:IMPROVING EFFICIENCY OF TEXTUAL STATIC WEB CONTENT MINING USING CLUSTERING TECHNIQUES
  • 本地全文:下载
  • 作者:R. MANIKANDAN
  • 期刊名称:Journal of Theoretical and Applied Information Technology
  • 印刷版ISSN:1992-8645
  • 电子版ISSN:1817-3195
  • 出版年度:2011
  • 卷号:33
  • 期号:2
  • 页码:193-196
  • 出版社:Journal of Theoretical and Applied
  • 摘要:There are several efficient methods for the discovery or mining of various types of data, methods devised for mining textual static web content have always been proved less efficient due to the data's ambiguous , unclassified, unstructured or un-clustered nature. Various association rule mining algorithms like Generalized pattern algorithm are being implemented to mine the web content but again due to the above setbacks the efficiency expected from the algorithm is not obtained. Since the dip in the efficiency of these algorithms is amounted to the nature of the textual web content, an algorithm which may deal with , if not all the anomalies at least the un-clustered nature of the content may increase the efficiency drastically. This paper emphasizes the same point, making the assumptions that the web content is static and there is at least one common pattern found in the given datasets.
  • 关键词:Textual Web Content; Un-clustered Nature; Generalized Pattern Algorithm; Datasets; Clustering Algorithm
国家哲学社会科学文献中心版权所有