首页    期刊浏览 2024年11月28日 星期四
登录注册

文章基本信息

  • 标题:A Novel Approach for Web Page Set Mining
  • 本地全文:下载
  • 作者:R.B.Geeta ; Omkar Mamillapalli ; Prasad Reddy P.V.G.D
  • 期刊名称:International Journal of Web & Semantic Technology
  • 印刷版ISSN:0976-2280
  • 电子版ISSN:0975-9026
  • 出版年度:2011
  • 卷号:2
  • 期号:4
  • DOI:10.5121/ijwest.2011.2412
  • 出版社:Academy & Industry Research Collaboration Center (AIRCC)
  • 摘要:The one of the most time consuming steps for association rule mining is the computation of the frequency of the occurrences of itemsets in the database. The hash table index approach converts a transaction database to an hash index tree by scanning the transaction database only once. Whenever user requests for any Uniform Resource Locator (URL), the request entry is stored in the Log File of the server. This paper presents the hash index table structure, a general and dense structure which provides web page set extraction from Log File of server. This hash table provides information about the original database. Web Page set mining (WPs-Mine) provides a complete representation of the original database. This approach works well for both sparse and dense data distributions. Web page set mining supported by hash table index shows the performance always comparable with and often better than algorithms accessing data on flat files. Incremental update is feasible without reaccessing the original transactional database.
  • 关键词:Web mining; URL; Web Pages set extraction; HTTP transaction; Log File.
国家哲学社会科学文献中心版权所有