首页    期刊浏览 2024年11月29日 星期五
登录注册

文章基本信息

  • 标题:Knowledge Discovery from Web Usage Data: Complete Preprocessing Methodology
  • 本地全文:下载
  • 作者:G T Raju ; P S Satyanarayana
  • 期刊名称:International Journal of Computer Science and Network Security
  • 印刷版ISSN:1738-7906
  • 出版年度:2008
  • 卷号:8
  • 期号:1
  • 页码:179-186
  • 出版社:International Journal of Computer Science and Network Security
  • 摘要:The exponential growth of the Web in terms of Web sites and their users during the last decade has generated huge amount of data related to the user��s interactions with the Web sites. This data is recorded in the Web access log files of Web servers and usually referred as Web Usage Data (WUD). Knowledge Discovery from Web Usage Data (KDWUD) is that area of Web mining deals with the application of data mining techniques to extract interesting knowledge from the WUD. As Web sites continue to grow in size and complexity, the results of KDWUD have become very critical for efficient and effective management of the activities related to: e-business, e-education, e-commerce, personalization, website design & management, network traffic analysis, the cache, the proxies, great diversity of Web pages in a site, search engine��s complexity, and to predict user��s actions. In this paper, we propose a complete preprocessing methodology, one of the important steps in KDWUD process. Several heuristics have been proposed for cleaning the WUD which is then aggregated and recorded in the relational data model. To validate the efficiency of the proposed preprocessing methodology, several experiments were conducted and the results shows that the proposed methodology reduces the size of Web access log files down to 73-82% of the initial size and offer richer logs that are structured for further stages of KDWUD.
  • 关键词:Preprocessing; Knowledge Discovery; Web Usage Data; Web Usage Mining.
国家哲学社会科学文献中心版权所有