Basic Article Information

  • Title: Fast Real Time Analysis of Web Server Massive Log Files using an Improved Web Mining Architecture
  • Authors: Rajamanickam, Ramesh; Kavitha, C.
  • Journal: Journal of Computer Science
  • Print ISSN: 1549-3636
  • Publication year: 2013
  • Volume: 9
  • Issue: 6
  • Pages: 771-779
  • DOI: 10.3844/jcssp.2013.771.779
  • Publisher: Science Publications
  • Abstract: The web plays a vital role in finding information and in understanding how a system should be organized. As the number of web sites has grown, web log files have grown with it, driven by web search activity. The challenge is to reduce the log files and classify the results that best serve the task at hand. To overcome the problem of excess data in web mining, the study proposes path extraction using a Euclidean distance based algorithm, combined with a sequential pattern clustering mining algorithm. First, a Relational Information System is constructed from the original data sets. Second, the data are clustered with the Sequential Pattern Clustering Method to produce the Core of the Information System. The web mining core data is the most important and necessary information, which cannot be reduced further from the original Information System. It can therefore give the same effect as the original data sets in data analysis and can be used to construct classification models. Third, the Sequential Pattern Clustering Method is applied with the help of Path Extraction. The experiment shows that the proposed algorithm achieves high efficiency and avoids excess data in follow-up data processing.
  • Keywords: Path Completion; Cleanup the Data; Data Preprocessor; Travel Path Extraction; Sequential Pattern Clustering Method