Journal: International Journal of Computer Science Issues
Print ISSN: 1694-0784
Electronic ISSN: 1694-0814
Publication year: 2015
Volume: 12
Issue: 6
Publisher: IJCSI Press
Abstract: The Internet offers a huge volume of data to its users and grows rapidly every day. A web server creates log files that record details such as the requested page, the user's IP address, the browser and operating system used, and a date/time stamp. These records capture browsing patterns, and web usage mining is applied to them to extract useful information. The primary objective of this paper is to find the low-hit pages of a website from its log files by detecting outliers in sequential pattern mining. To meet this objective, a new algorithm named "Detect Anomaly in Sequential Pattern Algorithm (DASPAT)" is proposed. The proposed algorithm generates candidates using an Apriori-like approach and discovers the unusual browsing behavior (UBB) of users; the detected UBB are treated as outliers. The paper thus introduces a new approach to finding low-hit web pages and, in tandem, helps designers understand how users browse the site so that they can redesign the website.
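
The abstract does not give the details of DASPAT, so the following is only a minimal sketch of the general idea it describes: level-wise, Apriori-like candidate generation over browsing sessions, with rarely occurring sequences reported as outliers. It is not the authors' algorithm. The assumptions are that the log has already been split into per-user sessions (ordered lists of page names), that a pattern is a contiguous run of pages, and that "rare" means seen at least once but in fewer than `min_support` sessions. The names `count_support`, `join`, `mine`, `min_support`, and `max_len` are hypothetical.

```python
from collections import Counter


def count_support(sessions, candidates, k):
    """Count, for each candidate, the number of sessions containing it
    as a contiguous run of k pages."""
    counts = Counter()
    for session in sessions:
        seen = {tuple(session[i:i + k]) for i in range(len(session) - k + 1)}
        counts.update(seen & candidates)
    return counts


def join(frequent):
    """Apriori-like join: extend sequence a with the last page of b
    when a without its first page equals b without its last page."""
    return {a + (b[-1],) for a in frequent for b in frequent if a[1:] == b[:-1]}


def mine(sessions, min_support, max_len=3):
    """Return (frequent, rare) dicts mapping page sequences to session counts.

    Assumption: 'rare' sequences (0 < count < min_support) stand in for the
    unusual browsing behavior that the abstract treats as outliers.
    """
    candidates = {(page,) for session in sessions for page in session}
    frequent, rare = {}, {}
    k = 1
    while candidates and k <= max_len:
        counts = count_support(sessions, candidates, k)
        level_frequent = set()
        for cand in candidates:
            n = counts[cand]
            if n >= min_support:
                frequent[cand] = n
                level_frequent.add(cand)
            elif n > 0:
                rare[cand] = n
        # Only frequent sequences are extended, as in Apriori pruning.
        candidates = join(level_frequent)
        k += 1
    return frequent, rare


# Hypothetical sessions, as if already reconstructed from a server log.
sessions = [
    ["home", "products", "cart"],
    ["home", "products", "cart"],
    ["home", "products", "about"],
    ["home", "about", "legal"],
]
frequent, rare = mine(sessions, min_support=2)
low_hit_pages = sorted(seq[0] for seq in rare if len(seq) == 1)
print(low_hit_pages)
```

With this toy input, the single-page sequences in `rare` correspond to low-hit pages (here only `legal`), while longer rare sequences such as `("home", "about")` are unusual navigation paths. Because only frequent sequences are extended, paths that start from a rare page are never generated; whether DASPAT prunes in the same way is not stated in the abstract.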