Basic Article Information

  • Title: Web Users Session Analysis Using DBSCAN and Two Phase Utility Mining Algorithms
  • Authors: G. Sunil Kumar; C.V.K Sirisha; Kanaka Durga.R
  • Journal: International Journal of Soft Computing & Engineering
  • Electronic ISSN: 2231-2307
  • Year of publication: 2012
  • Volume: 1
  • Issue: 6
  • Pages: 396-401
  • Publisher: International Journal of Soft Computing & Engineering
  • Abstract: One of the important issues in data mining is the interestingness problem. Typically, in a data mining process, the number of patterns discovered can easily exceed the capacity of a human user to identify interesting results. To address this problem, utility measures have been used to reduce the patterns before presenting them to the user. A frequent itemset reflects only the statistical correlation between items; it does not reflect their semantic significance. The proposed approach uses utility-based itemset mining to overcome this limitation. The proposed system first applies the DBSCAN clustering algorithm, which identifies the behavior of users' page visits and the order in which the visits occur. After clustering, the Two-Phase high-utility mining algorithm is applied to find itemsets that contribute high utility. Mining web access sequences can discover very useful knowledge from web logs, with broad applications. Mining useful Web path traversal patterns is an important research issue in Web technologies. Knowledge of the frequent Web path traversal patterns enables us to discover the most interesting websites traversed by users. However, considering only the binary (presence/absence) occurrences of websites in the Web traversal paths may not reflect real-world scenarios. Therefore, if we consider the time spent by each user as the utility value of a website, more interesting web traversal paths can be discovered using the proposed two-phase algorithm. User page visits are sequential in nature. In this paper, the MSNBC web navigation dataset is used to compare efficiency and performance; a central task in web usage mining is finding groups of users that share common interests. General Terms: Web session mining, log analysis.
  • Keywords: Web usage mining; Itemset; DBSCAN; Association rules.
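
The abstract's idea of treating time spent on a page as its utility can be illustrated with a minimal sketch of Two-Phase utility mining. This is not the authors' implementation: the session data, the page names, and the helper names (`SESSIONS`, `itemset_utility`, `two_phase`) are hypothetical, and the DBSCAN clustering step is omitted. Phase 1 prunes candidates level by level using the transaction-weighted utilization (TWU), which is an upper bound on utility; phase 2 rescans the sessions to compute exact utilities.

```python
from itertools import combinations

# Hypothetical sessions: page category -> seconds spent (illustrative data only).
SESSIONS = [
    {"frontpage": 30, "news": 120, "sports": 10},
    {"frontpage": 20, "news": 90},
    {"news": 60, "weather": 15},
    {"frontpage": 25, "sports": 200},
    {"frontpage": 10, "news": 150, "weather": 5},
]


def itemset_utility(itemset, session):
    """Time spent on the itemset's pages in one session, or 0 if any page is absent."""
    if all(page in session for page in itemset):
        return sum(session[page] for page in itemset)
    return 0


def two_phase(sessions, min_utility):
    """Return {sorted page tuple: utility} for itemsets with utility >= min_utility."""
    session_utils = [sum(s.values()) for s in sessions]

    def twu(itemset):
        # Sum of whole-session utilities over sessions containing the itemset.
        return sum(
            total
            for s, total in zip(sessions, session_utils)
            if all(page in s for page in itemset)
        )

    # Phase 1: level-wise search; TWU is downward closed, so pruning is safe.
    pages = sorted({page for s in sessions for page in s})
    level = [frozenset([p]) for p in pages if twu((p,)) >= min_utility]
    candidates = []
    while level:
        candidates.extend(level)
        k = len(level[0]) + 1
        level_set = set(level)
        next_level = set()
        for a, b in combinations(level, 2):
            union = a | b
            if len(union) != k:
                continue
            # Every (k-1)-subset must itself have survived the previous level.
            if all(frozenset(sub) in level_set for sub in combinations(union, k - 1)):
                if twu(union) >= min_utility:
                    next_level.add(union)
        level = sorted(next_level, key=sorted)

    # Phase 2: compute exact utilities and drop the overestimated candidates.
    result = {}
    for cand in candidates:
        utility = sum(itemset_utility(cand, s) for s in sessions)
        if utility >= min_utility:
            result[tuple(sorted(cand))] = utility
    return result


high_utility = two_phase(SESSIONS, min_utility=300)
print(high_utility)
```

With this toy data and a threshold of 300 seconds, `frontpage` appears in four of the five sessions but accumulates only 85 seconds, so it is frequent yet not high-utility on its own. The itemsets `("news",)` and `("frontpage", "news")` both reach 420 seconds and are the only ones returned.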