期刊名称:International Journal of Advanced Research in Computer Engineering & Technology (IJARCET)
印刷版ISSN:2278-1323
出版年度:2012
卷号:1
期号:4
页码:625-634
出版社:Shri Pannalal Research Institute of Technolgy
摘要:World Wide Web is a huge repository of web pages and links. It provides abundance of information for the Internet users. The growth of web is tremendous as approximately one million pag es are added daily. Users' accesses are recorded in web logs. Because of the tremendous usage of web, the web log files are growing at a faster rate and the size is becoming huge. Web data mining is the application of data mining techniques in web data. Web Usage Mining applies mining techniques in log data to extract the behavior of users which is used in various applications like personalized services, adaptive web sites, customer profiling, prefetching, creating attractive web sites etc., The rapid growth in the amount of information and the number of users has lead to difficulty in providing effective search services for the web users and increased web latency; resulting in decreased web performance. Although web performance can be improved by caching, the benefit of using it is rather limited owing to filling the cache with documents without any prior knowledge .Web pre-fetching becomes an attractive solution wherein forthcoming page accesses of a client are predicted, based on log information. This paper proposes an approach for increasing web performance by analyzing and predicting user behavior both by collaborating information from user access log and website structure repository.
关键词:Web Mining; Web Content Mining; Web ; Structure Mining; Web Usage Mining; Data Cleaning; User ; Identification; Session Identification; Path Completion ; ; Prefetching and Markov Model