期刊名称:International Journal of Advanced Research in Computer Engineering & Technology (IJARCET)
印刷版ISSN:2278-1323
出版年度:2013
卷号:2
期号:4
页码:1596-1600
出版社:Shri Pannalal Research Institute of Technolgy
摘要:Web log data is one of the major source which contain all the information regarding the users visited links browsing pattern, time spent on a particular page. A web server log file contain information about user name, ip address, date time, byte transferred, access request. A web log file range is 1kb to 100kb. Many interesting pattern available in web log data. But it is very complex process to extract the interesting pattern without preprocessed phase. Preprocessing step is used to give a reliable input for data mining task. In this paper we present the data preprocessing method for improving the efficiency & ease of mining process.
关键词:Data preprocessing; Data ; cleaning ;web log file; web usage mining ; session identification; user identification