Article Information

  • Title: DATA WAREHOUSING AND PHASES USED IN INTERNET MINING
  • Authors: Jitender Ahlawat; Joni Birla; Mohit Yadav
  • Journal: International Journal of Computer Science and Management Studies
  • Electronic ISSN: 2231-5268
  • Year of publication: 2011
  • Volume: 11
  • Issue: 2
  • Publisher: Imperial Foundation
  • Abstract: In this paper, we describe data warehousing and data mining. Data warehousing is the process of storing data on a large scale, and data mining is the process of analyzing data from different perspectives and summarizing it into useful information that can be used to increase revenue, cut costs, or both. As massive amounts of data are continuously collected and stored, many industries are becoming interested in mining patterns (association rules, correlations, clusters, etc.) from their databases. Association rule mining is an important task used to find frequent itemsets in a customer transaction database, where each transaction consists of the items purchased by a customer in a single visit. Internet mining is the application of data mining techniques to discover patterns from the Internet. Internet Usage Mining (IUM) is the application of data mining techniques to web data. The data sources are mainly web server logs, proxy server logs, and cookies stored on the user’s computer. IUM consists of three phases: preprocessing, pattern discovery, and pattern analysis. This paper describes these phases in detail. An introduction to Internet mining is also provided as background.
  • Keywords: Data warehousing and its architectures; Data Mining; Techniques of Data Mining; Internet mining