期刊名称:International Journal of Innovative Research in Computer and Communication Engineering
印刷版ISSN:2320-9798
电子版ISSN:2320-9801
出版年度:2015
卷号:3
期号:2
DOI:10.15680/ijircce.2015.0302165
出版社:S&S Publications
摘要:This document defines data mining as advanced methods for exploring and modeling relationships inlarge amounts of data. Your data often comes from several different sources, and combining information from thesedifferent sources may present quite a challenge. The need for better and quicker access to information has generated agreat deal of interest in building data warehouses that are able to quickly assemble and deliver the needed informationin usable form. To download documentation that discusses the Enterprise Miner add-ins to SAS/WarehouseAdministrator, go to the SAS Customer Support Center Web site (http://support.sas.com). From SoftwareDownloads, select Product and Solution Updates. From the Demos and Downloads page, selectSAS/Warehouse Administrator Software, and download the version that you want. A typical data set has manythousand observations. An observation may represent an entity such as an individual customer, a specific transaction, ora certain household. Variables in the data set contain specific information such as demographic information, saleshistory, or financial information for each observation. How this information is used depends on the research question ofinterest. Ordinal variables may be treated as nominal variables, if you are not interested in the ordering of the levels.However, nominal variables cannot be treated as ordinal variables since there is no implied ordering by definition. Toobtain a meaningful analysis, you must construct an appropriate data set and specify the correct measurement level foreach of the variables.
关键词:Interval — a variable for which the mean (or average); Categorical — a variable consisting of a set;of levels; Unary — a variable that has the same value for every observation in the data set; Binary — a variable that;has only two possible levels; Nominal — a variable that has more than two levels; Ordinal — a variable that has more;than two levels.