期刊名称:Information Technology and Management Science
印刷版ISSN:2255-9086
电子版ISSN:2255-9094
出版年度:2015
卷号:18
期号:1
页码:97-102
DOI:10.1515/itms-2015-0015
语种:English
出版社:Walter de Gruyter GmbH
摘要:Data mining methods are applied to a medical task that seeks for the information about the influence of Helicobacter Pylori on the gastric cancer risk increase by analysing the adverse factors of individual lifestyle. In the process of data preprocessing, the data are cleared of noise and other factors, reduced in dimensionality, as well as transformed for the task and cleared of non-informative attributes. Data classification using C4.5, CN2 and k-nearest neighbour algorithms is carried out to find relationships between the analysed attributes and the descriptive class attribute – Helicobacter Pylori presence that could have influence on the cancer development risk. Experimental analysis is carried out using the data of the Latvian-based project “Interdisciplinary Research Group for Early Cancer Detection and Cancer Prevention” database.
关键词:Classification ; data pre-processing ; gastric cancer risk analysis