文章基本信息

标题：Feature selection for genomic data.
本地全文：下载
作者：Paola CERCHIELLO ; Silvia FIGINI
期刊名称：la Revue de MODULAD
印刷版ISSN：1769-7387
出版年度：2007
卷号：2007
期号：36
页码：147-153
出版社：Association MODULAD
摘要：Building predictive models for genomic mining requires feature selection, as an essential preliminary step to reduce the large number of available variable. Feature selection in the process of select a generally smaller subset of variables (features) that can be considered the best, from a statistical point of view, with respect to the employed model for the analysis. In gene expression microarray data, being able to select a few number of important genes not only makes data analysis efficient but also helps their biological interpretation. Microarray data have typically several thousands of genes (features) but only tens of samples. Problems which can occur due to the small sample size have not been addressed well in the literature. Our aim is to discuss some issues on feature selection applied to microarray data in order to select the most important genes from a predictive point of view
关键词：Feature selection, Gene expression, Marker Selection, Kruskal-Wallis test, Model Assessment, Predictive models.