Article information

  • Title: A Review of Matched-pairs Feature Selection Methods for Gene Expression Data Analysis
  • Authors: Sen Liang ; Anjun Ma ; Sen Yang
  • Journal: Computational and Structural Biotechnology Journal
  • Print ISSN: 2001-0370
  • Year of publication: 2018
  • Volume: 16
  • Pages: 88-97
  • DOI: 10.1016/j.csbj.2018.02.005
  • Language:
  • Publisher: Computational and Structural Biotechnology Journal
  • Abstract: With the rapid accumulation of gene expression data from various technologies, e.g., microarray, RNA-sequencing (RNA-seq), and single-cell RNA-seq, dimensionality reduction and feature (signature gene) selection are necessary to make sense of such high-dimensional data. These computational methods significantly facilitate further data analysis and interpretation, such as gene function enrichment analysis, cancer biomarker detection, and drug target identification in precision medicine. Although numerous feature selection methods have been developed in bioinformatics, it remains a challenge to choose the appropriate method for a specific problem and to obtain the most reasonable feature ranking. Meanwhile, paired gene expression data from matched case-control designs (MCCD) are becoming increasingly popular; such data are often used in multi-omics integration studies and may increase feature selection efficiency by offsetting the similar distributions of confounding features. However, feature selection methods designed specifically for paired data, termed matched-pairs feature selection (MPFS), have not matured in parallel. In this review, we compare the performance of 10 feature selection methods (eight MPFS methods and two traditional unpaired methods) on two real datasets by applying three classification methods, and we analyze the algorithmic complexity of these methods by running their programs. This review aims to summarize and comprehensively present MPFS so that readers can easily understand its characteristics and find guidance in selecting appropriate methods for their analyses.
  • Keywords: Matched-pairs feature selection ; Matched case-control design ; Paired data ; Gene expression