首页    期刊浏览 2024年11月24日 星期日
登录注册

文章基本信息

  • 标题:Parallelized Classification of Cancer Sub-types from Gene Expression Profiles Using Recursive Gene Selection
  • 本地全文:下载
  • 作者:Lokeswari VENKATARAMANA ; Shomona Gracia JACOB ; Rajavel RAMADOSS
  • 期刊名称:Studies in Informatics and Control Journal
  • 印刷版ISSN:1220-1766
  • 出版年度:2018
  • 卷号:27
  • 期号:2
  • 页码:213-222
  • DOI:10.24846/v27i2y201809
  • 出版社:National Institute for R&D in Informatics
  • 摘要:Cancer is a chronic disease that is caused mainly by irregularities in genes. It is important to identify such oncogenes that cause cancer. Biological data like gene expressions, protein sequences, RNA-sequences, pathway analysis, Pan-cancer analysis and structural biomarkers could aid in cancer diagnosis, classification and prognosis. This research focuses on classifying subtypes of cancer using Microarray Gene Expression (MGE) levels. Nature of MGE data is multidimensional with very few samples. It is necessary to perform dimensionality reduction to select the relevant genes and remove the redundant ones. The Recursive Feature Selection (RFS) method is proposed as it repeatedly performs the gene selection process until the best gene subset is found. The obtained best subset of genes is further employed for classification using different models and evaluated using 10-fold cross-validation. In order to scale for huge amount of gene expression data, the parallelized classification model was explored on the Spark framework. A comparison was drawn between the non-parallelized classification model on Weka and the parallelized classification model on Spark. The results revealed that the parallelized classification model performs better than non-parallelized classification model in terms of accuracy and execution time. Further, the performance of RFS and parallelized classifier was also compared with previous approaches. The proposed RFS and parallelized classifier outperformed previous methods.
  • 关键词:Recursive Feature Selection; Gene Selection; Microarray Gene Expression; Parallelized classification; Random Forest
国家哲学社会科学文献中心版权所有