首页    期刊浏览 2024年11月25日 星期一
登录注册

文章基本信息

  • 标题:A New Data Imputing Algorithm
  • 本地全文:下载
  • 作者:Ahmed Sobhy Sherif ; Hany Harb ; Sherif Zaky
  • 期刊名称:International Journal of Computer Science Issues
  • 印刷版ISSN:1694-0784
  • 电子版ISSN:1694-0814
  • 出版年度:2011
  • 卷号:8
  • 期号:3
  • 出版社:IJCSI Press
  • 摘要:DNA microarray analysis has become the most widely used functional genomics approach in the bioinformatics field. Microarray gene expression data often contains missing values due to various reasons. Clustering gene expression data algorithms requires having complete information. This means that there should not be any missing values. In this paper, a clustering method is proposed, called "Clustering Local Least Square Imputation method (ClustLLsimpute)", to estimate the missing values. In ClustLLsimpute, a complete dataset is obtained by removing each row with missing values. K clusters and their centroids are obtained by applying a non-parametric clustering technique on the complete dataset. Similar genes to the target gene (with missing values) are chosen as the smallest Euclidian distance to the centroids of each cluster. The target gene is represented as a linear combination of similar genes. Undertaken experiments proved that this algorithm is more accurate than the other algorithms, which have been introduced in the literature.
  • 关键词:Missing Values; Imputation; Microarray; Regression
国家哲学社会科学文献中心版权所有