首页    期刊浏览 2025年06月30日 星期一
登录注册

文章基本信息

  • 标题:Taxonomically Clustering Organisms Based on the Profiles of Gene Sequences Using PCA
  • 本地全文:下载
  • 作者:Ramaraj, E. ; Punithavalli, M.
  • 期刊名称:Journal of Computer Science
  • 印刷版ISSN:1549-3636
  • 出版年度:2006
  • 卷号:2
  • 期号:3
  • 页码:292-296
  • DOI:10.3844/jcssp.2006.292.296
  • 出版社:Science Publications
  • 摘要:The biological implications of bioinformatics can already be seen in various implementations. Biological taxonomy may seem like a simple science in which the biologists merely observe similarities among organisms and construct classifications according to those similarities[1], but it is not so simple. By applying data mining techniques on gene sequence database we can cluster the data to find interesting similarities in the gene expression data. One of the applications of such kind of clustering is taxonomically clustering the organisms based on their gene sequential expressions. In this study we outlined a method for taxonomical clustering of species of the organisms based on the genetic profile using Principal Component Analysis and Self Organizing Neural Networks. We have implemented the idea using Matlab and tried to cluster the gene sequences taken from PAUP version of the ML5/ML6 database. The taxa used for some of the basidiomycetous fungi form the database. To study the scalability issues another large gene sequence database was used. The proposed method clustered the species of organisms correctly in almost all the cases. The obtained were more significant and promising. The proposed method clustered the species of organisms correctly in almost all the cases. The obtained results were more significant and promising.
  • 关键词:Bioinformatics; taxonomy; gene sequence classification; data mining; data classification; clustering; principal component analysis
国家哲学社会科学文献中心版权所有