首页    期刊浏览 2025年07月05日 星期六
登录注册

文章基本信息

  • 标题:Recursive Clustering Using Different Features Sets for Metagenomic Data
  • 本地全文:下载
  • 作者:Isis Bonet ; Widerman Montoya ; Andrea Mesa-Munera
  • 期刊名称:Journal of Computers
  • 印刷版ISSN:1796-203X
  • 出版年度:2018
  • 卷号:13
  • 期号:8
  • 页码:905-912
  • DOI:10.17706/jcp.13.8.905-912
  • 语种:English
  • 出版社:Academy Publisher
  • 摘要:Metagenomics binning process is a step prior to the taxonomic assignment of metagenomic reads of contigs, which helps to group genome sequences belonging to the same species. In this paper we propose a clustering method that is executed recursively to cluster contigs into groups of same taxa. In each step the method increases the taxonomic level, beginning with a domain and ending with a group that represents the species. The method uses a previous rule-based system to separate virus from the rest of the organism and feature selection algorithms to select different features in each step of the clustering. The clustering is based on k-means++ using Cosine and Jaccard distance, and feature selection on gain information. The proposed method outperforms classic k-means++, achieving 88.15% of purity in clusters.
  • 关键词:Binning process; clustering; feature selection; metagenomics.
国家哲学社会科学文献中心版权所有