首页    期刊浏览 2024年12月01日 星期日
登录注册

文章基本信息

  • 标题:Scaffolder - software for manual genome scaffolding
  • 本地全文:下载
  • 作者:Michael D Barton ; Hazel A Barton
  • 期刊名称:Source Code for Biology and Medicine
  • 印刷版ISSN:1751-0473
  • 电子版ISSN:1751-0473
  • 出版年度:2012
  • 卷号:7
  • 期号:1
  • 页码:4
  • DOI:10.1186/1751-0473-7-4
  • 语种:English
  • 出版社:BioMed Central
  • 摘要:The assembly of next-generation short-read sequencing data can result in a fragmented non-contiguous set of genomic sequences. Therefore a common step in a genome project is to join neighbouring sequence regions together and fill gaps. This scaffolding step is non-trivial and requires manually editing large blocks of nucleotide sequence. Joining these sequences together also hides the source of each region in the final genome sequence. Taken together these considerations may make reproducing or editing an existing genome scaffold difficult. The software outlined here, “Scaffolder,” is implemented in the Ruby programming language and can be installed via the RubyGems software management system. Genome scaffolds are defined using YAML - a data format which is both human and machine-readable. Command line binaries and extensive documentation are available. This software allows a genome build to be defined in terms of the constituent sequences using a relatively simple syntax. This syntax further allows unknown regions to be specified and additional sequence to be used to fill known gaps in the scaffold. Defining the genome construction in a file makes the scaffolding process reproducible and easier to edit compared with large FASTA nucleotide sequences. Scaffolder is easy-to-use genome scaffolding software which promotes reproducibility and continuous development in a genome project. Scaffolder can be found at http://next.gs .
  • 关键词:FASTA Sequence ; Unknown Region ; Genome Scaffold ; Scaffold Sequence ; Command Line Interface
国家哲学社会科学文献中心版权所有