首页    期刊浏览 2024年11月28日 星期四
登录注册

文章基本信息

  • 标题:cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing
  • 本地全文:下载
  • 作者:Toshiki Takeuchi ; Atsuo Yamada ; Takashi Aoki
  • 期刊名称:Source Code for Biology and Medicine
  • 印刷版ISSN:1751-0473
  • 电子版ISSN:1751-0473
  • 出版年度:2016
  • 卷号:11
  • 期号:1
  • 页码:12
  • DOI:10.1186/s13029-016-0058-6
  • 语种:English
  • 出版社:BioMed Central
  • 摘要:Next-generation sequencing can determine DNA bases and the results of sequence alignments are generally stored in files in the Sequence Alignment/Map (SAM) format and the compressed binary version (BAM) of it. SAMtools is a typical tool for dealing with files in the SAM/BAM format. SAMtools has various functions, including detection of variants, visualization of alignments, indexing, extraction of parts of the data and loci, and conversion of file formats. It is written in C and can execute fast. However, SAMtools requires an additional implementation to be used in parallel with, for example, OpenMP (Open Multi-Processing) libraries. For the accumulation of next-generation sequencing data, a simple parallelization program, which can support cloud and PC cluster environments, is required. We have developed cljam using the Clojure programming language, which simplifies parallel programming, to handle SAM/BAM data. Cljam can run in a Java runtime environment (e.g., Windows, Linux, Mac OS X) with Clojure. Cljam can process and analyze SAM/BAM files in parallel and at high speed. The execution time with cljam is almost the same as with SAMtools. The cljam code is written in Clojure and has fewer lines than other similar tools.
  • 关键词:Next-generation sequencing ; DNA ; Parallel processing ; Clojure ; SAM ; BAM
国家哲学社会科学文献中心版权所有