首页    期刊浏览 2024年11月23日 星期六
登录注册

文章基本信息

  • 标题:Rqc: A Bioconductor Package for Quality Control of High-Throughput Sequencing Data
  • 本地全文:下载
  • 作者:Wélliton de Souza ; Benilton de Sá Carvalho ; Iscia Lopes-Cendes
  • 期刊名称:Journal of Statistical Software
  • 印刷版ISSN:1548-7660
  • 电子版ISSN:1548-7660
  • 出版年度:2018
  • 卷号:87
  • 期号:1
  • 页码:1-14
  • DOI:10.18637/jss.v087.c02
  • 语种:English
  • 出版社:University of California, Los Angeles
  • 摘要:As sequencing costs drop with the constant improvements in the field, next-generation sequencing becomes one of the most used technologies in biological research. Sequencing technology allows the detailed characterization of events at the molecular level, including gene expression, genomic sequence and structural variants. Such experiments result in billions of sequenced nucleotides and each one of them is associated to a quality score. Several software tools allow the quality assessment of whole experiments. However, users need to switch between software environments to perform all steps of data analysis, adding an extra layer of complexity to the data analysis workflow. We developed Rqc, a Bioconductor package designed to assist the analyst during assessment of high-throughput sequencing data quality. The package uses parallel computing strategies to optimize large data sets processing, regardless of the sequencing platform. We created new data quality visualization strategies by using established analytical procedures. That improves the ability of identifying patterns that may affect downstream procedures, including undesired sources technical variability. The software provides a framework for writing customized reports that integrates seamlessly to the R/Bioconductor environment, including publication-ready images. The package also offers an interactive tool to generate quality reports dynamically. Rqc is implemented in R and it is freely available through the Bioconductor project (https://bioconductor.org/packages/Rqc/) for Windows, Linux and Mac OS X operating systems.
  • 其他关键词:next-generation sequencing;quality assessment;high-performance computing;R
国家哲学社会科学文献中心版权所有