首页    期刊浏览 2025年07月03日 星期四
登录注册

文章基本信息

  • 标题:Fast and accurate matching of cellular barcodes across short-reads and long-reads of single-cell RNA-seq experiments
  • 本地全文:下载
  • 作者:Ghazal Ebrahimi ; Baraa Orabi ; Meghan Robinson
  • 期刊名称:iScience
  • 印刷版ISSN:2589-0042
  • 出版年度:2022
  • 卷号:25
  • 期号:7
  • 页码:1-14
  • DOI:10.1016/j.isci.2022.104530
  • 语种:English
  • 出版社:Elsevier
  • 摘要:SummarySingle-cell RNA sequencing allows for characterizing the gene expression landscape at the cell type level. However, because of its use of short-reads, it is severely limited at detecting full-length features of transcripts such as alternative splicing. New library preparation techniques attempt to extend single-cell sequencing by utilizing both long-reads and short-reads. These techniques split the library material, after it is tagged with cellular barcodes, into two pools: one for short-read sequencing and one for long-read sequencing. However, the challenge of utilizing these techniques is that they require matching the cellular barcodes sequenced by the erroneous long-reads to the cellular barcodes detected by the short-reads. To overcome this challenge, we introduce scTagger, a computational method to match cellular barcodes data from long-reads and short-reads. We tested scTagger against another state-of-the-art tool on both real and simulated datasets, and we demonstrate that scTagger has both significantly better accuracy and time efficiency.Graphical abstractDisplay OmittedHighlights•We introduce scTagger, a method for matching long-read and short-read cellular barcodes•The matching enables combining gene expression analysis and isoform detection•scTagger relies on a trie-based data structure to perform this matching•We show that scTagger is accurate and fast on both real and simulated dataBioinformatics; Genomics; Sequence analysis
国家哲学社会科学文献中心版权所有