期刊名称:Computational and Structural Biotechnology Journal
印刷版ISSN:2001-0370
出版年度:2016
卷号:14
页码:87-90
DOI:10.1016/j.csbj.2015.11.004
语种:
出版社:Computational and Structural Biotechnology Journal
摘要:With next-generation sequencing, the genomic data available for the characterization of integration sites (IS) has dramatically increased. At present, in a single experiment, several thousand viral integration genome targets can be investigated to define genomic hot spots. In a previous article, we renovated a formal CIS analysis based on a rigid fixed window demarcation into a more stretchy definition grounded on graphs. Here, we present a selection of supporting data related to the graph-based framework (GBF) from our previous article, in which a collection of common integration sites (CIS) was identified on six published datasets. In this work, we will focus on two datasets, ISRTCGD and ISHIV, which have been previously discussed. Moreover, we show in more detail the workflow design that originates the datasets.