摘要:Breast cancer is a common and highly heterogeneous disease . Understanding cellular diversity in the mammary gland and its surrounding micro-environment across diferent states can provide insight into cancer development in the human breast . Recently, we published a large-scale single-cell RNA expression atlas of the human breast spanning normal, preneoplastic and tumorigenic states . Single- cell expression profles of nearly 430,000 cells were obtained from 69 distinct surgical tissue specimens from 55 patients . This article extends the study by providing quality fltering thresholds, downstream processed R data objects, complete cell annotation and R code to reproduce all the analyses . Data quality assessment measures are presented and details are provided for all the bioinformatic analyses that produced results described in the study.