Breast Cancer Profiling Project, Gene Expression 1: Baseline mRNA sequencing on 35 breast cell lines - Dataset (ID:20348)

HMS Dataset ID: 20348
Dataset Title: Breast Cancer Profiling Project, Gene Expression 1: Baseline mRNA sequencing on 35 breast cell lines
Screening Lab Investigator: Caitlin E. Mills, Marc Hafner, Sarah A. Boswell
Screening Principal Investigator: Peter K. Sorger
Assay Description: We measured the baseline mRNAseq profiles of two non-malignant breast cell lines and 33 breast cancer cell lines of which twenty were triple negative, six were hormone receptor positive, four were Her2 amplified, and three were established from triple negative patient-derived xenografts.
Assay Protocol: 1. Cells in mid-log phase of the growth cycle from 35 breast cancer cell lines were plated at appropriate densities to achieve ~40% confluence at the time of harvest in a 10 cm plate in their recommended growth media. The growth conditions used are detailed in this file download: Growth Conditions. Three technical replicates (cell lines PDX1258, PDX1328, PDX HCCI002), and one biological replicate (cell line MCF 10A) were included for a total of 39 samples.
2. The cells were grown for 24-48 hours at 37°C in the presence of 5% CO2 (all MDAMB lines except MDAMB231 were grown in the absence of CO2).
3. Plates were put on ice, and washed twice with 10 ml cold PBS. All residual PBS was aspirated following the second wash.
4. 600 ul of RLT (Qiagen) + bME (1:100 (v/v)) was added to each plate and cells were scraped off using cell lifters (Corning). Cell suspensions were flash frozen and stored at -80°C until further processing.
5. Once all samples were collected, cell suspensions were thawed and homogenized using QiaShredder columns according to the manufacturer’s instructions.
6. RNA extraction was completed with an RNeasy kit according to the manufacturer’s instructions with a 20 min on-column DNase digestion at room temperature.
7. RNA concentrations were measured with a NanoDrop (Thermo), and extract quality was evaluated by Bioanalyzer (Agilent). Samples with RNA integrity numbers>9 were deemed to be of sufficient quality.
8. 500 ng of each sample diluted with ERCC spike-in mix 2 (Ambion) was used for library preparation with a High Throughput TruSeq Stranded mRNA Library Prep Kit (Illumina) following the manufacturer’s protocol at 1/3 reaction volume. The final amplification was run for 10 cycles.
9. Libraries were quantified using the Qubit dsDNA HS assay (Thermo Fisher Scientific).
10. Library size and quality were spot checked for a subset of samples by Bioanalyzer (Agilent). The average size of cDNA fragments in the libraries was 360 base pairs.
11. Libraries were pooled at equimolar concentrations and then quantitated using the KAPA library quantification kit (KAPA Biosystems) at the Harvard University Bauer Core Facility.
12. Paired end 75 base pair reads were sequenced on a NextSeq instrument (Illumina) at the Harvard University Bauer Core Facility.
13. The STAR algorithm in the bcbio-Nextgen toolkit was used to map high quality reads to human genome build GRCh37. Refer to: http://bcbio-nextgen.readthedocs.io/en/latest/contents/pipelines.html#rna-seq for details.
14. The resulting counts table was normalized to reads per kilobase of transcripts per million mapped reads (RPKM). Where replication was performed (technical replication for cell lines PDX1258, PDX1328, PDX HCCI002 and biological replication for cell line MCF 10A), the mean of the two replicates is reported.
HMS Dataset Type: RNA-Seq
Date Publicly Available: 2018-07-06
Most Recent Update: 2018-07-06