Child pages
  • TCGA Breast Phenotype Research Group Data sets
Skip to end of metadata
Go to start of metadata


At the time of our study, 108 cases with breast MRI data were available in the TCGA-BRCA collection. In order to minimize variations in image quality across the multi-institutional cases we included only breast MRI studies acquired on GE 1.5 Tesla magnet strength scanners (GE Medical Systems, Milwaukee,Wisconsin, USA) scanners, yielding a total of 93 cases. We then excluded cases that had missing images in the dynamic sequence (1 patient), or at the time did not have gene expression analysis available in the TCGA Data Portal (8 patients). After these criteria, a dataset of 84 breast cancer patients resulted, with MRIs from four institutions: Memorial Sloan Kettering Cancer Center, the Mayo Clinic, the University of Pittsburgh Medical Center, and the Roswell Park Cancer Institute. The resulting cases contributed by each institution were 9 (date range 1999-2002), 5 (1999-2003), 46 (1999-2004), and 24 (1999-2002), respectively. The dataset of biopsy proven invasive breast cancers included 74 (88%) ductal, 8 (10%) lobular, and 2 (2%) mixed. Of these, 73 (87%) were ER+, 67 (80%) were PR+, and 19 (23%) were HER2+.  Various types of analyses were conducted using the combined imaging, genomic, and clinical data.  Those analyses are described within several manuscripts created by the group (cited above). 


Have you written a paper which leveraged this data? Let us know at

Publication Citation

  • Guo W, Li H, Zhu Y, Lan L, Yang S, Drukker K, Morris E, Burnside E, Whitman G, Giger ML*, Ji Y*:  Prediction of clinical phenotypes in invasive breast carcinomas from the integration of radiomics and genomics data.  J Medical Imaging 2(4), 041007 (Oct-Dec 2015).
  • Burnside E, Drukker K, Li H, Bonaccio E, Zuley M, Ganott M, Net JM, Sutton E, Brandt K, Whitman G, Conzen S, Lan L, Ji Y, Zhu Y, Jaffe C, Huang E, Freymann J, Kirby J, Morris EA*, Giger ML*:  Using computer-extracted image phenotypes from tumors on breast MRI to predict breast cancer pathologic stage. Cancer doi: 10.1002/cncr.29791, 2015.
  • Zhu Y, Li H, Guo W, Drukker K, Lan L, Giger ML*, Ji Y*:  Deciphering genomic underpinnings of quantitative MRI-based radiomic phenotypes of invasive breast carcinoma.  Nature – Scientific Reports 5:17787. doi: 10.1038/srep17787, 2015.
  • Li H, Zhu Y, Burnside ES, …. Perou CM, Ji Y*, Giger ML*:  MRI radiomics signatures for predicting the risk of breast cancer recurrence as given by research versions of gene assays of MammaPrint, Oncotype DX, and PAM50.  Radiology DOI:, 2016.
  • Li H, Zhu Y, Burnside ES, …. Perou CM, Ji Y, Giger ML:  Quantitative MRI radiomics in the prediction of molecular classifications of breast cancer subtypes in the TCGA/TCIA Dataset. npj Breast Cancer (2016) 2, 16012; doi:10.1038/npjbcancer.2016.12; published online 11 May 2016.


  • Image Data:  DICOM – Save/open this file to initiate our Java Web Start download manager to begin your download
  • Radiologist Annotations/Markup:  Please contact the TCIA Helpdesk to request access
  • Computer-extracted image phenotypes:  Please contact the TCIA Helpdesk to request access 
  • Multi-gene assays including MammaPrint, Oncotype DX, and PAM50:  Please contact the TCIA Helpdesk to request access
  • TCGA Clinical Data (from TCGA Data Portal, archived in case of subsequent updates made by TCGA):  Please contact the TCIA Helpdesk to request access
  • No labels