Summary
A large dataset of dynamic contrast-enhanced magnetic resonance images of the breast for breast cancer patients. Tumors annotated by radiologists. Extensive meta-data including patient demographics, treatment, outcomes, pathology, genomics.
This dataset is the dataset used in the following paper:
The dataset contains four components: (1) DICOM Images, (2) Annotation_Boxes, (3) Clinical_and_Other_Features, and (4) Imaging_Features.
(1) DICOM Images
Breast MRI images of 922 patients, inside the folder of each patient there are the following sequences: fat saturated pre-contrast sequence, fat saturated post-contrast sequences, and non-fat saturated T1 weighted sequence.
(2) Annotation_Boxes
Tumor annotations in the form of boxes drawn by radiologists, six values in the spreadsheet represent the location of the start row, end row, start column, end column, start slice, and end slice of the box. The annotation boxes
can be applied directly to pre and post sequences.
(3) Clinical and Other Features
A spreadsheet containing demographic, clinical, pathology, outcomes, genomic, and other information for the patients. The legend and column explanation are included in the spreadsheet (different tabs).
(4) Imaging features
A spreadsheet that contains 529 imaging features extracted from the images. These are the features that were used in the paper listed above (Saha et al., BJR 2018).
Acknowledgements
We would like to acknowledge the individuals and institutions that have provided data for this collection:
Hospital/Institution Name city, state, country - Special thanks to First Last Names, degree PhD, MD, etc from the Department of xxxxxx, Additional Names from same location.
- Continue with any names from additional submitting sites if collection consists of more that one.
Data Access
Click the Download button to save to your local storage. Click the Search button to open our Data Portal, where you can browse the data collection and/or download a subset of its contents.
Data Type | Download all or Query/Filter |
---|---|
Images (DICOM, XX.X GB) |
(Requires NBIA Data Retriever .) |
Clinical and Other Features (XLSX) | |
Annotation Boxes (XLSX) | |
Imaging features (XLSX, 6.6 MB) |
Click the Versions tab for more info about data releases.
Detailed Description
Image Statistics | |
---|---|
Modalities | MR |
Number of Participants | 922 |
Number of Studies | |
Number of Series | 5,304 |
Number of Images | |
Images Size (GB) |
Citations & Data Usage Policy
Add any special restrictions in here.
Data Citation
< Dataset DOI coming soon>
Source Publication
Saha, A., Harowicz, M. R., Grimm, L. J., Kim, C. E., Ghate, S. V., Walsh, R., & Mazurowski, M. A. (2018). A machine learning approach to radiogenomics of breast cancer: a study of 922 subjects and 529 DCE-MRI features. British journal of cancer, 119(4), 508-516. DOI: https://doi.org/10.1038/s41416-018-0185-8
TCIA Citation
Clark K, Vendt B, Smith K, Freymann J, Kirby J, Koppel P, Moore S, Phillips S, Maffitt D, Pringle M, Tarbox L, Prior F. The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository, Journal of Digital Imaging, Volume 26, Number 6, December, 2013, pp 1045-1057. DOI: https://doi.org/10.1007/s10278-013-9622-7
Other Publications Using This Data
TCIA maintains a list of publications which leverage TCIA data. If you have a manuscript you'd like to add please contact the TCIA Helpdesk.
Version 1 (Current): Updated 2020/mm/dd
Data Type | Download all or Query/Filter |
---|---|
Images (DICOM, xx.x GB) |
(Requires NBIA Data Retriever .) |
Other (format) |
Added new subjects.