Child pages
  • Dynamic contrast-enhanced magnetic resonance images of breast cancer patients with tumor locations (Duke-Breast-Cancer-MRI)

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Summary


Breast MRI is a common image modality to assess extent of disease in breast cancer patients. Recent studies show that MRI has a potential in prognosis of patient sort and long term outcomes as well as predicting pathological and genomic features of the tumors. However, large, well annotated datasets are needed to make further progress in the field. We share such a dataset here.

In this dataset, we share MRI imaging and other data for 922 patients with invasive breast cancer. Their prone position axial breast MRI images were acquired by 1.5T or 3T scanners. Following MRI sequences are shared: a non-fat saturated T1-weighted sequence, a fat-saturated gradient echo T1-weighted pre-contrast sequence, and mostly three to four post-contrast sequences.

The images are associated with a variety of metadata. Specifically, we are sharing:

  1. Demographic, clinical, pathology, treatment, outcomes, and genomic data. This data has been collected from a variety of sources including clinical notes and has served as a source for multiple published papers on radiogenomics, outcomes prediction, and other.
  2. 529 imaging features which represent a variety of imaging characteristics including size, shape, texture, and enhancement of both the tumor and the surrounding tissue, which is combined of features commonly published in the literature, as well as the features developed in our lab. These features were used in our previous paper (https://doi.org/10.1016/j.eswa.2017.06.029)
  3. Annotation boxes provided by radiologists that indicate locations of the lesions in the images.

For more information see this site https://sites.duke.edu/mazurowski/resources/breast-cancer-mri-dataset/, the related publication, or contact TCIA Helpdesk at help@cancerimagingarchive.net

A large dataset of dynamic contrast-enhanced magnetic resonance images of the breast for breast cancer patients. Tumors annotated by radiologists. Extensive meta-data including patient demographics, treatment, outcomes, pathology, genomics.

This dataset is the dataset used in the following paper:

Info
iconfalse
titleRelated article
Saha, A., Harowicz, M. R., Grimm, L. J., Kim, C. E., Ghate, S. V., Walsh, R., & Mazurowski, M. A. (2018). A machine learning approach to radiogenomics of breast cancer: a study of 922 subjects and 529 DCE-MRI features. British journal of cancer, 119(4), 508-516. https://doi.org/10.1038/s41416-018-0185-8

The dataset contains four components: (1) DICOM Images, (2) Annotation_Boxes, (3) Clinical_and_Other_Features, and (4) Imaging_Features.  Breast MRI images of 922 patients, including the following sequences: fat saturated pre-contrast sequence, fat saturated post-contrast sequences, and non-fat saturated T1 weighted sequence. Tumor annotations in the form of boxes drawn by radiologists, six values in the spreadsheet represent the location of the start row, end row, start column, end column, start slice, and end slice of the box. The annotation boxes can be applied directly to pre and post sequences. 

Accompanying the DICOM are two spreadsheets: a spreadsheet containing demographic, clinical, pathology, outcomes, genomic, and other information for the patients. The legend and column explanation are included in the spreadsheet (different tabs). Also, a spreadsheet that contains 529 imaging features extracted from the images. These are the features that were used in the paper listed above (Saha et al., BJR 2018).

Acknowledgements

We would like to acknowledge the individuals and institutions that have provided data for this collection:

  • Hospital/Institution Name city, state, country - Special thanks to First Last Names, degree PhD, MD, etc from the Department of xxxxxx, Additional Names from same location.

  • Continue with any names from additional submitting sites if collection consists of more that one.



    Localtab Group



    Localtab
    activetrue
    titleData Access

    Data Access

    Click the  Download button to save to your local storage. Click the Search button to open our Data Portal, where you can browse the data collection and/or download a subset of its contents.


    Data TypeDownload all or Query/Filter
    Images (DICOM, XX.X GB)

     

    Tcia button generator
    labelSearch
    urlhttps://nbia.cancerimagingarchive.net/nbia-search/?CollectionCriteria=Duke-Breast-Cancer-MRI

    (Requires NBIA Data Retriever .)

    Clinical and Other Features (XLSX, 582 kB)

      

    Annotation Boxes (XLSX, 49 kB)

    Imaging features (XLSX, 6.44 MB)


    Click the Versions tab for more info about data releases.




    Localtab
    titleDetailed Description

    Detailed Description


    Image Statistics


    Modalities

    MR

    Number of Participants

    922

    Number of Studies

    922

    Number of Series

    5,304

    Number of Images

    773,126

    Images Size (GB)351.3

    Users please note this caveat about DICOM tag (0020,0052) Frame Of Reference UID: this

    This DICOM tag was lost during deidentification and curation, and has been replaced with a Dummy value per study that may not be reliable for image alignment.




    Localtab
    titleCitations & Data Usage Policy

    Citations & Data Usage Policy

    Add any special restrictions in here.

    Tcia license 4 noncommercial
     

    Info
    titleData Citation

    Mazurowski M.Saha, Saha A., Harowicz, M. R., Grimm, L. J., Weng, J.,  Cain, E. H., Kim, C. E., Ghate, S. V., Walsh, R., & Mazurowski, M. A.  (2020) Dynamic contrast-enhanced magnetic resonance images of breast cancer patients with tumor locations [ Dataset]. The Cancer Imaging Archive. https://doi.org/   < Dataset DOI coming soon> 


    (no other authors seem to have an ORCiD)

    Saha https://orcid.org/0000-0002-7650-1720

    (Abstract for DOI) 

    Breast MRI is a common image modality to assess extent of disease in breast cancer patients. Recent studies show that MRI has a potential in prognosis of patient sort and long term outcomes as well as predicting pathological and genomic features of the tumors. However, large, well annotated datasets are needed to make further progress in the field. We share such a dataset here.

    In this dataset, we share MRI imaging and other data for 922 patients with invasive breast cancer. Their prone position axial breast MRI images were acquired by 1.5T or 3T scanners. Following MRI sequences are shared: a non-fat saturated T1-weighted sequence, a fat-saturated gradient echo T1-weighted pre-contrast sequence, and mostly three to four post-contrast sequences.

    The images are associated with a variety of metadata. Specifically, we are sharing:

    1. Demographic, clinical, pathology, treatment, outcomes, and genomic data. This data has been collected from a variety of sources including clinical notes and has served as a source for multiple published papers on radiogenomics, outcomes prediction, and other.
    2. 529 imaging features which represent a variety of imaging characteristics including size, shape, texture, and enhancement of both the tumor and the surrounding tissue, which is combined of features commonly published in the literature, as well as the features developed in our lab. These features were used in our previous paper (https://doi.org/10.1016/j.eswa.2017.06.029)
    3. Annotation boxes provided by radiologists that indicate locations of the lesions in the images.


    Info
    titleSource Publication

    Saha, A., Harowicz, M. R., Grimm, L. J., Kim, C. E., Ghate, S. V., Walsh, R., & Mazurowski, M. A. (2018). A machine learning approach to radiogenomics of breast cancer: a study of 922 subjects and 529 DCE-MRI features. British journal of cancer, 119(4), 508-516. DOI: https://doi.org/10.1038/s41416-018-0185-8


    Info
    titleTCIA Citation

    Clark K, Vendt B, Smith K, Freymann J, Kirby J, Koppel P, Moore S, Phillips S, Maffitt D, Pringle M, Tarbox L, Prior F. The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository, Journal of Digital Imaging, Volume 26, Number 6, December, 2013, pp 1045-1057. DOI: https://doi.org/10.1007/s10278-013-9622-7


    Other Publications Using This Data

    TCIA maintains a list of publications which leverage TCIA data. If you have a manuscript you'd like to add please contact the TCIA Helpdesk.




    Localtab
    titleVersions

    Version 1 (Current): Updated 2020/12/03


    Data TypeDownload all or Query/Filter
    Images (DICOM, xx.x GB)

     

    Tcia button generator
    labelSearch
    urlhttps://nbia.cancerimagingarchive.net/nbia-search/?CollectionCriteria=Duke-Breast-Cancer-MRI

    (Requires NBIA Data Retriever .)

    Clinical and Other Features (XLSX, 582 kB)

      

    Annotation Boxes (XLSX, 49 kB)

    Imaging features (XLSX, 6.44 MB)


    Added new subjects.