Summary
This dataset includes brain MRI scans of adult brain glioma patients, comprising of 4 structural modalities (i.e., T1, T1c, T2, T2-FLAIR) and associated manually generated ground truth labels for each tumor sub-region (enhancement, necrosis, edema), as well as their MGMT promoter methylation status. These scans are a collection of data from existing TCIA collections, but also cases provided by individual institutions and willing to share with a cc-by license.The BraTS dataset describes a retrospective collection of brain tumor structural mpMRI scans of 2,040 patients, acquired from multiple different institutions under standard clinical conditions, but with different equipment and imaging protocols, resulting in a vastly heterogeneous image quality reflecting diverse clinical practice across different institutions. The 4 structural mpMRI scans included in the BraTS challenge describe a) native (T1) and b) post-contrast T1-weighted (T1Gd (Gadolinium)), c) T2-weighted (T2), and d) T2 Fluid Attenuated Inversion Recovery (T2-FLAIR) volumes, acquired with different protocols and various scanners from multiple institutions. Furthermore, data on the O[6]-methylguanine-DNA methyltransferase (MGMT) promoter methylation status is provided as a binary label. Notably, MGMT is a DNA repair enzyme that the methylation of its promoter in newly diagnosed glioblastoma has been identified as a favorable prognostic factor and a predictor of chemotherapy response.
It is curated for computational image analysis of segmentation and prediction of the MGMT promoter methylation status.
A note about available TCIA data which were converted for use in this Challenge
Dr. Bakas's group has provided skull-stripped challenge TRAINING data in NIfTI that do not pose DUA-level risk of potential facial reidentification, and segmentations to go with them. This group has provided the skull-stripped challenge TEST data in NIfTI, and segmentations to go with them, available upon request through the helpdesk. If you want the DICOM-format files, then as part of page 1 of this Agreement form TCIA needs to have the citation(s) for the (TCGA-GBM, CPTAC-GBM, Ivy GAP, TCGA-LGG, UPENN-GBM, UCSF-PDGM, ACRIN-FMISO-Brain) filled in. These earlier data that pose enough reidentification risk that TCIA must keep them behind an explicit usage agreement follow this restriction.
Please also note that specificity of the exact series in DICOM became which exact volume in NIfTI has, unfortunately, been lost to time but the available lists below (in most cases, by Collection, TCIA Patient ID, Study date) represent our best effort at reconstructing the BraTS input imaging.
Acknowledgements
We would like to acknowledge the individuals and institutions that have provided data for this collection:
- Data used in this publication were obtained as part of the RSNA-ASNR-MICCAI Brain Tumor Segmentation (BraTS) Challenge project through Synapse ID (syn25829067).
Data Access
Data Type | Download all or Query/Filter | License |
---|---|---|
Images and Segmentations (NIfTI, 1.4 TB) 11.731 GB | Complete dataset all tasks (Download and apply the IBM-Aspera-Connect plugin to your browser to retrieve this faspex package) | |
BraTS 2021 Training set (NIfTI, XX GB ) | (Download and apply the IBM-Aspera-Connect plugin to your browser to retrieve this faspex package) | |
BraTS 2021 Validation set (NIfTI, XX GB) | (Download and apply the IBM-Aspera-Connect plugin to your browser to retrieve this faspex package) | |
BraTS Task2 Radiogenomics Classifier task Training images (DICOM, GB) |
| ( is this open license stuff?) |
BraTS Task2 Radiogenomics Classifier task molecular marker table (JSON/XLS/CSV spreadsheet, MB) |
| ( is this open license stuff?) |
BraTS 2021 Classifier Training Task | (Download and apply the IBM-Aspera-Connect plugin to your browser to retrieve this faspex package) | CC by 4.0 |
Clinical data (CSV) | ||
Feature matrices (format, ##GB) | link or attachment | |
ID Crosswalk between BraTS ID and TCIA ID (xlsx, 79 kB) |
Click the Versions tab for more info about data releases.
Additional Resources for this Dataset
Additional Resources for this Dataset
The NCI Cancer Research Data Commons (CRDC) provides access to additional data and a cloud-based data science infrastructure that connects data sets with analytics tools to allow users to share, integrate, analyze, and visualize cancer research data.
- Imaging Data Commons (IDC) (Imaging Data)
- Proteomic Data Commons (PDC) (Proteomic & Clinical Data)
- TCGA-GBM
- Genomic Data Commons (GDC) (Genomic & Clinical Data)
- TCGA-GBM
Note to curators! Below are examples for what to do with other external resources/links that don't fit into the above categories.
The following external resources have been made available by the data submitters. These are not hosted or supported by TCIA, but may be useful to researchers utilizing this collection.
- Software / Code on Github
- Genomics data in DbGAP
- Genomics data in Gene Expression Omnibus
- RSNA or kaggle link to molecular marker table if they only provide us a link
TCIA Collections Used (in part) in these analyses:
Source Data Type | Download | License |
---|---|---|
DICOM Used in BraTS 2021 Segmentation Training set from CPTAC-GBM , TCGA-GBM , TCGA-LGG , ACRIN-FMISO-Brain (ACRIN 6684) , IvyGAP ,UPENN-GBM | To get data used in BraTS-2021 please request the following Collections in your Agreement: CPTAC-GBM , TCGA-GBM , TCGA-LGG , ACRIN-FMISO-Brain (ACRIN 6684) , IvyGAP | |
DICOM Used in BraTS 2021 Classifier Training set from | To get data used in BraTS-2021 please request the following Collections in your Agreement: CPTAC-GBM , TCGA-GBM , TCGA-LGG , ACRIN-FMISO-Brain (ACRIN 6684) , IvyGAP | |
DICOM Used in BraTS 2021 Segmentation Validation set from | To get data used in BraTS-2021 please request the following Collections in your Agreement: CPTAC-GBM , TCGA-GBM , TCGA-LGG , ACRIN-FMISO-Brain (ACRIN 6684) , IvyGAP | |
DICOM Used in BraTS 2021 Classifier Validation set from | To get data used in BraTS-2021 please request the following Collections in your Agreement: CPTAC-GBM , TCGA-GBM , TCGA-LGG , ACRIN-FMISO-Brain (ACRIN 6684) , IvyGAP | |
new trapd-lnk files of only already-used UCSF-PDGM files (Download and apply the IBM-Aspera-Connect plugin to your browser to retrieve this faspex package) |
Third Party Analyses of this Dataset
TCIA encourages the community to publish your analyses of our datasets. Below is a list of such third party analyses published using this Collection:
Detailed Description
Image Statistics | Radiology Image Statistics |
---|---|
Modalities | MR, Segmentations |
Number of Patients | 2,040 |
Number of Studies | |
Number of Series | 7,131 |
Number of Images | |
Images Size (GB) | 11.731 |
- the "challenge test set dataset" is sequestered on synapse.org as a tarball of nii & seg & age & diagnosis. Please see their website for more detail.
The data used in BraTS Challenges often have some overlap with other TCIA Collections, cases, and series. Some filters for handling these, so that you can work with statistically not-duplicated images, include these below:
- Manifests of case identifiers between BraTS and TCIA : BraTS2021_MappingToTCIA.xlsx
- Spreadsheet list of cases and series used in prior year BraTS Challenges may also refer to these:
- Multimodal Brain Tumor Segmentation Challenge 2018 (BraTS)
- Multimodal Brain Tumor Segmentation Challenge 2019
- Segmentation Labels and Radiomic Features for the Pre-operative Scans of the TCGA-GBM collection (BraTS-TCGA-GBM)
- Segmentation Labels and Radiomic Features for the Pre-operative Scans of the TCGA-LGG collection (BraTS-TCGA-LGG)
- Spreadsheet list of new (DICOM and NIFTI) series files with no TCIA DICOM equivalent:
Here are some data splits that you might find useful. Manifest of (DICOM and NIFTI) files sourced from other TCIA Collections, so you can avoid superset accidental duplication in case you want everything TCIA has, after you've gotten all of BRaTS data
Notes about Image Registration:
- Transformation matrices DICOM to NIfTI are not available.
- Segmentation task image volume have been set to x=y=240 voxels by z=155 voxels.
- All Radiogenomics Classifier task files are restored to original DICOM resolution & orientation (thus volume may vary).
Citations & Data Usage Policy
Users must abide by the TCIA Data Usage Policy and Restrictions. Attribution should include references to the following citations:
Data Citation
Baid, U., Ghodasara, S., Mohan, S., Bilello, M., Calabrese, E., Colak, E., Farahani, K., Kalpathy-Cramer, J., Kitamura, F. C., Pati, S., Prevedello, L., Rudie, J., Sako, C., Shinohara, R., Bergquist, T., Chai, R., Eddy, J., Elliott, J., Reade, W., Schaffter, T., Yu, T., Zheng, J., Davatzikos, C., Mongan, J., Hess, C., Cha, S., Villanueva-Meyer, J., Freymann, J. B., Kirby, J. S., Wiestler, B., Crivellaro, P., Colen, R. R., Kotrotsou, A., Marcus, D., Milchenko, M., Nazeri, A., Fathallah-Shaykh, H., Wiest, R., Jakab, A., Weber, M-A., Mahajan, A., Menze, B., Flanders, A E., Bakas, S., (2023) RSNA-ASNR-MICCAI-BraTS-2021 Dataset. The Cancer Imaging Archive DOI: 10.7937/jc8x-9874
Acknowledgement
"The results <published or shown> here are in whole or part based upon data generated by the TCGA Research Network: http://cancergenome.nih.gov/."
Publication Citation
You are free to use and/or refer to the BraTS datasets in your own research, provided that you always cite the flagship manuscript (published or pre-published) resulting from the challenge as well as the following three manuscripts:
- Baid, U., Ghodasara, S., Mohan, S., Bilello, M., Calabrese, E., Colak, E., Farahani, K., Kalpathy-Cramer, J., Kitamura, F. C., Pati, S., Prevedello, L. M., Rudie, J. D., Sako, C., Shinohara, R. T., Bergquist, T., Chai, R., Eddy, J., Elliott, J., Reade, W., Schaffter, T., Yu, T., Zheng, J., Moawad, A. W., Coelho, L. O., McDonnell, O., Miller, E., Moron, F. E., Oswood, M. C., Shih, R. Y., Siakallis, L., Bronstein, Y., Mason, J. R., Miller, A. F., Choudhary, G., Agarwal, A., Besada, C. H., Derakhshan, J. J., Diogo, M. C., Do-Dai, D D., Farage, L., Go, J. L., Hadi, M., Hill, V. B., Iv, M., Joyner, D., Lincoln, C., Lotan, E., Miyakoshi, A., Sanchez-Montano, M., Nath, J., Nguyen, X. V., Nicolas-Jilwan, M., Ortiz Jimenez, J., Ozturk, K., Petrovic, B. D., Shah, C., Shah, L. M., Sharma, M., Simsek, O., Singh, A. K., Soman, S., Statsevych, V., Weinberg, B. D., Young, R. J., Ikuta, I., Agarwal, A. K.,Cambron, S. C., Silbergleit, R., Dusoi, A., Postma, A. A., Letourneau-Guillon, L., Guzman Perez-Carrillo, G. J., Saha, A., Soni, N., Zaharchuk, G., Zohrabian, V. M., Chen, Y., Cekic, M. M., Rahman, A., Small, J. E., Sethi, V., Davatzikos, C., Mongan, J., Hess, C., Cha, S., Villanueva-Meyer, J., Freymann, J. B., Kirby, J. S., Wiestler, B., Crivellaro, P., Colen, R. R., Kotrotsou, A., Marcus, D., Milchenko, M., Nazeri, A., Fathallah-Shaykh, H., Wiest, R., Jakab, A., Weber, M-A. Mahajan ,A., Menze, B., Flanders, A. E., Bakas, S. (2021). The RSNA-ASNR-MICCAI BraTS 2021 Benchmark on Brain Tumor Segmentation and Radiogenomic Classification (Version 2). arXiv. DOI: 10.48550/arXiv.2107.02314
- Menze, B. H., Jakab, A., Bauer, S., Kalpathy-Cramer, J., Farahani, K., Kirby, J., Burren, Y., Porz, N., Slotboom, J., Wiest, R., Lanczi, L., Gerstner, E., Weber, M.-A., Arbel, T., Avants, B. B., Ayache, N., Buendia, P., Collins, D. L., Cordier, N., … Van Leemput, K. (2015). The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS). In IEEE Transactions on Medical Imaging (Vol. 34, Issue 10, pp. 1993–2024). Institute of Electrical and Electronics Engineers (IEEE). DOI: 10.1109/tmi.2014.2377694
- Bakas, S., Akbari, H., Sotiras, A., Bilello, M., Rozycki, M., Kirby, J. S., Freymann, J. B., Farahani, K., & Davatzikos, C. (2017). Advancing The Cancer Genome Atlas glioma MRI collections with expert segmentation labels and radiomic features. In Scientific Data (Vol. 4, Issue 1). https://doi.org/10.1038/sdata.2017.117
TCIA Citation
Clark, K., Vendt, B., Smith, K., Freymann, J., Kirby, J., Koppel, P., Moore, S., Phillips, S., Maffitt, D., Pringle, M., Tarbox, L., & Prior, F. (2013). The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository. In Journal of Digital Imaging (Vol. 26, Issue 6, pp. 1045–1057). Springer Science and Business Media LLC. https://doi.org/10.1007/s10278-013-9622-7
Additional Publication Resources:
The Collection authors suggest the below will give context to this dataset:
You are free to use and/or refer to the BraTS datasets in your own research. In addition, please be specific and also cite the following datasets that were part of this Challenge:
- Bakas, S., Akbari, H., Sotiras, A., Bilello, M., Rozycki, M., Kirby, J., Freymann, J., Farahani, K., & Davatzikos, C. (2017). Segmentation Labels for the Pre-operative Scans of the TCGA-GBM collection [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/K9/TCIA.2017.KLXWJJ1Q
- Bakas, S., Akbari, H., Sotiras, A., Bilello, M., Rozycki, M., Kirby, J., Freymann, J., Farahani, K., & Davatzikos, C. (2017). Segmentation Labels for the Pre-operative Scans of the TCGA-LGG collection [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/K9/TCIA.2017.GJQ7R0EF
- Scarpace, L., Mikkelsen, T., Cha, S., Rao, S., Tekchandani, S., Gutman, D., Saltz, J. H., Erickson, B. J., Pedano, N., Flanders, A. E., Barnholtz-Sloan, J., Ostrom, Q., Barboriak, D., & Pierce, L. J. (2016). The Cancer Genome Atlas Glioblastoma Multiforme Collection (TCGA-GBM) (Version 4) [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/K9/TCIA.2016.RNYFUYE9
- Pedano, N., Flanders, A. E., Scarpace, L., Mikkelsen, T., Eschbacher, J. M., Hermes, B., Sisneros, V., Barnholtz-Sloan, J., & Ostrom, Q. (2016). The Cancer Genome Atlas Low Grade Glioma Collection (TCGA-LGG) (Version 3) [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/K9/TCIA.2016.L4LTD3TK
- Calabrese, E., Villanueva-Meyer, J., Rudie, J., Rauschecker, A., Baid, U., Bakas, S., Cha, S., Mongan, J., & Hess, C. (2022). The University of California San Francisco Preoperative Diffuse Glioma MRI (UCSF-PDGM) [Dataset]. The Cancer Imaging Archive. https://10.7937/tcia.bdgf-8v37
- Bakas, S., Sako, C., Akbari, H., Bilello, M., Sotiras, A., Shukla, G., Rudie, J. D., Flores Santamaria, N., Fathi Kazerooni, A., Pati, S., Rathore, S., Mamourian, E., Ha, S. M., Parker, W., Doshi, J., Baid, U., Bergman, M., Binder, Z. A., Verma, R., … Davatzikos, C. (2021). Multi-parametric magnetic resonance imaging (mpMRI) scans for de novo Glioblastoma (GBM) patients from the University of Pennsylvania Health System (UPENN-GBM) (Version 2) [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/TCIA.709X-DN49
Other Publications Using This Data
TCIA maintains a list of publications which leverage our data. If you have a manuscript you'd like to add please contact TCIA's Helpdesk.