Summary
The RADCURE dataset was collected clinically for radiation therapy treatment planning and retrospectively reconstructed for quantitative imaging research.
Acquisition and Validation Methods: RADCURE is comprised of data for 3,346 patients and contains computed tomography (CT) images with corresponding normal and non-normal tissue contours. CT scans were collected using systems from three different manufacturers. Standard clinical imaging protocols were followed, and contours were generated and reviewed at weekly quality assurance rounds. RADCURE imaging and structure set data was extracted from our institution’s radiation treatment planning and oncology systems using an in-house data mining and processing system. Furthermore, images are linked to clinical data for each patient and include demographic, clinical and treatment information based on the 7th edition TNM staging system. The median patient age is 63, with the final dataset including 80% males. Oropharyngeal cancer makes up 50% of the population with larynx, nasopharynx, and hypopharynx cancer, comprising 25, 12, and 5% respectively. Median follow-up was 5 years with 60% of the patients alive at last follow-up.
Data Format and Usage Notes: During extraction of images and contours from our institution’s radiation treatment planning and oncology systems, the data was converted to DICOM and RTSTRUCT formats, respectively. To improve the usability of the RTSTRUCT files, individual contour names were standardized for primary tumor volumes and 29 organs-at-risk. Demographic, clinical, and treatment information is provided as a comma-separated values (csv) file. This dataset is a superset of the Radiomic Biomarkers in Oropharyngeal Carcinoma (OPC-Radiomics) dataset and fully encapsulates all previous data; this dataset replaces the OPC-Radiomics dataset. The RTSTRUCTs from OPC-Radiomics have been standardized to adhere to the TG263 nomenclature. Age of 90 years or greater is considered PHI and set to 90 years to minimize impact to privacy. Both radiological and clinical metadata were offset by an undisclosed number of days for anonymization and should be noted for downstream analysis. The TG263-standardized RTSTRUCTs include only the GTVp (primary gross tumor volume) contours. Patients without corresponding GTVp contours will not have RTSTRUCTs.
Potential Applications: The availability of imaging, clinical, demographic and treatment data in RADCURE makes it a viable option for a variety of quantitative image analysis research initiatives. This includes the application of machine learning or artificial intelligence methods to expedite routine clinical practices, discover new non-invasive biomarkers, or develop prognostic models.
Inclusion: The dataset used for this study consists of 3346 head and neck cancer CT image volumes collected from 2005-2017 treated with definitive RT at the University Health Network (UHN) in Toronto, Canada
Acknowledgements
We would like to acknowledge the individuals and institutions that have provided data for this collection:
University Health Network (UHN), Toronto, Ontario, Canada - Special thanks to Scott Bratman, PhD, Department of Radiation Oncology and Medical Biophysics, University of Toronto.
Data Access
Data Type | Download all or Query/Filter | License |
---|---|---|
Images and Radiation Therapy Structures, full dataset (DICOM, 324 GB) | (Download requires the NBIA Data Retriever) | |
Images and Radiation Therapy Structures, updated OPC-Radiomics (DICOM, 57 GB) | ||
Clinical data (CSV, 452 kB) | ||
Patient ID Mapping-RADCURE patient id to OPC-RADIOMICS patient id (CSV, 14 kB) |
Click the Versions tab for more info about data releases.
Please contact help@cancerimagingarchive.net with any questions regarding usage.
Detailed Description
Image Statistics | |
---|---|
Modalities | CT, RTSTRUCT |
Number of Patients | 3346 |
Number of Studies | 3346 |
Number of Series | 9693 |
Number of Images | 618837 |
Images Size (GB) | 333 |
Citations & Data Usage Policy
Users must abide by the TCIA Data Usage Policy and Restrictions. Attribution should include references to the following citations:
Data Citation
Welch, M. L., Kim, S., Hope, A., Huang, S. H., Lu, Z., Marsilla, J., Kazmierski, M., Rey-McIntyre, K., Patel, T., O’Sullivan, B., Waldron, J., Kwan, J., Su, J., Soltan Ghoraie, L., Chan, H. B., Yip, K., Giuliani, M., Princess Margaret Head And Neck Site Group, Bratman, S., … Tadic, T. (2023). Computed Tomography Images from Large Head and Neck Cohort (RADCURE) (Version 1) [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/J47W-NM11
Publication Citation
Kazmierski, M., Welch, M., Kim, S., McIntosh, C., Rey-McIntyre, K., Huang, S. H., Patel, T., Tadic, T., Milosevic, M., Liu, F.-F., Ryczkowski, A., Kazmierska, J., Ye, Z., Plana, D., Aerts, H. J., Kann, B. H., Bratman, S. V., Hope, A. J., & Haibe-Kains, B. (2023). Multi-institutional prognostic modelling in head and neck cancer: evaluating impact and generalizability of deep learning and radiomics. In Cancer Research Communications. American Association for Cancer Research (AACR). https://doi.org/10.1158/2767-9764.crc-22-0152
TCIA Citation
Clark, K., Vendt, B., Smith, K., Freymann, J., Kirby, J., Koppel, P., Moore, S., Phillips, S., Maffitt, D., Pringle, M., Tarbox, L., & Prior, F. (2013). The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository. In Journal of Digital Imaging (Vol. 26, Issue 6, pp. 1045–1057). Springer Science and Business Media LLC. https://doi.org/10.1007/s10278-013-9622-7
Other Publications Using This Data
TCIA maintains a list of publications which leverage TCIA data. If you have a manuscript you'd like to add please contact the TCIA Helpdesk.
Version 1 (Current): Updated 2023/06/14
Data Type | Download all or Query/Filter | License |
---|---|---|
Images (DICOM, 324 GB) | (Download requires the NBIA Data Retriever) | |
Images and Radiation Therapy Structures, updated OPC-Radiomics (DICOM, 57 GB) | ||
Clinical data (XLSX) | ||
Patient ID Mapping-RADCURE patient id to OPC-RADIOMICS patient id (CSV) |