This collection contains images from 422 non-small cell lung cancer (NSCLC) patients. For these patients pretreatment CT scans, manual delineation by a radiation oncologist of the 3D volume of the gross tumor volume and clinical outcome data are available. This dataset refers to the Lung1 dataset of the study published in Nature Communications.
In short, this publication applies a radiomic approach to computed tomography data of 1,019 patients with lung or head-and-neck cancer. Radiomics refers to the comprehensive quantification of tumour phenotypes by applying a large number of quantitative image features. In present analysis 440 features quantifying tumour image intensity, shape and texture, were extracted. We found that a large number of radiomic features have prognostic power in independent data sets, many of which were not identified as significant before. Radiogenomics analysis revealed that a prognostic radiomic signature, capturing intra-tumour heterogeneity, was associated with underlying gene-expression patterns. These data suggest that radiomics identifies a general prognostic phenotype existing in both lung and head-and-neck cancer. This may have a clinical impact as imaging is routinely used in clinical practice, providing an unprecedented opportunity to improve decision-support in cancer treatment at low cost.
The dataset described here (Lung1) was used to build a prognostic radiomic signature. The Lung3 dataset used to investigate the association of radiomic imaging features with gene-expression profiles consisting of 89 NSCLC CT scans with outcome data can be found here: NSCLC-Radiomics-Genomics.
For scientific inquiries about this dataset, please contact Dr. Hugo Aerts of the Dana-Farber Cancer Institute / Harvard Medical School (firstname.lastname@example.org).
Choosing the Download option will provide you with a file to launch the TCIA Download Manager to download the entire collection. If you want to browse or filter the data to select only specific scans/studies please use the Search By Collection option.
Download all or Query/Filter
Images (DICOM, 25GB)
Lung1 clinical (CSV)
Click the Versions tab for more info about data releases.
Number of Patients
Number of Studies
Number of Series
Number of Images
Image Size (GB)
Radiation Oncologist Tumor Segmentations
The RTSTRUCT files in this data contain a manual delineation by a radiation oncologist of the 3D volume of the gross tumor volume. For viewing quickly we recommend Dicompyler (http://www.dicompyler.com/) which is an open source, cross-platform DICOM RT viewer. Slicer has a SlicerRT module (http://slicerrt.github.io/index.html) which enables use of this kind of data. The Radiotherapy DICOM toolkit may also be useful for working with this data (https://github.com/dicom/rtkit).
Please be sure to include the following citations in your work if you use this data set:
Aerts, Hugo J. W. L., Rios Velazquez, Emmanuel, Leijenaar, Ralph T. H., Parmar, Chintan, Grossmann, Patrick, Carvalho, Sara, … Lambin, Philippe. (2015). Data From NSCLC-Radiomics. The Cancer Imaging Archive. http://doi.org/10.7937/K9/TCIA.2015.PF0M9REI
Aerts, H. J. W. L., Velazquez, E. R., Leijenaar, R. T. H., Parmar, C., Grossmann, P., Cavalho, S., … Lambin, P. (2014, June 3). Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nature Communications. Nature Publishing Group.http://doi.org/10.1038/ncomms5006(link)
Clark K, Vendt B, Smith K, Freymann J, Kirby J, Koppel P, Moore S, Phillips S, Maffitt D, Pringle M, Tarbox L, Prior F. The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository, Journal of Digital Imaging, Volume 26, Number 6, December, 2013, pp 1045-1057. (paper)