Summary
In short, this publication applies a radiomic approach to computed tomography data of 1,019 patients with lung or head-and-neck cancer. Radiomics refers to the comprehensive quantification of tumour phenotypes by applying a large number of quantitative image features. In present analysis 440 features quantifying tumour image intensity, shape and texture, were extracted. We found that a large number of radiomic features have prognostic power in independent data sets, many of which were not identified as significant before. Radiogenomics analysis revealed that a prognostic radiomic signature, capturing intra-tumour heterogeneity, was associated with underlying gene-expression patterns. These data suggest that radiomics identifies a general prognostic phenotype existing in both lung and head-and-neck cancer. This may have a clinical impact as imaging is routinely used in clinical practice, providing an unprecedented opportunity to improve decision-support in cancer treatment at low cost.
The dataset described here (Lung3) was used to investigate the association of radiomic imaging features with gene-expression profiles. The Lung2 dataset used for training the radiomic biomarker and consisting of 422 NSCLC CT scans with outcome data can be found here: NSCLC-Radiomics.
For scientific inquiries about this dataset, please contact Dr. Hugo Aerts of the Dana-Farber Cancer Institute / Harvard Medical School (hugo_aerts@dfci.harvard.edu). When using this data in scientific publications or technical reports please cite the following reference:
Data Access
Imaging Data
You can view and download these images on The Cancer Imaging Archive by logging in to TCIA and selecting the NSCLC-Radiomics-Genomics collection.
Collection Statistics | (updated 2/28/2013) |
---|---|
Modalities | CT |
Number of Patients | 89 |
Number of Studies | 89 |
Number of Series | 89 |
Number of Images | 13,482 |
If you are unsure how to download this Collection please view our quick guide on Searching by Collection or refer to our The Cancer Imaging Archive User's Guide for more detailed instructions on using the site.
Shared Lists
- Coming soon - .
Gene-expression Data
Corresponding microarray data acquired for the imaging samples are available at National Center for Biotechnology Information (NCBI) Gene Expression Omnibus (Link to GEO: http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE58661). The patient names used to identify the cases on GEO are identical to those used in the DICOM files on TCIA and in the clinical data spreadsheet.
Clinical Data
Corresponding clinical data can be found here: Lung3.metadata.xls.
Restricted from Commercial Use
The data are not permitted for commercial applications. Please contact the associated data submitters with any questions about utilizing this data.