Versions Compared


  • This line was added.
  • This line was removed.
  • Formatting was changed.



Medical image biomarkers of cancer promise improvements in patient care through advances in precision medicine. Compared to genomic biomarkers, image biomarkers provide the advantages of being a non-invasive procedure, and characterizing a heterogeneous tumor in its entirety, as opposed to limited tissue available for biopsy. We developed a unique radiogenomic dataset from a Non-Small Cell Lung Cancer (NSCLC) cohort of 211 subjects. The dataset comprises Computed Tomography (CT), Positron Emission Tomography (PET)/CT images, semantic annotations of the tumors as observed on the medical images using a controlled vocabulary, segmentation maps of tumors in the CT scans, and quantitative values obtained from the PET/CT scans. Imaging data are also paired with gene mutation, RNA sequencing data from samples of surgically excised tumor tissue, and clinical data, including survival outcomes. This dataset was created to facilitate the discovery of the underlying relationship between genomic and medical image features, as well as the development and evaluation of prognostic medical image biomarkers.

Further details regarding this data-set may be found in Bakr, et. al, Sci Data. 2018 Oct 16;5:180202. doi: 10.1038/sdata.2018.202, Please direct additional inquiries to Drs. Sandy Napel ( or Sylvia K. Plevritis (

Localtab Group

titleData Access

Data Access

Click the  Download button to save a ".tcia" manifest file to your computer, which you must open with the NBIA Data Retriever . Click the Search button to open our Data Portal, where you can browse the data collection and/or download a subset of its contents.

Data TypeDownload all or Query/Filter
Images and Segmentations (DICOM, 97.6 GB)
AIM Annotations (XML, zip)

Clinical Data (csv)

RNA sequence data (web)

Note: 130 subject subset

Click the Versions tab for more info about data releases.

Third Party Analyses of this Dataset

TCIA encourages the community to publish your analyses of our datasets . Below is a list of such third party analyses published using this Collection:

titleDetailed Description

Detailed Description

Collection Statistics



Number of Participants


Number of Studies


Number of Series


Number of Images


Image Size (GB)97.6

This collection was originally submitted to TCIA as a 26 subject pilot data set. You can learn more about that subset of the collection in the following Analysis Results publication:

titleData Citation

Napel, Sandy, & Plevritis, Sylvia K. (2014). NSCLC Radiogenomics: Initial Stanford Study of 26 Cases. The Cancer Imaging Archive.

titleCitations & Data Usage Policy

Citations & Data Usage Policy 

This collection is freely available to browse, download, and use for commercial, scientific and educational purposes as outlined in the Creative Commons Attribution 3.0 Unported License.  See TCIA's Data Usage Policies and Restrictions for additional details. Questions may be directed to

Please be sure to include the following citations in your work if you use this data set:

Public collection license
titleData Citation

Bakr, Shaimaa; Gevaert, Olivier; Echegaray, Sebastian; Ayers, Kelsey; Zhou, Mu; Shafiq, Majid; Zheng, Hong; Zhang, Weiruo; Leung, Ann; Kadoch, Michael; Shrager, Joseph; Quon, Andrew; Rubin, Daniel; Plevritis, Sylvia; Napel, Sandy.(2017). Data for NSCLC Radiogenomics Collection. The Cancer Imaging Archive.

titlePublication Citation

Primary publication coming soon

titlePublication Citation

Gevaert, O., Xu, J., Hoang, C. D., Leung, A. N., Xu, Y., Quon, A., … Plevritis, S. K. (2012, August). Non–Small Cell Lung Cancer: Identifying Prognostic Imaging Biomarkers by Leveraging Public Gene Expression Microarray Data—Methods and Preliminary Results. Radiology. Radiological Society of North America (RSNA).

titleTCIA Citation

Clark K, Vendt B, Smith K, Freymann J, Kirby J, Koppel P, Moore S, Phillips S, Maffitt D, Pringle M, Tarbox L, Prior F. The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository, Journal of Digital Imaging, Volume 26, Number 6, December, 2013, pp 1045-1057. (paper)

Other Publications Using This Data

If you have a publication you'd like to add, please contact the TCIA Helpdesk.


Version 2 (Current): Updated 2017/02/28

Data Type

Download all or Query/Filter

Images (DICOM, 97.6 GB)

(Requires the NBIA Data Retriever .)

AIM Annotations (XML, zip)

Clinical Data (csv)

Version 1: Updated 2015/12/22

Data Type

This collection was originally submitted to TCIA as a 26 subject pilot data set. You can learn more about that subset of the collection in the following Analysis Results publication:

NSCLC Radiogenomics: Initial Stanford Study of 26 Cases