Skip to end of metadata
Go to start of metadata

Summary

Background: The aggressive and heterogeneous nature of lung cancer has thwarted efforts to reduce mortality from this cancer through the use of screening. The advent of low-dose helical computed tomography (CT) altered the landscape of lung-cancer screening, with studies indicating that low-dose CT detects many tumors at early stages. The National Lung Screening Trial (NLST) was conducted to determine whether screening with low-dose CT could reduce mortality from lung cancer.

Methods: From August 2002 through April 2004, we enrolled 53,454 persons at high risk for lung cancer at 33 U.S. medical centers. Participants were randomly assigned to undergo three annual screenings with either low-dose CT (26,722 participants) or single-view posteroanterior chest radiography (26,732). Data were collected on cases of lung cancer and deaths from lung cancer that occurred through December 31, 2009.  This dataset includes the low-dose CT scans from 26,254 of these subjects, as well as digitized histopathology images from 451 subjects.

Results: The rate of adherence to screening was more than 90%. The rate of positive screening tests was 24.2% with low-dose CT and 6.9% with radiography over all three rounds. A total of 96.4% of the positive screening results in the low-dose CT group and 94.5% in the radiography group were false positive results. The incidence of lung cancer was 645 cases per 100,000 person-years (1060 cancers) in the low-dose CT group, as compared with 572 cases per 100,000 person-years (941 cancers) in the radiography group (rate ratio, 1.13; 95% confidence interval [CI], 1.03 to 1.23). There were 247 deaths from lung cancer per 100,000 person-years in the low-dose CT group and 309 deaths per 100,000 person-years in the radiography group, representing a relative reduction in mortality from lung cancer with low-dose CT screening of 20.0% (95% CI, 6.8 to 26.7; P=0.004). The rate of death from any cause was reduced in the low-dose CT group, as compared with the radiography group, by 6.7% (95% CI, 1.2 to 13.6; P=0.02).

Conclusions: Screening with the use of low-dose CT reduces mortality from lung cancer. (Funded by the National Cancer Institute; National Lung Screening Trial ClinicalTrials.gov number, NCT00047385).

Data Availability: A summary of the National Lung Screening Trial and its available datasets are provided on the Cancer Data Access System (CDAS). CDAS is maintained by Information Management System (IMS), contracted by the National Cancer Institute (NCI) as keepers and statistical analyzers of the NLST trial data.  The full clinical data set from NLST is available through CDAS.  Users of TCIA can download without restriction a publicly distributable subset of that clinical data, along with the CT and Histopathology images collected during the trial.  (These previously were restricted.)

Data Access


Data TypeDownload all or Query/Filter

CT Images (DICOM, 11.3 TB)

   

This link downloads the entire collection, which is quite large.
See the Detailed Description tab for options to download the collection in smaller chunks.


(Download requires the NBIA Data Retriever)

Primary Tumor
Tissue Slide Images (SVS, 775 GB)

Additional images are available: See Detailed Description.

(Download and apply the IBM-Aspera-Connect plugin to your browser) 

Clinical data (ZIP, 25 MB)

(more info)  

Provided in SAS format in one compressed file (.zip); 
includes data and dictionaries.

This is a subset of the full clinical data.  If you need the full clinical data,
please visit the Cancer Data Access System (CDAS) system.

Click the Versions tab for more info about data releases.

Please contact help@cancerimagingarchive.net  with any questions regarding usage.

Detailed Description

Collection Statistics

Radiology

Pathology

Modalities

 CT

Aperio

Number of Patients

 26,254

451

Number of Studies

73,118


Number of Series

203,099


Number of Images

21,082,502

1,225 (optionally + 4 + 23)
Images Size (TB)11.3 TB775 GB

The full CT data (manifest-NLST_allCT.tcia) occupy 11.3 terabytes when downloaded. For convenience, you can "Search" to access all the files, or you can download in chunks ordered only by Participant identifier (2,500 participants each):

  1. TCIA_NLST2500pid_01_20511series_888GB.tcia
  2. TCIA_NLST2500pid_02_20360series_878GB.tcia
  3. TCIA_NLST2500pid_03_20182series_868GB.tcia
  4. TCIA_NLST2500pid_04_20900series_899GB.tcia
  5. TCIA_NLST2500pid_05_20504series_891GB.tcia
  6. TCIA_NLST2500pid_06_18941series_820GB.tcia
  7. TCIA_NLST2500pid_07_20060series_957GB.tcia
  8. TCIA_NLST2500pid_08_17148series_1168GB.tcia
  9. TCIA_NLST2500pid_09_17064series_1172GB.tcia
  10. TCIA_NLST2500pid_10_17214series_1171GB.tcia
  11. TCIA_NLST_1254pid_11_8809series_600GB.tcia

The pathology slide data: 

  1. Primary Tumor slides (faspex) Primary Tumor slides (the standard package), 1225 files.

  2. Additional slides (faspex) Additional histopathology slide images for which the participants have no Baseline Questionnaire data (4 slides) Detail in Table 1.

  3. Second Primary-Tumor slides (faspex) Additional histopathology slide images for participants with Second Primary Tumors as well as those included in the "standard" package (23 slides) Detail in Table 2

NLST Design & Process, Protocol Documents, and Results: https://cdas.cancer.gov/learn/nlst/main-findings/

NLST Data Collected: https://biometry.nci.nih.gov/cdas/learn/nlst/data-collected/

  • Questionnaires, screening, diagnostic procedures, cancer diagnosis, treatment, progression, mortality, contamination.

Biospecimens Collected

Formalin-fixed paraffin embedded (FFPE) tissue specimens are available for a subset of the NLST participants who developed lung cancer during the trial. Donor blocks were obtained from local pathology laboratories and tissue cores (0.6mm) were extracted from them to construct tissue microarrays (TMA). Tissue cores were sampled from primary main invasive tumor histology, secondary tumor histology, carcinoma in situ, adjacent normal lung tissue, metastatic lesion from lymph node(s) and/or distant sites, benign (un-involved) lymph node, proximal and/or distal bronchi.

In total, tissue materials were collected from 438 lung cancer cases. All have cores arrayed across nine TMAs, one of which only contains tissue collected after neoadjuvant treatment. 434 of these also have loose cores available for nucleic acid extraction. On average, each TMA contains 504 cores from 48 subjects.

Applications for access to these specimens can be submitted under the PLCO Etiologic and Early Marker Studies Program (EEMS). The application review process opens twice a year, once in the winter and once in the summer. For more information about EEMS and to initiate an application visit the PLCO EEMS Application page. When filling out the application, specify “NLST Tissue” under the case definition.


Citations & Data Usage Policy 

Users of this data must abide by the TCIA Data Usage Policy and the Creative Commons Attribution 4.0 International License under which it has been published. Attribution should include references to the following citations:

Dataset Citation

 National Lung Screening Trial Research Team. (2013). Data from the National Lung Screening Trial (NLST) [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/TCIA.HMQ8-J677

Publication Citation

National Lung Screening Trial Research Team*; Aberle DR, Adams AM, Berg CD, Black WC, Clapp JD, Fagerstrom RM, Gareen IF, Gatsonis C, Marcus PM, Sicks JD (2011). Reduced Lung-Cancer Mortality with Low-Dose Computed Tomographic Screening. New England Journal of Medicine, 365(5), 395–409. https://doi.org/10.1056/nejmoa1102873


*note:  List of National Lung Screening Trial members (pages 1-31 of this supplemental PDF to this article)

TCIA Citation

Clark K, Vendt B, Smith K, Freymann J, Kirby J, Koppel P, Moore S, Phillips S, Maffitt D, Pringle M, Tarbox L, Prior F. The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository, Journal of Digital Imaging, Volume 26, Number 6, December, 2013, pp 1045-1057. DOI: https://doi.org/10.1007/s10278-013-9622-7

Other Publications Using This Data

TCIA maintains a list of publications which leverage TCIA data. If you have a manuscript you'd like to add please contact the TCIA Helpdesk.

IMS/CDAS maintains a separate list of publications related to NLST data: https://cdas.cancer.gov/publications/?study=nlst 

Version 3 (Current) : Updated 2021/09/24

Data TypeDownload all or Query/Filter

CT Images (DICOM, 11.3 TB)

   

This link downloads the entire collection, which is quite large, as legacy single frame images.
See the Detailed Description tab for options to download the collection in smaller chunks.


(Download requires the NBIA Data Retriever)

Primary Tumor
Tissue Slide Images (SVS, 775 GB)

Additional images are available: See Detailed Description.

(Download and apply the IBM-Aspera-Connect plugin to your browser) 

Clinical data (ZIP, 25 MB)

(more info)  

Provided in SAS format in one compressed file (.zip); 
includes data and dictionaries.

This is a subset of the full clinical data.  If you need the full clinical data,
please visit the Cancer Data Access System (CDAS) system.



Data embargo of limited access is lifted September 2021, with the addition of downloadable pathology slide data and clinical data spreadsheet & dictionaries.

Version 2: Updated 2015/12/14

Data TypeDownload all or Query/Filter
Images (DICOM, 11.3TB)

 

Change: restoration of images that had become corrupted/missing during a storage transfer.

Version 1 (Current): Updated 2013/03/01

Data TypeDownload all or Query/Filter
Images (DICOM, 11.3TB)

 

  • No labels