
The Lung Image Database Consortium image collection (LIDC-IDRI) consists of diagnostic and lung cancer screening thoracic computed tomography (CT) scans with marked-up annotated lesions. It is a web-accessible international resource for development, training, and evaluation of computer-assisted diagnostic (CAD) methods for lung cancer detection and diagnosis.

The following paper published in Medical Physics is effectively the user's manual for the database and should be cited as follows in all manuscripts that make use of the database:

Armato SG III, McLennan G, Bidaut L, McNitt-Gray MF, Meyer CR, Reeves AP, Zhao B, Aberle DR, Henschke CI, Hoffman EA, Kazerooni EA, MacMahon H, van Beek EJR, Yankelevitz D, et al.: The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): A completed reference database of lung nodules on CT scans. Medical Physics, 38: 915-931, 2011.

In addition to following The Cancer Imaging Archive's (TCIA’s) Citation Guidelines, if you use this data in your research, please be sure to include the following attribution in any publications or grant applications along with references to appropriate LIDC publications. A listing of these publications can be found on TCIA's Related Publications page.

LIDC-IDRI Attribution:

The authors acknowledge the National Cancer Institute and the Foundation for the National Institutes of Health, and their critical role in the creation of the free publicly available LIDC/IDRI Database used in this study.

Additional information about using this data, as well as some collection metadata, can be obtained on the Lung Image Database Consortium research page.

Important note: There was a "pilot release" of 399 cases of the LIDC CT data via the NCI CBIIT installation of NBIA. The LIDC-IDRI collection on TCIA is the complete data set of all 1,010 patients: the 399 pilot CT cases, the additional 611 patient CTs, and all 290 corresponding chest x-rays. For those who obtained and analyzed the older data, a table that maps the old NBIA IDs to the new TCIA IDs can be downloaded.
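For readers who analyzed the pilot release, the following is a minimal sketch of how per-case results keyed by old NBIA IDs might be re-keyed to the new TCIA IDs. It assumes the mapping table has been saved as CSV. The column names old_nbia_id and new_tcia_id and the sample rows are hypothetical placeholders, not the headers or identifiers of the actual table.

```python
import csv
import io


def load_id_mapping(handle, old_col="old_nbia_id", new_col="new_tcia_id"):
    """Return a dict mapping old NBIA IDs to new TCIA IDs.

    The default column names are hypothetical; change them to match the
    headers of the mapping table you actually downloaded.
    """
    mapping = {}
    for row in csv.DictReader(handle):
        mapping[row[old_col].strip()] = row[new_col].strip()
    return mapping


def relabel(results, mapping):
    """Re-key per-case results from old IDs to new IDs.

    Cases with no mapping entry are reported separately, not silently dropped.
    """
    relabeled, unmapped = {}, []
    for old_id, value in results.items():
        if old_id in mapping:
            relabeled[mapping[old_id]] = value
        else:
            unmapped.append(old_id)
    return relabeled, unmapped


# Made-up rows for illustration only; these are not real LIDC identifiers.
sample = io.StringIO("old_nbia_id,new_tcia_id\nOLD-A,NEW-1\nOLD-B,NEW-2\n")
mapping = load_id_mapping(sample)
relabeled, unmapped = relabel({"OLD-A": 3, "OLD-C": 5}, mapping)
print(relabeled, unmapped)
```

Returning the unmapped IDs alongside the relabeled results makes it easier to spot cases that need manual review.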

Data Access

Imaging Data

You can view and download these images on TCIA by selecting the LIDC-IDRI collection.

Collection Statistics

Modalities: CT (computed tomography), DX (digital radiography), CR (computed radiography)
Number of Patients: 1,010
Number of Studies:
Number of Series: 1,018 CT; 290 CR/DX
Number of Images:
Image Size (GB): 124

If you are unsure how to download this collection, please view Searching by Collection or refer to the TCIA User's Guide for more detailed instructions on using the site.

Note: On 2012-03-21 the XML associated with patient LIDC-IDRI-0101 was updated with a corrected version of the file. The old version is still available via the wiki if needed for audit purposes.

Note: Prior to 2015-07-27, many of the series in the LIDC-IDRI collection had inconsistent values in the DICOM Frame of Reference UID, tag (0020,0052). Each image had a unique Frame of Reference value, whereas the value should be consistent across a series. This has been corrected. In addition, the following tags, which were present but should not have been, were removed: (0020,0200) Synchronization Frame of Reference, (3006,0024) Referenced Frame of Reference, and (3006,00c2) Related Frame of Reference.
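Users holding a copy downloaded before the correction may want to check whether their series are affected. The sketch below is one possible check, not a TCIA tool: it groups (Series Instance UID, Frame of Reference UID) pairs by series and reports any series with more than one Frame of Reference value. It assumes the pairs have already been read from the image headers, for example with a DICOM library. The UIDs shown are placeholders.

```python
from collections import defaultdict


def find_inconsistent_series(records):
    """Return series that carry more than one Frame of Reference UID.

    records: iterable of (series_uid, frame_of_reference_uid) pairs, one per
    image. With pydicom, for example, these could come from the
    SeriesInstanceUID and FrameOfReferenceUID attributes of each dataset.
    Returns a dict of series UID -> set of distinct Frame of Reference UIDs.
    """
    frames_by_series = defaultdict(set)
    for series_uid, frame_uid in records:
        frames_by_series[series_uid].add(frame_uid)
    return {s: f for s, f in frames_by_series.items() if len(f) > 1}


# Placeholder values for illustration; real UIDs are dotted numeric strings.
records = [
    ("series-1", "frame-A"),
    ("series-1", "frame-A"),
    ("series-2", "frame-B"),
    ("series-2", "frame-C"),  # differs within the series, so it is flagged
]
bad = find_inconsistent_series(records)
print(sorted(bad))
```

An empty result means every series has a single Frame of Reference UID, which is the expected state of the corrected data.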



This collection contains a great deal of supporting documentation that was generated by members of the LIDC. It can be found on the LIDC research page. A subset of the metadata (XML annotation/markup info) is also contained alongside the images in TCIA and will be automatically downloaded with the images unless specified to be excluded.
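Because the XML annotation files arrive alongside the images, it can help to separate the two before processing. The following is a minimal sketch, assuming the annotation files carry an .xml extension and that everything else under the download directory is image data. The function name split_download and the directory layout are illustrative assumptions, not part of TCIA's tooling.

```python
from pathlib import Path


def split_download(root):
    """Return (xml_files, other_files) found recursively under root.

    Assumes annotation files end in .xml (case-insensitive); every other
    regular file is treated as image data.
    """
    xml_files, other_files = [], []
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file():
            continue  # skip directories
        if path.suffix.lower() == ".xml":
            xml_files.append(path)
        else:
            other_files.append(path)
    return xml_files, other_files
```

Calling split_download on the download directory returns two sorted lists of paths, which can then be passed to an XML parser and a DICOM reader respectively.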