Armato SG III, McLennan G, Bidaut L, McNitt-Gray MF, Meyer CR, Reeves AP, Zhao B, Aberle DR, Henschke CI, Hoffman EA, Kazerooni EA, MacMahon H, van Beek EJR, Yankelevitz D, et al.: The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): A completed reference database of lung nodules on CT scans. Medical Physics, 38: 915--931, 2011.
Important note: There was a "pilot release" of 399 cases of the LIDC CT data via the NCI CBIIT installation of NBIA. This LIDC-IDRI collection is the complete data set of all 1,010 patients which includes all 399 pilot CT cases plus the additional 611 patient CTs and all 290 corresponding chest x-rays.
If you use this data in your research please be sure that the LIDC/IDRI Database is mentioned in any publications or grant applications along with references to appropriate LIDC publications. Additional information about using this data as well as some collection meta data can be obtained in the Supporting Documentation below.
CT (computed tomography)
Number of Patients
Number of Studies
Number of Series
Number of Images
You can view and download these images on the Cancer Imaging Archive by selecting the LIDC-IDRI collection. If you are unsure how to download this Collection view our quick guide on Searching by Collection or you can refer to our The Cancer Imaging Archive User's Guide for more detailed instructions on using the site.
Annotation and Markup Issues/Comments
- We recently (early 2009) discovered that, for For a subset of approximately 100 cases from among the initial 399 cases released, inconsistent rating systems were used among the 5 sites with regard to the spiculation and lobulation characteristics of lesions identified as nodules > 3 mm. We are in the process of correcting this situation. The XML nodule characteristics data as it exists for some cases will be impacted by this error. We apologize for any inconvenience.
- Also note that the XML files do not store radiologist annotations in a manner that allows for a comparison of individual radiologist reads across cases (i.e., the first reader recorded in the XML file of one CT scan will not necessarily be the same radiologist as the first reader recorded in the XML file of another CT scan).
- March 2010: Contrary to previous documentation, the correct ordering for the subjective nodule lobulation and nodule spiculation rating scales stored in the XML files is 1=none to 5=marked. The issue of consistency noted in issue 1 still remains to be corrected.
You can download this Diagnosis Data at: LIDC Diagnosis Data-01-08-10.xls
Note: This data has not yet been updated to match the new patient ID structure for the LIDC-IDRI data set (it currently still uses the pilot data patient ID schema).
AIM Annotation Conversion Project
As part of an effort to move towards standard formats for annotation and markup a project has been undertaken to convert this data from the LIDC Pilot project into Annotated Image Markup format (AIM). AIM is a standard which was developed out of the caBIG program. More information about this effort can be found here on the NCI CBIIT wiki: LIDC Conversion to AIM.
In the near future we will be providing the entire LIDC-IDRI data set in AIM format available for download here on the TCIA wiki.
MAX ("multi-purpose application for XML") performs nodule matching and pmap generation based on the XML files provided with the LIDC/IDRI Database. It also performs certain QA and QC tasks and other XML-related tasks.