The Lung Image Database Consortium image collection (LIDC-IDRI) consists of diagnostic and lung cancer screening thoracic CT scans with marked-up annotated lesions. It is a web-accessible international resource for development, training, and evaluation of computer-assisted diagnostic (CAD) methods for lung cancer detection and diagnosis. |
Armato SG III, McLennan G, Bidaut L, McNitt-Gray MF, Meyer CR, Reeves AP, Zhao B, Aberle DR, Henschke CI, Hoffman EA, Kazerooni EA, MacMahon H, van Beek EJR, Yankelevitz D, et al.: The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): A completed reference database of lung nodules on CT scans. Medical Physics, 38: 915--931, 2011.
Important note: There was a "pilot release" of 399 cases of the LIDC CT data via the NCI CBIIT installation of NBIA. The LIDC-IDRI collection contained on The Cancer Imaging Archive (TCIA) is the complete data set of all 1,010 patients which includes all 399 pilot CT cases plus the additional 611 patient CTs and all 290 corresponding chest x-rays.
In addition to following TCIA's Citation Guidelines, if you use this data in your research please be sure to include the following attribution in any publications or grant applications along with references to appropriate LIDC publications. A listing of these publications can be found on TCIA's Related Publications page.
LIDC-IDRI Attribution:
The authors acknowledge the support of the National Cancer Institute and the Foundation for the National Institutes of Health in the creation of the free publicly available LIDC/IDRI Database used in this study.
Additional information about using this data as well as some collection meta data can be obtained in the Supporting Documentation below.
Collection Statistics |
|
---|---|
Modalities |
CT (computed tomography) |
Number of Patients |
1,010 |
Number of Studies |
1,308 |
Number of Series |
1,018 CT |
Number of Images |
244,527 |
You can view and download these images on the Cancer Imaging Archive by selecting the LIDC-IDRI collection. If you are unsure how to download this Collection view our quick guide on Searching by Collection or you can refer to our The Cancer Imaging Archive User's Guide for more detailed instructions on using the site.
More information about the Cancer Imaging Program's Program Announcement for LIDC can be found at: http://imaging.cancer.gov/programsandresources/InformationSystems/LIDC
These links help describe how to use the .XML annotation files which are packaged along with the images in the Cancer Imaging Archive. The option to include annotation files in the download is enabled by default, so the XML described here will be included when downloading the LIDC-IDRI images unless you specifically uncheck this option.
Annotation and Markup Issues/Comments
This link provides a list of available cases and the associated size of each identified nodule.
For a limited set of cases, LIDC sites were able to identify diagnostic data associated with the case. Data was collected for as many cases as possible and is associated at two levels:
At each level, data was provided as to whether the nodule was:
For each lesion, there is also information provided as to how the diagnosis was established including options such as:
You can download this Diagnosis Data at: LIDC Diagnosis Data-01-08-10.xls
Note: This data has not yet been updated to match the new patient ID structure for the LIDC-IDRI data set (it currently still uses the pilot data patient ID schema).
As part of an effort to move towards standard formats for annotation and markup a project was undertaken to convert XML data from the LIDC Pilot project into Annotated Image Markup format (AIM). AIM is a standard which was developed out of the caBIG program. More information about this effort can be found here on the NCI CBIIT wiki: LIDC Conversion to AIM.
We hope to be able to provide the entire LIDC-IDRI set of markup in AIM format at some point in the future, along with a release of the ClearCanvas open source workstation which can view these markups. However at this time the project has been placed on hold. We will update this page if/when the status of this project changes.
MAX ("multi-purpose application for XML") performs nodule matching and pmap generation based on the XML files provided with the LIDC/IDRI Database. It also performs certain QA and QC tasks and other XML-related tasks.
MAX is written in Perl and was developed under RedHat Linux. It has been run under Windows.
Downloading MAX and its associated files implies acceptance of the following notice (also available here and in the distro as a text file):
DISCLAIMER: MAX is not guaranteed to process all input correctly. Possible errors include (but are not limited to) the inability to process correctly some types of nodule ambiguity (where nodule ambiguity refers to overlap between nodule markings having complicated shapes or to overlap between a nodule marking and a non-nodule mark).
Download the distro (max-V107.tgz); view/download ReadMe.txt (a text file that is also included in the distro).