Summary

Redirect

delay	5
location	https://www.cancerimagingarchive.net/collection/lung-pet-ct-dx/


This dataset consists of CT and PET-CT DICOM images of lung cancer subjects with XML annotation files that indicate tumor location with bounding boxes. The images were retrospectively acquired from patients with suspected lung cancer who underwent standard-of-care lung biopsy and PET/CT. Subjects were grouped according to their tissue histopathological diagnosis. Patients with Names/IDs containing the letter 'A' were diagnosed with Adenocarcinoma, 'B' with Small Cell Carcinoma, 'E' with Large Cell Carcinoma, and 'G' with Squamous Cell Carcinoma.
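The letter-to-diagnosis grouping can be read directly from a subject ID. The following is a minimal sketch, assuming IDs follow the `Lung_Dx-<letter><digits>` layout seen in the version notes on this page (for example `Lung_Dx-A0251`); the function name and the regular expression are illustrative and not part of the dataset.

```python
import re

# Letter-to-diagnosis mapping as stated in the summary text.
HISTOLOGY_BY_LETTER = {
    "A": "Adenocarcinoma",
    "B": "Small Cell Carcinoma",
    "E": "Large Cell Carcinoma",
    "G": "Squamous Cell Carcinoma",
}

# Assumed ID layout: "Lung_Dx-" + one group letter + digits.
_ID_PATTERN = re.compile(r"^Lung_Dx-([A-Z])\d+$")


def histology_from_subject_id(subject_id: str) -> str:
    """Return the diagnosis group encoded in a subject ID."""
    match = _ID_PATTERN.match(subject_id.strip())
    if match is None:
        raise ValueError(f"Unrecognized subject ID: {subject_id!r}")
    letter = match.group(1)
    if letter not in HISTOLOGY_BY_LETTER:
        raise ValueError(f"No diagnosis group for letter {letter!r}")
    return HISTOLOGY_BY_LETTER[letter]
```

Raising on unknown letters, rather than returning a default, keeps mislabeled or unexpected IDs from silently entering a training set.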


Our dataset consists of three parts: raw DICOM data, JPG images transformed from raw DICOM data, and non-image data including sex, age, history, pathologist reports and, for some patients, gene expression. The images were analyzed on both the mediastinum (window width, 350 HU; level, 40 HU) and lung (window width, 1,400 HU; level, –700 HU) settings. The reconstructions were made with a 2 mm slice thickness in lung settings. The CT slice interval varies from 0.625 mm to 5 mm. Scanning modes include plain scan, contrast scan and 3D reconstruction. All the cases were confirmed by pathological diagnosis. We labeled the tumor locations in the JPG images, and the image annotations are saved as XML files in PASCAL VOC format. Users can parse the annotations using the PASCAL Development Toolkit.
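The two display settings quoted above can be expressed as a window width and level applied to Hounsfield units. The sketch below is an illustration only: it assumes pixel values have already been converted to HU, and it maps the window linearly to an 8-bit gray level. The function and constant names are hypothetical, not taken from the dataset's tooling.

```python
from typing import Tuple

# Window settings quoted in the text (width and level in HU).
MEDIASTINUM = {"width": 350.0, "level": 40.0}
LUNG = {"width": 1400.0, "level": -700.0}


def window_bounds(width: float, level: float) -> Tuple[float, float]:
    """Return the (low, high) HU limits of a display window."""
    half = width / 2.0
    return level - half, level + half


def apply_window(hu: float, width: float, level: float) -> int:
    """Clip an HU value to the window and scale it linearly to 0..255."""
    low, high = window_bounds(width, level)
    clipped = min(max(hu, low), high)
    return round((clipped - low) / (high - low) * 255)
```

With these settings the mediastinum window spans -135 to 215 HU and the lung window spans -1400 to 0 HU; values outside a window saturate to black or white.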

Before the examination, each patient fasted for at least 6 hours, and the blood glucose of each patient was less than 11 mmol/L. Whole-body emission scans were acquired 60 minutes after the intravenous injection of 18F-FDG (4.44 MBq/kg, 0.12 mCi/kg), with patients in the supine position in the PET scanner. FDG doses and uptake times were 168.72-468.79 MBq (295.8 ± 64.8 MBq) and 27-171 minutes (70.4 ± 24.9 minutes), respectively. The 18F-FDG provided had a radiochemical purity of 95%. Patients were allowed to breathe normally during PET and CT acquisitions. Attenuation correction of PET images was performed using CT data with the hybrid segmentation method, using a CT protocol of 180 mAs, 120 kV and a pitch of 1.0. Each study comprised one CT volume, one PET volume and fused PET and CT images: the CT resolution was 512 × 512 pixels at 1 mm × 1 mm, the PET resolution was 200 × 200 pixels at 4.07 mm × 4.07 mm, with a slice thickness and an interslice distance of 1 mm. Both volumes were reconstructed with the same number of slices. Three-dimensional (3D) emission and transmission scans were acquired from the base of the skull to mid femur. The PET images were reconstructed via the TrueX TOF method with a slice thickness of 1 mm.
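The weight-based dose quoted above (4.44 MBq/kg, 0.12 mCi/kg) follows from the definition 1 mCi = 37 MBq. A small sketch of that arithmetic is shown here; the function names and the 70 kg example weight are illustrative assumptions, not values from the clinical data file.

```python
MBQ_PER_MCI = 37.0  # 1 mCi = 37 MBq by definition of the curie


def mbq_to_mci(activity_mbq: float) -> float:
    """Convert an activity from megabecquerels to millicuries."""
    return activity_mbq / MBQ_PER_MCI


def injected_activity_mbq(weight_kg: float, mbq_per_kg: float = 4.44) -> float:
    """Nominal injected activity for a given body weight."""
    return weight_kg * mbq_per_kg


if __name__ == "__main__":
    per_kg_mci = mbq_to_mci(4.44)        # the per-kilogram dose in mCi
    dose = injected_activity_mbq(70.0)   # hypothetical 70 kg patient
    print(f"{per_kg_mci:.2f} mCi/kg, {dose:.1f} MBq for 70 kg")
```

A 70 kg patient would nominally receive about 310.8 MBq, which falls inside the 168.72-468.79 MBq range reported for the collection.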

The location of each tumor was annotated by five academic thoracic radiologists with expertise in lung cancer to make this dataset a useful tool and resource for developing algorithms for medical diagnosis. Two of the radiologists had more than 15 years of experience and the others had more than 5 years of experience. After one of the radiologists labeled each subject, the other four radiologists performed a verification, resulting in all five radiologists reviewing each annotation file in the dataset. Annotations were captured using LabelImg. The image annotations are saved as XML files in PASCAL VOC format, which can be parsed using the PASCAL Development Toolkit: https://pypi.org/project/pascal-voc-tools/. Python code to visualize the annotation boxes on top of the DICOM images can be downloaded here.
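For readers who prefer not to install an extra package, PASCAL VOC annotation files can also be read with Python's standard library. The sketch below assumes the conventional VOC layout (`object` elements holding a `name` and a `bndbox` with `xmin`, `ymin`, `xmax`, `ymax`); the sample XML is invented for illustration, and the files in this collection may carry additional fields.

```python
import xml.etree.ElementTree as ET
from typing import List, NamedTuple


class Box(NamedTuple):
    label: str
    xmin: int
    ymin: int
    xmax: int
    ymax: int


def parse_voc_boxes(xml_text: str) -> List[Box]:
    """Extract labeled bounding boxes from a PASCAL VOC annotation string."""
    root = ET.fromstring(xml_text)
    boxes = []
    for obj in root.iter("object"):
        bnd = obj.find("bndbox")
        if bnd is None:
            continue  # skip objects that carry no box
        label = obj.findtext("name", default="")
        # Some tools write coordinates as floats, so parse via float first.
        coords = [int(float(bnd.findtext(tag)))
                  for tag in ("xmin", "ymin", "xmax", "ymax")]
        boxes.append(Box(label, *coords))
    return boxes


# Invented sample in the conventional VOC layout (not a file from the dataset).
SAMPLE = """<annotation>
  <filename>example.jpg</filename>
  <size><width>512</width><height>512</height><depth>3</depth></size>
  <object>
    <name>A</name>
    <bndbox><xmin>100</xmin><ymin>120</ymin><xmax>180</xmax><ymax>200</ymax></bndbox>
  </object>
</annotation>"""

if __name__ == "__main__":
    for box in parse_voc_boxes(SAMPLE):
        print(box)
```

To read a file from disk, pass its contents to `parse_voc_boxes`, for example `parse_voc_boxes(open(path, encoding="utf-8").read())`.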

Two deep learning researchers used the images and the corresponding annotation files to train several well-known detection models, which reached a mean average precision (mAP) of around 0.87 on the validation set. We provide JPG images and XML annotation files in PASCAL VOC format, which is widely used in deep learning and machine learning research. The annotation files were provided by five doctors and two deep learning researchers, and all the cases were confirmed by pathology, which supports the precision and ease of use of the dataset. The dataset can serve as a data resource for developing deep-learning-based medical diagnosis algorithms and as a tool for promoting medical diagnosis.
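Detection scores of this kind are typically built on the intersection-over-union (IoU) between a predicted box and an annotated box. The following is a minimal sketch of that building block, assuming boxes are given as `(xmin, ymin, xmax, ymax)` tuples in pixel coordinates; it is not the evaluation code used by the submitters.

```python
from typing import Tuple

BoxXYXY = Tuple[float, float, float, float]  # (xmin, ymin, xmax, ymax)


def iou(a: BoxXYXY, b: BoxXYXY) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    # Corners of the overlapping rectangle, if any.
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```

A prediction is usually counted as correct when its IoU with an annotated box exceeds a chosen threshold; the threshold used for the reported score is not stated on this page.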

Acknowledgements

We would like to acknowledge the individuals and institutions that have provided data for this collection:

Drs. Huiping Han, Funing Yang and Rui Wang for their help collecting data
The Computer Center and Cancer Institute at the Second Affiliated Hospital of Harbin Medical University in Harbin, Heilongjiang Province, China for their help collecting the image data
Beijing Municipal Administration of Hospital Clinical Medicine Development of Special Funding (ZYLX201511)


Data Access

Data Type	Download all or Query/Filter	License
Images (DICOM, 127.2 GB)	https://wiki.cancerimagingarchive.net/download/attachments/70224216/Lung-PET-CT-Dx-NBIA-Manifest-122220.tcia?version=1&modificationDate=1608669250614&api=v2; Search: https://nbia.cancerimagingarchive.net/nbia-search/?MinNumberOfStudiesCriteria=1&CollectionCriteria=Lung-PET-CT-Dx (Download requires the NBIA Data Retriever)	CC BY 4.0
Annotation Files (XML, 17.26 MB)	https://wiki.cancerimagingarchive.net/download/attachments/70224216/Lung-PET-CT-Dx-Annotations-XML-Files-rev12222020.zip?version=1&modificationDate=1609346850424&api=v2	CC BY 4.0
Clinical Data (XLSX, 36 KB)	https://wiki.cancerimagingarchive.net/download/attachments/70224216/statistics-clinical-20201221.xlsx?version=1&modificationDate=1608654729514&api=v2	CC BY 4.0

Click the Versions tab for more info about data releases. Please contact help@cancerimagingarchive.net with any questions regarding usage.

Additional Resources for this Dataset

The NCI Cancer Research Data Commons (CRDC) provides access to additional data and a cloud-based data science infrastructure that connects data sets with analytics tools to allow users to share, integrate, analyze, and visualize cancer research data.

Imaging Data Commons (IDC) (Imaging Data)

In addition, the following external resources have been made available by the data submitters. These are not hosted or supported by TCIA, but may be useful to researchers utilizing this collection.

Annotations were captured using LabelImg
The image annotations are saved as XML files in PASCAL VOC format, which can be parsed using the PASCAL Development Toolkit: https://pypi.org/project/pascal-voc-tools/
Python code to visualize the annotation boxes on top of the DICOM images can be downloaded here.


Detailed Description

Image Statistics	Radiology Image Statistics
Modalities	CT, PT
Number of Participants	355
Number of Studies	436
Number of Series	1,295
Number of Images	251,135
Images Size (GB)	127.2

Other Publications Using This Data

TCIA maintains a list of publications which leverage TCIA data. If you have a manuscript you'd like to add please contact the TCIA Helpdesk.


Citations & Data Usage Policy


These collections are freely available to browse, download, and use for commercial, scientific and educational purposes as outlined in the Creative Commons Attribution 4.0 International License. Questions may be directed to help@cancerimagingarchive.net. Please be sure to acknowledge both this data set and TCIA in publications by including the following citations in your work:

Data Citation
Li, P., Wang, S., Li, T., Lu, J., HuangFu, Y., & Wang, D. (2020). A Large-Scale CT and PET/CT Dataset for Lung Cancer Diagnosis (Lung-PET-CT-Dx) [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/TCIA.2020.NNC2-0461

In addition to the dataset citation above, please be sure to cite the following if you utilize these data in your research:

TCIA Citation
Clark K, Vendt B, Smith K, Freymann J, Kirby J, Koppel P, Moore S, Phillips S, Maffitt D, Pringle M, Tarbox L, Prior F. (2013) The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository, Journal of Digital Imaging, Volume 26, Number 6, December 2013, pp 1045-1057. DOI: 10.1007/s10278-013-9622-7

Info

title	Acknowledgement

Added new subjects

Version 5 (Current): Updated 2020/12/22

Data Type	Download all or Query/Filter
Images (DICOM, 127.2 GB)	https://wiki.cancerimagingarchive.net/download/attachments/70224216/Lung-PET-CT-Dx-NBIA-Manifest-122220.tcia?version=1&modificationDate=1608669250614&api=v2; Search: https://nbia.cancerimagingarchive.net/nbia-search/?MinNumberOfStudiesCriteria=1&CollectionCriteria=Lung-PET-CT-Dx (Download requires the NBIA Data Retriever)
Annotation Files (XML, 17.26 MB)	https://wiki.cancerimagingarchive.net/download/attachments/70224216/Lung-PET-CT-Dx-Annotations-XML-Files-rev12222020.zip?version=1&modificationDate=1609346850424&api=v2
Clinical Data (XLSX, 36 KB)	https://wiki.cancerimagingarchive.net/download/attachments/70224216/statistics-clinical-20201221.xlsx?version=1&modificationDate=1608654729514&api=v2

Clinical data has been added for all 355 subjects.

Eight subjects were removed from the dataset because the submitting site determined that they required further medical examinations to make an accurate diagnosis.

Version 4: Updated 2020/10/16

Data Type	Download all or Query/Filter
Images (DICOM, 132 GB)	https://wiki.cancerimagingarchive.net/download/attachments/70224216/Lung-PET-CT-Dx-NBIA-manifest-07242020.tcia?api=v2 (Download requires the NBIA Data Retriever)
Annotation Files (XML, 17.26 MB)	https://wiki.cancerimagingarchive.net/download/attachments/70224216/Lung-PET-CT-Dx-Annotations-XML-Files-rev10152020.zip?version=1&modificationDate=1603823290007&api=v2

Annotation files were corrected and updated at the request of the submitting site.

Version 3: Updated 2020/07/24

Data Type	Download all or Query/Filter
Images (DICOM, 132 GB)	https://wiki.cancerimagingarchive.net/download/attachments/70224216/Lung-PET-CT-Dx-NBIA-manifest-07242020.tcia?api=v2; Search: https://nbia.cancerimagingarchive.net/nbia-search/?MinNumberOfStudiesCriteria=1&CollectionCriteria=Lung-PET-CT-Dx (Download requires the NBIA Data Retriever)
Annotation Files (XML, 14.62 MB)	https://wiki.cancerimagingarchive.net/download/attachments/70224216/Lung-PET-CT-Dx-Annotations-XML-Files-rev07142020.zip?version=1&modificationDate=1594757790879&api=v2

PET scans have been added for 140 subjects.

Version 2: Updated 2020/07/14

Data Type	Download all or Query/Filter
Images (DICOM, 128 GB)	https://wiki.cancerimagingarchive.net/download/attachments/70224216/Lung-PET-CT-Dx-NBIA-manifest-07152020.tcia?version=1&modificationDate=1594821252813&api=v2; Search: https://nbia.cancerimagingarchive.net/nbia-search/?MinNumberOfStudiesCriteria=1&CollectionCriteria=Lung-PET-CT-Dx (Requires NBIA Data Retriever)
Annotation Files (XML, 14.62 MB)	https://wiki.cancerimagingarchive.net/download/attachments/70224216/Lung-PET-CT-Dx-Annotations-XML-Files-rev07142020.zip?version=1&modificationDate=1594757790879&api=v2

After publication of this dataset, the submitter notified us that the data for Subject Lung_Dx-A0266 really belonged to Subject Lung_Dx-A0251 and that Subject Lung_Dx-A0266 should not exist in the collection. Version 2 corrects this issue.

Version 1: Updated 2020/06/17

Data Type	Download all or Query/Filter
Images (DICOM, 128 GB)	Unavailable, see version 2 note.
Annotation Files (XML, 14.62 MB)	Unavailable, see version 2 note.
