Summary
This dataset contains image annotations derived from "The Clinical Proteomic Tumor Analysis Consortium Pancreatic Ductal Adenocarcinoma Collection (CPTAC-PDA)”. This dataset was generated as part of a National Cancer Institute project to augment images from The Cancer Imaging Archive with tumor annotations that will improve their value for cancer researchers and artificial intelligence experts.
Annotation Protocol
For each patient, all scans were reviewed to identify and annotate the clinically relevant time points and sequences/series. In a typical patient all available time points were annotated. The following annotation rules were followed:
- PERCIST criteria was followed for PET imaging. Specifically, the lesions estimated to have the most elevated SUVmax were annotated.
- RECIST 1.1 was otherwise generally followed for MR and CT imaging. A maximum of 5 lesions were annotated per patient scan (timepoint); no more than 2 per organ. The same 5 lesions were annotated at each time point. Lymph nodes were annotated if >1 cm in short axis. Other lesions were annotated if >1 cm. If the primary lesion measures < 1 cm, it was still annotated.
- Three-dimensional segmentations of lesions were created in the axial plane. If no axial plane was available, lesions were annotated in the coronal plane.
- MRIs were annotated using axial T1-weighted post contrast sequences that best demonstrated the tumor.
- CTs were annotated using all axial post contrast series. If not available, the axial non-contrast series were annotated.
- PET/CTs were annotated on the CT and attenuation corrected PET images, unless there was a diagnostic CT from the same time point, in which case the CT portion of the PET/CT was not annotated.
- Lesions were labeled separately.
- Seed points were automatically generated, but reviewed by a radiologist.
- A “negative” annotation was created for any exam without findings.
At each time point:
- Volume calculations were performed for each segmented structure. These calculations are provided in the Annotation Metadata CSV.
- A seed point (kernel) was created for each segmented structure. The seed points for each segmentation are provided in a separate DICOM RTSTRUCT file.
- SNOMED-CT “Anatomic Region Sequence” and “Segmented Property Category Code Sequence” and codes were inserted for all segmented structures.
- Imaging time point codes were inserted to help identify each annotation in the context of the clinical trial assessment protocol.
- “Clinical Trial Time Point ID” was used to encode time point type using one of the following strings as applicable: “pre-dose” or “post-chemotherapy”.
- Content Item in “Acquisition Context Sequence” was added containing "Time Point Type" using Concept Code Sequence (0040,A168) selected from:
- (255235001, SCT, “Pre-dose”)
- (719864002, SCT, "Post-cancer treatment monitoring")
Important supplementary information and sample code
- A spreadsheet containing key details about the annotations is available in the Data Access section below.
- A Jupyter notebook demonstrating how to use the NBIA Data Retriever Command-Line Interface application and the REST API to access these data can be found in the Additional Resources section below.
Data Access
Data Type | Download all or Query/Filter | License |
---|---|---|
CPTAC-PDA Annotations - Segmentations, Seed Points, and Negative Findings Assessments (DICOM, 0.01 GB) | (Download requires NBIA Data Retriever) | |
CPTAC-PDA Annotation Metadata (CSV) |
Click the Versions tab for more info about data releases.
Additional Resources for this Dataset
- The NCI Cancer Research Data Commons (CRDC) provides access to proteomic, genomic and clinical data related to these subjects.
- Imaging Data Commons (IDC) (Imaging Data)
- Proteomic Data Commons (PDC) (Proteomic & Clinical Data)
- Genomic Data Commons (GDC) (Genomic & Clinical Data)
- Jupyter notebook demonstrating how to use the NBIA Data Retriever Command-Line Interface application and REST API to access these data
- Instructions for Visualizing these data in 3D Slicer
Collections Used in this Third Party Analysis
Below is a list of the Collections used in these analyses:
Source Data Type | Download | License |
---|---|---|
Original CPTAC-PDA Images used to create Segmentations and Seed Points (DICOM, 18.0 GB) | (Download requires NBIA Data Retriever) | |
Original CPTAC-PDA Images used to create Negative Assessment reports (DICOM, 1.54 GB) | (Download requires NBIA Data Retriever) |
Detailed Description
Image Statistics | |
---|---|
Modalities | RTSTRUCT |
Number of Patients | 103 |
Number of Studies | 119 |
Number of Series | 534 |
Number of Images | 534 |
Images Size (GB) | 0.01 GB |
Citations & Data Usage Policy
Users must abide by the TCIA Data Usage Policy and Restrictions. Attribution should include references to the following citations:
Data Citation
Rozenfeld, M., & Jordan, P. (2023). Annotations for The Clinical Proteomic Tumor Analysis Consortium Pancreatic Ductal Adenocarcinoma Collection (CPTAC-PDA-Tumor-Annotations) (Version 1) [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/BW9V-BX61
TCIA Citation
Clark, K., Vendt, B., Smith, K., Freymann, J., Kirby, J., Koppel, P., Moore, S., Phillips, S., Maffitt, D., Pringle, M., Tarbox, L., & Prior, F. (2013). The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository. In Journal of Digital Imaging (Vol. 26, Issue 6, pp. 1045–1057). Springer Science and Business Media LLC. https://doi.org/10.1007/s10278-013-9622-7
Other Publications Using This Data
TCIA maintains a list of publications which leverage TCIA data. If you have a manuscript you'd like to add please contact the TCIA Helpdesk.
Version 1 (Current): Updated 2023/07/24
Data Type | Download all or Query/Filter | License |
---|---|---|
CPTAC-PDA Annotations - Segmentations, Seed Points, and Negative Findings Assessments (DICOM, 0.01 GB) | (Download requires NBIA Data Retriever) | |
CPTAC-PDA Annotation Metadata (CSV) | ||
Original CPTAC-PDA Images used to create Segmentations and Seed Points (DICOM, 18.0 GB) | (Download requires NBIA Data Retriever) | |
Original CPTAC-PDA Images used to create Negative Assessment reports (DICOM, 1.54 GB) | (Download requires NBIA Data Retriever) |