Summary
This dataset contains image annotations derived from the NCI Clinical Trial "Vincristine, Dactinomycin, and Doxorubicin With or Without Radiation Therapy or Observation Only in Treating Younger Patients Who Are Undergoing Surgery for Newly Diagnosed Stage I, Stage II, or Stage III Wilms' Tumor (AREN0532)”. The key objective of this project is to generate a large and highly curated imaging dataset of pediatric Wilms tumor patients with annotations suitable for cancer researchers and AI developers.
Annotation Protocol
For each patient, every DICOM Study and DICOM Series was reviewed to identify and annotate the clinically relevant time points and sequences. In a typical patient the following annotation rules were followed:
- The primary renal tumor(s) were annotated on post-contrast axial series. Normal renal parenchyma were excluded.
- A maximum of 5 lesions were annotated per patient scan (timepoint); no more than 2 per organ. The same 5 lesions were annotated at each time point. RECIST 1.1 principles were generally be followed for lesion annotation, however, if <5 lesions measuring >1 cm were present, then smaller lesions were annotated, again up to 2 lesions per organ or 5 lesions per patient scan. Bone lesions were included if other lesions were not present.
- Lesions were labeled separately.
- Seed points were automatically generated but were reviewed by a radiologist.
- To ensure a high standard of accuracy and data quality, each annotation was reviewed by a secondary reader.
At each time point:
- A seed point (kernel) was created for each segmented structure. The seed points for each segmentation are provided in a separate DICOM RTSS file.
- SNOMED-CT “Anatomic Region Sequence” and “Segmented Property Category Code Sequence” and codes were inserted for all segmented structures.
- Imaging time point codes were inserted to help identify each annotation in the context of the clinical trial assessment protocol.
- “Clinical Trial Time Point ID” was used to encode time point type using one of the following strings as applicable: “pre-dose” or “post-chemotherapy”
- Content Item in “Acquisition Context Sequence” was added containing "Time Point Type" using Concept Code Sequence (0040,A168) selected from:
- (255235001, SCT, “Pre-dose”)
- (262502001, SCT, "Post-chemotherapy")
- (262502001, SCT, "Post-chemotherapy")
- (262502001, SCT, "Post-chemotherapy")
We believe that these are the most clinically useful annotations for radiologists as well as for future researchers. The selected sequences inform whether there is residual or recurrent tumor and assess response to therapy.
Important supplementary information and sample code
- A spreadsheet containing key details about the annotations is available in the Data Access section below.
- A Jupyter notebook demonstrating how to use the NBIA Data Retriever Command-Line Interface application and the REST API (with authentication) to access these data can be found in the Additional Resources section below.
Data Access
This is a limited access data set. To request access please register an account on the NCTN Data Archive. After logging in, use the "Request Data" link in the left side menu. Follow the on screen instructions, and enter NCT00352534 when asked which trial you want to request. In step 2 of the Create Request form, be sure to select “Imaging Data Requested”. Please contact NCINCTNDataArchive@mail.nih.gov for any questions about access requests.
Data Type | Download all or Query/Filter | License |
---|---|---|
AREN0532 Annotations -- Segmentations, Seed Points, and Negative Findings Assessments (DICOM, 0.2 GB) | (Download requires NBIA Data Retriever) | |
AREN0532 Annotation Metadata (CSV) | ||
Original AREN0532 Images used to create Segmentations and Seed Points (DICOM, 56.4 GB) | (Download requires NBIA Data Retriever) | |
Original AREN0532 Images used to create Negative Assessment reports (DICOM, 23.7 GB) | (Download requires NBIA Data Retriever) |
Click the Versions tab for more info about data releases.
Additional Resources
- NCTN/NCORP Data Archive provides the Clinical Data files related to these subjects, and is also where you go to request access to the entire dataset
- Jupyter notebook demonstrating how to use the NBIA Data Retriever Command-Line Interface application and REST API (with authentication) to access these data
- Instructions for Visualizing these data in 3D Slicer
Collections Used in this Third Party Analyses
TCIA encourages the community to publish your analyses of our datasets. Below is a list of such third party analyses published using this Collection:
Detailed Description
Image Statistics | |
---|---|
Modalities | RTSTRUCT |
Number of Patients | 543 |
Number of Studies | 861 |
Number of Series | 2531 |
Number of Images | 2531 |
Images Size (GB) | 0.2 |
Citations & Data Usage Policy
Users must abide by the TCIA Data Usage Policy and Restrictions. Attribution should include references to the following citations:
Data Citation
Rozenfeld, M., & Jordan, P. (2023). Annotations for Vincristine, Dactinomycin, and Doxorubicin With or Without Radiation Therapy or Observation Only in Treating Younger Patients Who Are Undergoing Surgery for Newly Diagnosed Stage I, II, or III Wilms' Tumor (AREN0532-Tumor-Annotations) [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/KJA4-1Z76
TCIA Citation
Clark, K., Vendt, B., Smith, K., Freymann, J., Kirby, J., Koppel, P., Moore, S., Phillips, S., Maffitt, D., Pringle, M., Tarbox, L., & Prior, F. (2013). The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository. In Journal of Digital Imaging (Vol. 26, Issue 6, pp. 1045–1057). Springer Science and Business Media LLC. https://doi.org/10.1007/s10278-013-9622-7
Other Publications Using This Data
TCIA maintains a list of publications which leverage TCIA data. If you have a manuscript you'd like to add please contact the TCIA Helpdesk.
Version 1 (Current): Updated 2023/08/dd
Data Type | Download all or Query/Filter | License |
---|---|---|
AREN0532 Annotations -- Segmentations, Seed Points, and Negative Findings Assessments (DICOM, 0.2 GB) | (Download requires the NBIA Data Retriever) | |
AREN0532 Annotation Metadata (CSV) | ||
Original AREN0532 Images used to create Segmentations and Seed Points (DICOM, 56.4 GB) | (Download requires NBIA Data Retriever) | |
Original AREN0532 Images used to create Negative Assessment reports (DICOM, 23.7 GB) | (Download requires NBIA Data Retriever) |