Child pages
  • National Clinical Trials Network (NCTN) Date Handling

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

De-identification of DICOM dates

The resulting DICOM dates are meaningless yet preserve the relative temporal distance between studies for a patient

De-identification of dates for this dataset uses the DICOM Part 3.15 Annex E standard “Retain Longitudinal With Modified Dates Option” which allows dates to be retained as long as they are modified from the original date. Date and Date-Time fields in TCIA DICOM image headers are de-identified by normalizing to a base date of January 1, 1960 and then shifted by the number of days between the original Study Date and an "anchor date".  The anchor date for NCTN IROC data is the Date of Registration; this is the date the participant was registered for the study.   The choice of '1960' was arbitrary, but it allows one to ensure that the dates in de-identified DICOM files have been properly de-identified as anything not around that year would be suspect.TCIA Study Date = 01/01/1960 + (Original Study Date – Date of registration)  TCIA implements this using a technique which de-identifies the dates while preserving the longitudinal relationship between them.  Original dates will be first normalized to 01 January, 1960 and then offset relative to the date of registration for each patient.  This normalized date system was chosen in order to make it obvious that the dates are not real, and to make it easy to quickly determine how much time has passed between the date of registration and the patients' related imaging studies

For example, if the original Study Date real date of a patient's registration was 03/2927/2018 and the original imaging Study Date of registration was 03/2729/2018 then the "Days from registration" would be +2 and the anonymized TCIA Study Date would become 01/03/1960.

This technique de-identifies the dates while preserving the longitudinal relationship between dates.  Therefore, a researcher won’t know the precise date the scan occurred, but if a follow up scan was performed 120 days later, that same 120 day difference between scans of a subject will exist in the TCIA images.  Dates that occur in DICOM tags other than Date or Date-Time fields are removed. An example of this would be a date entered into the Series Description field.  If the date is associated with a library for Code Meaning then that date is preserved as the date would be required to look up the meaning in the correct version of the library.  To show that the dates have been modified, the term “MODIFIED” is written into DICOM tag (0028,0303) “LongitudinalTemporalInformationModified”.

Original dates will be first normalized to 01 January, 1960 and then offset relative to the date of registration.

Insertion of computed "REGISTRATION"/Days offset from registration" value

The inserted "REGISTRATION" or "Days offset from registration" value can be compared with similar values in the NCTN clinical data to understand the clinical context of the imaging study

The number of days the study occurred relative to the date of registration is calculated (using the registration date as identified for each patient) In addition to modifying the actual date fields in the DICOM header, the "days from registration" values are calculated and stored in the DICOM tag (0012,0052) Longitudinal Temporal Offset from Event with the associated tag (0012,0053) Longitudinal Temporal Event Type set to “REGISTRATION”. The value in the (0012,0052) DICOM tags is given in days as related to the date of registration which links the imaging data to the clinical data for a given subject.  

Note: If these DICOM tags are not present, DICOM tag (0012,0050) Clinical Trial Time Point ID with the associated tag (0012,0051) Clinical Trial Time Point Description Attribute set to “Days offset from registration” provides this same information.

...

  This inconsistency is due to a change in how dates were handled in the first NCTN trials that were published on TCIA.