Times and dates are an integral part of the data generated in all clinical trials. At least one timing variable must be included in all
SDTM subject-level domain data sets. Time and date variables are
numerically formatted according to the following ISO 8601 standard:
YYYY-MM-DDThh:mm:ss, where
•
|
YYYY is the four-digit year,
|
•
|
MM is the two-digit month (values rage from 01-12),
|
•
|
DD is the two-digit day (values range from 01-31),
|
•
|
hh is the two-digit hour (values range from 00-23)
|
•
|
mm is the two-digit minutes (values range from 00-59), and
|
•
|
ss is the two-digit seconds (values range from 00-59).
|
•
|
T, which indicates that time information is included (omitted if no time component is included),
|
•
|
-, which either separates the date elements or can be used to indicate missing date components
|
•
|
:, which separates time elements,
|
•
|
/, which can be used to separate the date components from the time components, and
|
•
|
P, which serves as a duration indicator and precedes the date/time components representing the duration of an event or intervention.
|
Note: Spaces are never allowed in any ISO 8601-formatted representations of dates/times.
•
|
When a partial date identified (xxDTC for LB, EG or VS), an asterisk (*) is appended to the end of the finding name or test code. You should review the findings for the appropriately reported set of observations.
|
•
|
When the reference date (RFSTDTC) is partial, an asterisk is appended to the AETERM. You should review all reported dates, study days, and contents for correctness.
|
•
|
When the AE start date (AESTDTC) is partial, an asterisk is appended to the date in the narrative. You should review all contents of the narrative.
|
•
|
When the AE end date (AEENDTC) is partial, an asterisk is appended to the date in the narrative. You should review the final outcome and narrative header information for correctness.
|
•
|
When either or both of the start or stop dates (CMSTDTC or CMENDTC) for Concomitant medications are partial, an asterisk is appended to the end of CMTERM or CMDECOD (based on the selected analysis option). You should review the data for this medication for correctness.
|
Domains are evaluated for SDTM folder and ADaM folder. Domains from SDTM folder are named using the 2-letter code XX, where
XX. can be any two letters. For example, the domain containing adverse event data is
AE (the data set name).
Domains from ADAM always contain AD as the first two letters of the domain name. ADSL is constant while other letters following the
AD identify the specific domain. For example,
ADAE is the ADaM domain
AE.
Domains are classified as findings if XXTESTCD is present, interventions if
XXTRT is present, or events if
XXDECOD is present. If domain type cannot be identified for a given folder, the domain is ignored.
In addition, Basic Data Structure (BDS) is supported for ADAM. If PARAMCD and either
AVAL or
AVALC are present, the domain is considered a findings domain. If other variables are present as above and the domain type cannot be identified, it is ignored. This is true even if
XXTESTCD is present since it would not be clear whether to transform ADaM variables or use SDTM variables for the domain.
Supplemental domains (SUPPxx) can be used by JMP Clinical. However, because these domains lack the standard data needed for analysis,
SUPPxx domains are recognized only when the main domains are present. For example,
SUPPAE is recognized when
AE is present but is ignored when
AE is not found.
SUPP, where all supplemental data is within one data set, is not supported in the CDISC specification and will not be used.
The CDISC SDTM Implementation Guide defines the following terms:
A Natural Key is one or more variables whose contents uniquely distinguish every record (row) in the data set. For example, each row of the DM domain should represent a different subject. The natural keys in this instance could be Study Identifier (
STUDYID) and Unique Subject Identifier (
USUBJID).
A Surrogate Key is an artificially established single-variable identifier that uniquely identifies rows. This could include any of the
xxSEQ variables. For example, if the vital signs data set contained 200 records, the
VSSEQ variable could be numbered 1 to 200 to uniquely identify the rows.
Alternatively, xxSEQ can be made part of a natural key so that
xxSEQ can count from 1 to
ni, where
ni is the total number of records for a subject. Here the keys would be
STUDYID,
USUBJID and
xxSEQ.
JMP Clinical supports the use of define.xml files (both Define-XML Versions 1 and 2) that define the keys and can retrieve the keys from them. It is expected that a separate
define.xml file is present within each of the
SDTM and
ADaM directories where the domains are stored. If a
define.xml file is not present for the specific library or the domain is not present in the
define.xml file specific to the library (
SDTM or
ADAM), JMP Clinical proceeds to step 2.
JMP Clinical looks for presence of xx.txt files (for SDTM) or
ADxx.txt files (for ADaM) in
keys subfolders located in either the
SDTM or
ADaM folders where the domains are stored. For each domain (
DM,
CO,
SE,
SV, the findings domains, the events domains, and the interventions domains) and
ADSL, the JMP Clinical chooses either the
xx.txt file (where
xx is the domain name) or the
ADxx.txt file depending on the library used for each domain. Keys are derived from that text file.
For example, a DM.txt file located in the SDTM folder contains the following three rows:
Note: Keys files need to present during updates to enable users to change keys during snapshots if needed to account for non-uniqueness of rows based on a past set of keys.
Note: This option is not available for SAS Transport files.
JMP Clinical uses the TreatmentEmergent SAS
macro to determine whether records might be:
•
|
On Treatment. Those are events that begin on or after the first dose of any study drug until the last date of dosing plus the offset 1 for end of dosing
|
This macro can be applied to event domains (including AE and
CE), intervention domains (including
CM) and findings domains (including
VS) and supplemental domains (including
SUPPAE).
•
|
If either xx.xxTRTEM or xx.TRTEMFL are present with a value of either Yes or Y, the event is considered as treatment emergent.
|
•
|
The start dates for dosing are determined from ADSL.TRTSDTM. If ADSL.TRTSDTM is not present, ADSL.TRTSDT is used. If ADSL.TRTSDT is not present, DM.RFXSTDTC is used. If DM.RFXSTDTC is not present, the earliest date in EX.EXSTDTC is used. If EX.EXSTDTC is not present, DM.RFSTDTC is used.
|
•
|
The end dates for dosing are determined from ADSL.TRTEDTM. If ADSL.TRTEDTM is not present, ADSL.TRTEDT is used. If ADSL.TRTEDT is not present, DM.RFXENDTC is used. If DM.RFXENDTC is not present, the latest date in EX.EXENDTC is used. If EX.EXENDTC is not present, DM.RFENDTC is used.
|
•
|
For AE, CM, and CE, if the date listed is on or after the dosing start date, the event is considered treatment emergent.
|
•
|
Note: All events are considered pre-treatment for those subjects not on treatment.
|
When running Findings reports, JMP Clinical looks for and appends the values from either
xxPOS or xxSPEC to the test names in
xxTESTCD and
xxTEST. This enables you to analyze findings data when multiple findings test names are identical across the variables:
xxTESTCD,
xxPOS, and
xxSPEC.
If test name values are still not unique across categories of xxCAT or
xxSCAT (if they exist) after appending the prior variables, a numeric index is appended to non-unique tests so that reports can still be run and tests are not inappropriately combined.
•
|
Natural Keys are one or more variables whose contents uniquely distinguish every record (row) in the data set. For example, each row of the DM domain should represent a different subject. The natural keys in this instance could be Study Identifier ( STUDYID) and Unique Subject Identifier ( USUBJID).
|
•
|
A Surrogate Key is an artificially established single-variable identifier that uniquely identifies rows. This could include any of the xxSEQ variables. For example, if the vital signs data set contained 200 records, the VSSEQ variable could be numbered 1 to 200 to uniquely identify the rows.
|
Alternatively, xxSEQ can be made part of a natural key so that
xxSEQ can count from 1 to
ni, where
ni is the total number of records for a subject. Here the keys would be
STUDYID,
USUBJID, and xxSEQ.
If you have ever used PROC CONTENTS, the output header for a data set contains various information about the data set. One of these pieces of information is “SORTED: YES/NO”. If the data set happens to be sorted (in other words,
YES), then additional information is provided in the PROC CONTENTS output after the description of the data set variables. For example, PROC CONTENTS is used on the
DM domain for Nicardipine, following row in the output:
Sortedby STUDYID USUBJID is added to the metadata that is stored in the SAS formatted data set; the variables used for the data set sort is what JMP Clinical uses to define the keys for a study.
If the study domains lack the SORTEDBY metadata associated with the data sets, JMP Clinical attempts to derive the keys based on suggestions provided in the
SDTM Implementation Guide. However, the keys generated might not be the optimal set for a given domain.
So that happens if the supplied keys do not define the records (rows) uniquely? When the study is first added to JMP Clinical, a duplicate report is provided for each affected domain that details the records (rows) that cannot be uniquely determined. These records (rows) are still labeled as New in JMP Clinical. However, any record-level notes that are system- or user-generated would be associated with two or more records. This might be OK if there are few duplicates to contend with, but any duplicates should be reviewed as potential data errors (data that was mistakenly entered twice). When the study data is updated and redundancies remain, JMP Clinical has no way to match these records. In other words, it cannot assess whether any changes were made to the records or not. Again, if there are few duplicates, these records (rows) can be reviewed at multiple snapshots for correctness.
8
|
Given (3), do not use terms that rely on medical coding as part of the keys (in other words, AEDECOD based on MedDRA or CMDECOD based on WHODRUG). There are two reasons for this. First, medical coding might not be immediately available. This provides an opportunity for a missing value of AEDECOD to change to a nonmissing coded term later on. Further, sometimes over the course of a study, coded terms might change based on new insights of the clinical team, so, you should use verbatim terms such as AETERM or CMTRT.
|
9
|
The xxSEQ variable or STUDYID, USUBJID, and xxSEQ set might be good keys to use since these values are unlikely to change. However, the xxSEQ variable must be carefully maintained so that the number never changes for a particular record. For example, suppose a CM data set contains two records:
|
How does JMP Clinical define various terms for risk-based monitoring?
2
Subjects are considered RANDOMIZED if there is at least one record from
DS where the index ((
DS.DSDECOD3),
RANDOMIZED) is true.
•
|
the value in DS.EPOCH is SCREENING, the value in DS.DSCAT is DISPOSITION EVENT, and value in DS.DECOD is COMPLETED, or
|
•
|
the value in DS.DSEPOCH is SCREENING and the value in DS.DSDECOD is COMPLETED, or
|
To determine whether the subjects have COMPLETED the trial, a
The SAS WHERE Expression can be included on the analysis dialog to select the appropriate
DS records (this statement should also select the records that indicate whether a subject has alternatively
DISCONTINUED or
WITHDRAWN). If this
The SAS WHERE Expression is supplied and the value in
DS.DSDECOD is
COMPLETED, the subject is considered to have completed the trial. Otherwise, based on the available variables, the subject is considered to have completed the trial only if
•
|
the value in DS.EPOCH is TREATMENT and the value in DS.DSCAT is DISPOSITION EVENT and the value in DS.DSDECOD is COMPLETED, or
|
•
|
the value in DS.EPOCH is TREATMENT and the value in DS.DSDECOD is COMPLETED, or
|
•
|
the value in DS.DSCAT is DISPOSITION EVENT and the value in DS.DSDECOD is COMPLETED.
|
Subjects are considered to have DISCONTINUED or
WITHDRAWN when a
The SAS WHERE Expression is supplied and
DS.DSDECOD is
^=COMPLETED. Otherwise, based on the available variables the subject is considered to have discontinued the trial if
•
|
the value in DS.EPOCH is TREATMENT and the value in DS.DSCAT is DISPOSITION EVENT and the value in DS.DSDECOD is ^= COMPLETED, or
|
•
|
the value in DS.EPOCH is TREATMENT and the value in DS.DSDECOD is ^= COMPLETED, or
|
•
|
the value in DS.DSCAT is DISPOSITION EVENT and the value in DS.DSDECOD is ^= COMPLETED.
|
•
|
value for DM.ACTARM is neither SCREEN FAILURE, NOT TREATED, nor NOT ASSIGNED, or
|
•
|
value for DM.ARM is neither SCREEN FAILURE, NOT TREATED, nor NOT ASSIGNED, or
|
An adverse event (AE) is considered serious (an SAE) if the value in
AE.AESER is either
Y or
YES.
An AE is considered fatal if the value in either AE.AEOUT or is either FATAL or DEATH, or if the value in
AESDTH is either
Y or
YES.
•
|
the CO domain is available and a comment containing DEATH, DIED, or DEAD.
|
•
|
If the value in DS.DSDECOD is either DEATH, DIED, or DEAD, then the subject is considered to have Discontinued Due to Death, or
|
•
|
the CO domain is available and a comment containing DEATH, DIED, or DEAD, or
|
•
|
If the value in DS.DSDECOD is either LOST TO FOLLOW-UP, LOST TO FOLLOWUP, LOST TO FOLLOW UP, or LTFU, then the subject is considered to be Lost to Followup, or
|
•
|
If the value in DS.DSDECOD is either ADVERSE EVENT or AE, then the subject is considered to have Discontinued Due to Adverse Event, or
|
•
|
If the value in DS.DSDECOD is either WITHDRAWAL BY SUBJECT, SUBJECT WITHDRAWAL, WITHDREW CONSENT, or SUBJECT WITHDREW CONSENT, then the Patient Withdrew from Study, or
|
With J treatment comparisons of ordered (smallest to largest)
p-values
p(j), the FDR
p-value (Benjamini and Hochberg, 1995
4) for the
jth hypothesis is:
For the ith site or country and the
jth risk indicator,
where is the mean,
median or user-supplied
center value. The quantity
equals
,
, and
, for
Direction of Risk Signals equal to B, U, and L, respectively, and
is the value for the
it h site or country and the
jth risk indicator.
It is acceptable to specify both yellow and red risk thresholds, one or no risk thresholds. When specifying only a moderate threshold, the Red Percent of Center is left missing in the risk threshold data set so that moderate risk is considered
. In instances where values do not meet the criteria for moderate or severe risk, the risk is considered mild (green). Note that for risk thresholds defined using the above criteria, no threshold colors are determined in instances where the
mean,
median or
center value is calculated or set to zero.
In this case, it is acceptable to specify both thresholds, one threshold, or no risk thresholds at all. When specifying only a moderate threshold, the Red Magnitude is left missing in the risk threshold data set so that moderate risk is
. In cases where neither moderate nor severe risk applies, the risk is considered mild (green).
The first, or Overall Risk Indicator, incorporates all of the variables meeting these criteria into a single measure that signifies the overall risk and performance of a clinical site. This indicator is generated only when the
Weight for Overall Risk Indicator exceeds 0 for at least one of the available risk indicators exhibiting variability. If none of the individual indicators have a
Weight for Overall Risk Indicator > 0, then the corresponding
Overall Risk Indicator is not generated.
Each of the other four overall indicators - Enrollment Metrics,
Disposition,
Adverse Events, and
Manually Entered - combines subsets of the risk indicators based on
Category in the risk weight data set. By default,
Category matches how variables are grouped in Risk-Based Monitoring, with
Manually Entered applied to all user-supplied risk indicators from
Update Study Risk Data Set. If no indicators have a
Weight for Overall Risk Indicator > 0 for a given category, then the corresponding overall indicator is not provided.
The Weight for Overall Risk Indicator (
wj) can either be missing (in this case, it is assumed to be zero) or greater than or equal to zero. The weights are self-normalizing in that each weight is divided by the sum of all weights for variables contributing to the particular overall indicator. The contribution of each indicator to an overall indicator is based on its weight, center value (either
mean,
median or user-provided
center value,
), standard deviation (
), and direction. In general, the value for an overall indicator for the
ith site or country and the
jth risk indicator is defined as
, where
,
, or
when Direction equals B,U, or L, respectively. This can be interpreted as larger values imply greater risk. By default, all weights are assumed equal to one in the Default Risk Threshold data set, meaning that each variable contributes equally to each overall indicator.