This report calculates Mahalanobis distance based on available data, using the equation
, to identify subject inliers and outliers in multivariate space from the multivariate
mean. Refer to the JMP documentation on
Mahalanobis Distance Measures for statistical details. It also generates results by site to see which sites are extreme in this multivariate space.
Running this report for Nicardipine using default settings generates the
Report shown below.
Presents plots of Mahalanobis distance of all subjects (distance is from the multivariate
mean), colored by study site, and
Box Plots presented by sites.
•
|
One JMP Mahalanobis Distances plot to identify significant outliers. In the Mahalanobis Distances plot shown above, the distance of each specific observation from the mean center of the other observations from the site is plotted. Those points residing above the upper 95% confidence interval (outliers) or below the lower 95% confidence interval (inliers) correspond to those rows that warrant the most attention due to their significant distance from the mean center of all other observations. The region between 95% and 99.7% is referred to as a moderate inlier or outlier depending of it is in the lower or upper bounds, respectively. Beyond 99.7 is considered a severe outlier or inlier.
|
Details variables that contain missing data that prevented
Mahalanobis distance from being calculated for certain subjects (
Flag = 1) or variables that were dropped from analysis based on the option
Remove variables from analysis with a missing data percentage of at least:. By default, variables with
5% or more of missing data are
not used in the calculation of Mahalanobis Distance. Data are presented either as counts (
left) or percentages (
right) reflect the number of values that are missing for each variable. Opening the data table shows the percentage of missing data for each test.
•
|
Profile Subjects: Select subjects and click to generate the patient profiles. See Profile Subjects for additional information.
|
•
|
Show Subjects: Select subjects and click to open the ADSL (or DM if ADSL is unavailable) of selected subjects.
|
•
|
Create Subject Filter: Select subjects and click to create a subject filter. Follow-up analyses are subset to these subjects if the Select saved subject Filter is applied in the dialog.
|
•
|
Variable Contributions to Distance: Select subjects and click to create Pareto plots for each subject that explain how much each variable contributes to a patient being an outlier, with a total possible of 100% for each subject. Analysis of the Pareto plots can enable you to view how selected subjects are extreme for the selected covariates.
|
Output includes one summary data set (named csass_sum_XXX2, by default) containing one record per subject with pre-dosing data, one data set of all pairwise distances within the
covariate subgroups (named
csass_alldist_XXX, by default), one data set containing minimum pairwise distances for each covariate subgroup (named
csass_mindist_XXX), by default), one data set per covariate subgroup containing pairwise distances (named
csass_p_Y_XXX, by default, where
Y is indexed 1 to the number of covariate subgroups) and one data set per covariate subgroup containing the
distance matrix of subjects within the covariate subgroup (named
csass_Y_XXX, by default, where
Y is indexed 1 to the number of covariate subgroups).
•
|
Click the Options arrow to reopen the completed report dialog used to generate this output.
|
The Include Age and
Include Sex are used to include age and sex (when these are not matching criteria) in the analysis for computing distance.
You can opt to Include findings domains to analyze and summarize findings by visit and time point number. If you decide to include finding results, you must specify which
Findings Tests to include and whether to
Analyze findings using: using original or standard units. Check the
Remove unscheduled visits option to included findings from scheduled visits only; leave the option unchecked to include findings from all visits.
You can opt to Include intervention domains to analyze the frequency of each intervention. By default, all intervention domains are included. However, you can restrict the analysis to domains specified using the
Subset of Domains to Analyze for Interventions option.
You can opt to Include event domains to analyze the frequency of each event. By default, all event domains are included. However, you can restrict the analysis to domains specified using the
Subset of Domains to Analyze for Events option. You can use the following options to exclude events:
The Include events or interventions experienced by at least this percent of patients: option includes all events that are experienced by the specified threshold percentage of subjects; events not meeting this threshold are excluded.
Checking the Remove all variables with missing values option excludes all variables with one or more missing values. The
Remove variables from analysis with a missing data percentage of at least: option enables you to be more permissive with regards to missing data. When the
Remove all variables with missing values option is not checked, you can specify a threshold value for exclusion; only those variables whose percentage of missing values exceeds this threshold are excluded. All other variables are included in the analysis.
Use the Summarize sites with at least this many subjects: option to specify the number of subjects a site must have in the selected
population in order to include it in the summary.
See Select the analysis population,
Select saved subject Filter3,
Additional Filter to Include Subjects, and
Subset of Visits to Analyze for Findings for more information.
The _XXX designation is used to designate a one- to three-digit number that is added sequentially to prevent overwriting of existing data sets.