This report calculates
Mahalanobis distance
based on available data, using the equation
, to identify subject inliers and outliers in multivariate space from the multivariate
mean
. Refer to the JMP documentation on
Mahalanobis Distance Measures
for statistical details. It also generates results by site to see which sites are extreme in this multivariate space.
Running this report for
Nicardipine
using default settings generates the
Report
shown below.
Presents plots of
Mahalanobis distance
of all subjects (distance is from the multivariate
mean
), colored by study site, and
Box Plot
s presented by sites.
•
|
One JMP
Mahalanobis Distances
plot to identify significant outliers. In the
Mahalanobis Distances
plot shown above, the distance of each specific
observation
(row number) from the
mean
center of the other observations of each row number is plotted. Those outlier points residing above the dotted line correspond to those rows that warrant the most attention due to their significant distance from the mean center of all other observations.
|
The first
box plot
shows all subjects for which Mahalanobis Distance is calculated. Values closer to
zero
(0) reflect subjects that are close to the multivariate mean of the
variables
(inliers). Larger values represent subjects that are extreme in multivariate space. The
square
of Mahalanobis Distance is distributed as
chi-square
with
k
degrees of freedom, where
k
is the number of variables used in the calculation of Mahalanobis Distance. The redline reflects the
square root
of
k
. The second figure shows box plots by study site. This allows the analyst to determine how different sites are from the multivariate mean.
Details
variables
that contain missing data that prevented
Mahalanobis distance
from being calculated for certain subjects (
Flag = 1
) or variables that were dropped from analysis based on the
dialog
option
Remove variables from analysis with a missing data percentage of at least:
. By default, variables with
5
% or more of missing data are
not
used in the calculation of Mahalanobis Distance. Data are presented either as counts (
left
) or percentages (
right
) reflect the number of values that are missing for each variable. Opening the data table shows the percentage of missing data for each test.
•
|
Show Subjects
: Select subjects and click
to open the
ADSL
(or
DM
if
ADSL
is unavailable) of selected subjects.
|
•
|
Demographic Counts
: Select subjects and click
to create a subject filter. Follow-up analyses are subset to these subjects if the
Subject Filter
is applied in the dialog.
|
Output includes one summary data set (named
csass_sum_XXX
2
, by default) containing one record per subject with pre-dosing data, one data set of all pairwise distances within the
covariate
subgroups (named
csass_alldist_XXX
, by default), one data set containing minimum pairwise distances for each covariate subgroup (named
csass_mindist_XXX
), by default), one data set per covariate subgroup containing pairwise distances (named
csass_p_Y_XXX
, by default, where
Y
is indexed 1 to the number of covariate subgroups) and one data set per covariate subgroup containing the
distance matrix
of subjects within the covariate subgroup (named
csass_Y_XXX
, by default, where
Y
is indexed 1 to the number of covariate subgroups).
•
|
Click the
Options
arrow to reopen the completed report dialog used to generate this output.
|
The
_XXX
designation is used to designate a one- to three-digit number that is added sequentially to prevent overwriting of existing data sets.