The Q-K Model Fitness process fits different combinations of Q and K variables and reports model fitness information for each model, such as AIC, AICC, and BIC. Use this information to choose the best combination of Q and K for association tests in the Q-K Mixed Model process.
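
As a rough illustration of how such criteria rank candidate models, the sketch below applies the standard textbook definitions of AIC, AICc, and BIC to a few candidate fits. The log-likelihoods, parameter counts, and sample size are made-up values, and the function and variable names are hypothetical; the exact computation used by the process is not described here and may differ. For all three criteria, smaller is better.

```python
import math


def information_criteria(log_likelihood, n_params, n_obs):
    """Return (AIC, AICc, BIC) using the standard textbook formulas."""
    aic = -2.0 * log_likelihood + 2.0 * n_params
    # Small-sample correction to AIC; only defined when n_obs > n_params + 1.
    aicc = aic + (2.0 * n_params * (n_params + 1)) / (n_obs - n_params - 1)
    bic = -2.0 * log_likelihood + n_params * math.log(n_obs)
    return aic, aicc, bic


# Hypothetical fits (assumed values, not output from JMP Genomics):
# label -> (log-likelihood, number of estimated parameters)
candidate_fits = {
    "no Q, no K": (-250.0, 2),
    "Q only": (-240.0, 4),
    "K only": (-238.0, 3),
    "Q + K": (-236.5, 5),
}
n_obs = 100  # assumed number of individuals

scores = {
    label: information_criteria(ll, k, n_obs)
    for label, (ll, k) in candidate_fits.items()
}

# Pick the candidate with the smallest BIC (index 2 of each tuple).
best_by_bic = min(scores, key=lambda label: scores[label][2])

for label, (aic, aicc, bic) in scores.items():
    print(f"{label:12s} AIC={aic:8.2f} AICc={aicc:8.2f} BIC={bic:8.2f}")
print("Smallest BIC:", best_by_bic)
```

The criteria do not always agree, because BIC penalizes extra parameters more heavily than AIC as the sample size grows, so it is worth comparing all three before settling on a combination.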
This process requires one data set, the Input Data Set, which contains all of the marker data. The sample data set used in the following example, samplegmdata_numgeno_rm_pcm.sas7bdat, was generated from the samplegmdata.sas7bdat data set described in Sample Genetic Marker Data. It contains the following, all merged with the original data: a root identity-by-descent (IBD) matrix, computed for 60 computer-generated SNP genotypes by singular value decomposition (SVD) in the Relationship Matrix process; a compressed IBD matrix from the K Matrix Compression process; a principal components matrix from the PCA for Population Stratification process; a coordinates matrix from the Multidimensional Scaling process; and a population membership probability. This data set is partially shown below. Note that this is a wide data set: markers are listed in columns, whereas individuals are listed in rows.
The samplegmdata_numgeno_rm_pcm.sas7bdat data set is included in the Sample Data folder that comes with JMP Genomics. For detailed information about the files and data sets used or created by JMP Life Sciences software, see Files and Data Sets.