The first required data set is the
Experimental Design Data Set (EDDS). The
affylatin_exp2.sas7bdat EDDS, which is used in the example that follows, is shown below. This required data set provides information about the experiment and identifies the files and columns containing the relevant data in the input data set. Note that one column in the EDDS must be named
ColumnName and the values contained in this column must exactly match the column names in the input data set.
The second required data set, the Input Data Set, contains all of the numeric data to be analyzed. The sample data set used in the following example, the chips45and55.sas7bdat data set, shown below, is from the Affymetrix Latin Square experiment that is described in Sample Case Studies. This data was originally generated by Affymetrix Corporation to develop and validate their U95A GeneChip and Microarray Suite (MAS) 5.0 algorithm over a range of known concentrations. Each group in the experiment contains a pool of non-specific
RNA as well as a set of 14 distinct human
transcripts spiked in at known concentrations. Intensity data are listed in the last two columns of the data set.
The third required data set, the coordinate data set, lists the
x- and
y-coordinates for each spot on the arrays. The
probemap_u95a.sas7bdat data set, used in the following example, is shown below.
Two additional, optional, data sets can also be merged into the analysis. The first, the Annotation Data Set, provides annotation information for each of the markers, such as gene names, function, physical location, and
association, that can be used as key terms for web and database searches.
The second optional data set, the QC data set, contains Affymetrix Input Engine generated QC
probes data (
qcprobe). This data set should be specified only if the input data set does not contain data for QC probes and if you want QC probes data displayed in the pseudo image. The
qcdata_c45and55.sas7bdat QC data set is shown below.
The affylatin_exp2.sas7bdat EDDS, the
chips45and55.sas7bdat input data set, the
probemap_u95a.sas7bdat coordinate data set and the
qcdata_c45and55.sas7bdat QC data set are all included in the
Sample Data folder.