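Filter Wide Columns Based on Tall Rows removes columns from a wide data set based on a filter applied to the rows of the corresponding tall data set. You specify the filter using variables from the tall data set. Each tall row that fails the filter is dropped, along with its corresponding column in the wide data set.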
The drosophilaaging_wide.sas7bdat file serves as an example Wide SAS Data Set. It is located in the \LifeSciences\Sample Data\Microarray\Scanalyze Drosophila directory included with JMP Genomics, is associated with the Drosophila aging experiment of Jin, et al. (2001), and is described in Drosophila Aging Experimental Data.
The drosophilaaging_tall.sas7bdat file serves as an example Tall SAS Data Set. It is located in the same \LifeSciences\Sample Data\Microarray\Scanalyze Drosophila directory, is associated with the same Drosophila aging experiment of Jin, et al. (2001), and is also described in Drosophila Aging Experimental Data.
In this example, all variables starting with log2in_ are Wide Variables Corresponding to Tall Rows. Rows from the tall data set (and the corresponding columns from the wide data set) are included only if the value of SPOT exceeds 100.
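The filtering logic described above can be sketched in a few lines of Python. This is an illustration only, not the code that JMP Genomics runs. It assumes that the n-th tall row corresponds to the n-th wide column whose name starts with the prefix, as in this example. The function name filter_wide_by_tall, the gene variable, and the toy values are hypothetical and are not taken from the Drosophila sample files.

```python
def filter_wide_by_tall(wide_columns, wide_rows, tall_rows, prefix, keep):
    """Return (kept_columns, kept_wide_rows, kept_tall_rows).

    Assumption: the n-th tall row corresponds to the n-th wide column
    whose name starts with `prefix` (positional matching).
    """
    # Wide variables that correspond to tall rows, in their original order.
    matched = [c for c in wide_columns if c.startswith(prefix)]
    if len(matched) != len(tall_rows):
        raise ValueError("each tall row needs exactly one matching wide column")

    # Evaluate the filter once per tall row.
    flags = [keep(row) for row in tall_rows]
    dropped = {col for col, ok in zip(matched, flags) if not ok}

    # Columns without the prefix are never dropped.
    kept_columns = [c for c in wide_columns if c not in dropped]
    kept_wide = [{c: r[c] for c in kept_columns} for r in wide_rows]
    kept_tall = [row for row, ok in zip(tall_rows, flags) if ok]
    return kept_columns, kept_wide, kept_tall


# Toy data (hypothetical values, not the Drosophila sample data).
tall = [
    {"SPOT": 50, "gene": "a"},
    {"SPOT": 100, "gene": "b"},
    {"SPOT": 101, "gene": "c"},
    {"SPOT": 250, "gene": "d"},
]
columns = ["Array", "log2in_a", "log2in_b", "log2in_c", "log2in_d"]
wide = [
    {"Array": 1, "log2in_a": 0.1, "log2in_b": 0.2, "log2in_c": 0.3, "log2in_d": 0.4},
    {"Array": 2, "log2in_a": 1.1, "log2in_b": 1.2, "log2in_c": 1.3, "log2in_d": 1.4},
]

# Keep only tall rows (and matching wide columns) where SPOT exceeds 100.
cols, wide_fw, tall_fw = filter_wide_by_tall(
    columns, wide, tall, "log2in_", lambda row: row["SPOT"] > 100
)
print(cols)  # ['Array', 'log2in_c', 'log2in_d']
```

Note that the comparison is strict: a row with SPOT equal to 100 is dropped, matching the "exceeds 100" condition in this example.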
The output from this process consists of a Results window, which lists the two subsetted data sets:
The drosophilaaging_wide_fw.sas7bdat subsetted wide data set is shown below.
The first column beginning with log2in_ in the subsetted wide data set is log2in_Tpi. This is the 35th log2in_ column in the Input Wide Data Set. It corresponds to the 35th row of the Input Tall Data Set, which is the first row where the value of SPOT exceeds 100. All subsequent columns in the subsetted wide data set correspond to the rows after the 35th row in the Input Tall Data Set (and to the columns after the 35th log2in_ column in the Input Wide Data Set).
The drosophilaaging_fw.sas7bdat subsetted tall data set is shown below.
Only those rows where the SPOT value exceeds 100 were kept (row 35 onward). Consequently, only log2in_ columns 35 onward were kept in the subsetted wide data set.