Filter Wide Columns Based on Tall Rows
enables you to remove columns from a
wide
data set based on a filter specified using
variables
from the corresponding
tall
data set to filter its rows.
The
drosophilaaging_wide.sas7bdat
file (located in the
\LifeSciences\Sample Data\Microarray\Scanalyze Drosophila
directory included with JMP Genomics, associated with the
Drosophila
aging experiment of Jin, et al. (2001) and described in
Drosophila Aging Experimental Data
) serves as an example
Wide SAS Data Set
.
The
drosophilaaging_tall.sas7bdat
file (located in the
\LifeSciences\Sample Data\Microarray\Scanalyze Drosophila
directory included with JMP Genomics, associated with the
Drosophila
aging experiment of Jin, et al. (2001) described in
Drosophila Aging Experimental Data
) serves as an example
Tall SAS Data Set
.
In this example, all variables starting with
log2in_
are
Wide Variables Corresponding to Tall Rows
. Rows from the tall data set (and the corresponding columns from the wide data set) are to be included only if the value of
SPOT
exceeds
100
.
The output from this process consists of a
Results
window, which lists the two subsetted data sets:
The
drosophilaaging_wide_fw.sas7bdat
subsetted wide data set is shown
below
.
The first column beginning with
log2in_
listed in the subsetted wide data set is
log2in_Tpi
. This is the 35
th
log2in_
column listed in the
Input Wide Data Set
. It corresponds to the 35
th
row of the
Input Tall Data Set
, which is the first row where the value of
SPOT
exceeds
100
. All subsequent columns in the subsetted wide data set correspond to the rows in the
Input Tall Data Set
after the 35
th
row (and the columns in the
Input Wide Data Set
after the 35
th
log2in_
column).
The
drosophilaaging_fw.sas7bdat
subsetted tall data set is shown
below
.
Only those rows where the
SPOT
value exceeded
100
were kept (row 35, onward). This consequently means that only
log2in_
column 35, onward, were kept in the subsetted wide data set.