PROC GENESELECT Statement Options
Enter SAS PROC GENESELECT options in this field to override default parameters.
You can specify any PROC GENESELECT option using the following syntax:
Option
where:
• | Option is the PROC GENESELECT option, and |
• | a space is used to delimit individual conditions. |
Examples of commonly used PROC GENESELECT options are listed in the following table.
Option |
Definition |
LEAFFRACTION=p |
Specifies the smallest number of observations a new branch can have, expressed as the proportion of the number N of available observations in the DATA= data set specified in the PROC statement. Can be less than the total number of observations in the data set because observations with a missing trait or nonpositive value of the variable specified in the FREQ statement are excluded from N. The LEAFSIZE= option specifies the same quantity as an absolute number. The procedure uses the larger of the two. Can be any number from zero through one. The default value equals 0.001. |
MINCATSIZE=n |
Specifies the minimum number of observations that a given qualitative variable value must have in order to use the value in a split search. The GENESELECT procedure handles qualitative values that appear in fewer than n observations in the same way it handles missing values. The default value of n is 5. |
MODELTYPE=type |
Specifies the type of model to create. TREEBOOST generates a boosted series of trees. TREE generates a decision tree. |
To Specify One or More PROC GENESELECT Options:
8 | Type specific PROC GENESELECT options in the PROC GENESELECT Statement Options field. |
Example: To specify a minimum proportion (0.003) of observations a new branch can have, type LEAFFRACTION=0.003 in the text field, as shown below:
For Additional Information
Refer to the SAS PROC GENESELECT documentation for more information.