• Option is the PROC GENESELECT option, and
• a space is used to delimit individual conditions.
Specifies the smallest number of observations a new branch can have, expressed as the proportion of the number N of available observations in the DATA= data set specified in the PROC statement. Can be less than the total number of observations in the data set because observations with a missing trait or nonpositive value of the variable specified in the FREQ statement are excluded from N . The LEAFSIZE= option specifies the same quantity as an absolute number. The procedure uses the larger of the two. Can be any number from zero through one. The default value equals 0.001. Specifies the minimum number of observations that a given qualitative variable value must have in order to use the value in a split search. The GENESELECT procedure handles qualitative values that appear in fewer than n observations in the same way it handles missing values . The default value of n is 5. Specifies the type of model to create. TREEBOOST generates a boosted series of trees . TREE generates a decision tree.
Type specific PROC GENESELECT options in the PROC GENESELECT Statement Options field.Example : To specify a minimum proportion (0.003) of observations a new branch can have, type LEAFFRACTION=0.003 in the text field, as shown below:Refer to the SAS PROC GENESELECT documentation for more information.