Launch the Make Validation Column platform by selecting Analyze > Predictive Modeling > Make Validation Column.
Figure 11.3 Make Validation Column Launch Window
For more information about the options in the Select Columns red triangle menu, see Column Filter Menu in Using JMP.
The Make Validation Column launch window provides the following options:
Stratification Columns
Assigns one or more stratification columns. The levels of these columns are balanced across the holdback sets.
Grouping Columns
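Assigns one or more grouping columns. Rows that share a level, or a combination of levels, of these columns are placed in the same holdback set.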
Cutpoint Column
Assigns a numeric cutpoint column. Rows are assigned to holdback sets based on cutpoints in this column.
Cutpoint Batch ID
When a cutpoint column is assigned, you can also assign a column for cutpoint batch IDs. This enables you to determine cutpoint values within each level of the Cutpoint Batch ID column.
Selected Method
Describes the selected validation column method based on the specified stratification, grouping, and cutpoint columns. After a method is selected and you click OK, you specify the allocations for each set in the Make Validation Column report. See Specify Rates or Relative Rates and Set Cutpoints. There are five methods for constructing the holdback sets:
Random Validation Column
The default method if there are no column assignments in the launch window. This method randomly partitions the data into holdback sets based on the allocations entered in the Make Validation Column report.
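To make the random allocation concrete, the following Python sketch shuffles set labels over the rows. It is an illustration only, not JMP's implementation or a JSL call: the function name `random_validation_column`, the integer relative rates, and the rule that the last set absorbs any rounding remainder are all assumptions.

```python
import random


def random_validation_column(n_rows, rates, seed=0):
    """Return one set label per row, assigned at random.

    rates maps a set name to an integer relative rate (hypothetical format),
    for example {"Training": 6, "Validation": 2, "Test": 2}.
    """
    total = sum(rates.values())
    names = list(rates)
    # Convert relative rates to row counts; the last set takes the remainder.
    counts = [n_rows * rates[name] // total for name in names[:-1]]
    counts.append(n_rows - sum(counts))
    labels = [name for name, count in zip(names, counts) for _ in range(count)]
    # Shuffle so that set membership does not depend on row order.
    random.Random(seed).shuffle(labels)
    return labels


labels = random_validation_column(10, {"Training": 6, "Validation": 2, "Test": 2})
```

With 10 rows and relative rates of 6, 2, and 2, the sketch places 6 rows in Training and 2 rows in each of the other sets.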
Stratified Validation Column
The selected method if one or more stratification columns are assigned. This method partitions the data into balanced sets based on the levels of the specified stratification columns. As in the Random Validation Column method, rows are randomly assigned to the holdback sets based on the allocations entered in the Make Validation Column report. However, this is done at each level or combination of levels of the stratifying columns. Use this method when you want a balanced representation of the levels of a column in each of the training, validation, and test sets.
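The stratified allocation can be sketched in Python as a random split that is repeated inside each level. This is a minimal illustration, not JMP's implementation: the function name `stratified_validation_column`, the integer relative rates, and the rounding rule are assumptions. For more than one stratification column, each entry of `strata` could be a tuple of levels.

```python
import random
from collections import defaultdict


def stratified_validation_column(strata, rates, seed=0):
    """Return one set label per row, splitting each stratum separately.

    strata holds the stratification level of each row; rates maps a set
    name to an integer relative rate (hypothetical format).
    """
    rng = random.Random(seed)
    total = sum(rates.values())
    names = list(rates)
    rows_by_level = defaultdict(list)
    for row, level in enumerate(strata):
        rows_by_level[level].append(row)
    labels = [None] * len(strata)
    for rows in rows_by_level.values():
        rng.shuffle(rows)  # random assignment within this level
        start = 0
        for i, name in enumerate(names):
            if i == len(names) - 1:
                end = len(rows)  # last set takes the rounding remainder
            else:
                end = start + len(rows) * rates[name] // total
            for row in rows[start:end]:
                labels[row] = name
            start = end
    return labels


strata = ["A"] * 10 + ["B"] * 5
labels = stratified_validation_column(
    strata, {"Training": 3, "Validation": 1, "Test": 1}
)
```

In this example, level A contributes 6, 2, and 2 rows to the three sets and level B contributes 3, 1, and 1, so both levels keep the same 3:1:1 proportions.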
Grouped Validation Column
The selected method if one or more grouping columns are assigned. This method partitions the data into sets in such a way that entire levels of a specified column, or combinations of levels of two or more columns, are placed in the same set. Because whole groups are kept together, the sizes of the resulting sets can differ slightly from the sizes that you specified. Use this method when splitting levels across holdback sets is not desirable.
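The following Python sketch shows one way to keep whole groups together and why the resulting set sizes drift from the requested allocations. It is not JMP's algorithm: the function name `grouped_validation_column` and the greedy rule (give each group to the set that is furthest below its target size) are assumptions chosen for illustration.

```python
import random
from collections import Counter


def grouped_validation_column(groups, rates, seed=0):
    """Return one set label per row so that no group is split across sets.

    groups holds the grouping level of each row; rates maps a set name to a
    relative rate (hypothetical format).
    """
    rng = random.Random(seed)
    total = sum(rates.values())
    group_sizes = Counter(groups)
    order = list(group_sizes)
    rng.shuffle(order)  # visit the groups in random order
    targets = {name: len(groups) * rate / total for name, rate in rates.items()}
    filled = {name: 0 for name in rates}
    set_of_group = {}
    for group in order:
        # Choose the set with the largest shortfall against its target size.
        name = max(rates, key=lambda s: targets[s] - filled[s])
        set_of_group[group] = name
        filled[name] += group_sizes[group]
    return [set_of_group[group] for group in groups]


groups = ["p1", "p1", "p1", "p2", "p2", "p3", "p4", "p4", "p4", "p4"]
labels = grouped_validation_column(
    groups, {"Training": 3, "Validation": 1, "Test": 1}
)
```

Because a group of four rows cannot be divided, the final set sizes generally cannot match a 3:1:1 allocation of 10 rows exactly.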
Stratify by Group Validation Column
The selected method if both stratification and grouping columns are assigned. This method balances the levels of the stratification columns across the holdback sets while keeping each specified group together in a single holdback set. As in the Grouped Validation Column method, groups can be defined by the levels of a specified column or by combinations of levels of two or more columns. The sizes of the resulting sets can differ slightly from the sizes that you specified.
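A rough Python sketch of combining the two constraints follows. It is not JMP's algorithm. The function name `stratify_by_group_column` is hypothetical, the sketch assumes that all rows of a group share one stratification level, and it uses a greedy rule within each level (give each group to the set furthest below its target).

```python
import random
from collections import defaultdict


def stratify_by_group_column(strata, groups, rates, seed=0):
    """Return one set label per row: groups stay whole, levels stay balanced."""
    rng = random.Random(seed)
    total = sum(rates.values())
    # Assumption: every row of a group has the same stratification level,
    # so the first level seen for a group stands for the whole group.
    level_of_group = {}
    rows_of_group = defaultdict(list)
    for row, (level, group) in enumerate(zip(strata, groups)):
        level_of_group.setdefault(group, level)
        rows_of_group[group].append(row)
    groups_by_level = defaultdict(list)
    for group, level in level_of_group.items():
        groups_by_level[level].append(group)
    labels = [None] * len(groups)
    for level_groups in groups_by_level.values():
        rng.shuffle(level_groups)
        n_level = sum(len(rows_of_group[g]) for g in level_groups)
        filled = {name: 0 for name in rates}
        for group in level_groups:
            # Within this level, pick the set furthest below its target size.
            name = max(
                rates, key=lambda s: n_level * rates[s] / total - filled[s]
            )
            filled[name] += len(rows_of_group[group])
            for row in rows_of_group[group]:
                labels[row] = name
    return labels


# Ten groups of two rows each; groups 0-4 are in level X, groups 5-9 in level Y.
groups = [g for g in range(10) for _ in range(2)]
strata = ["X" if g < 5 else "Y" for g in groups]
labels = stratify_by_group_column(
    strata, groups, {"Training": 3, "Validation": 1, "Test": 1}
)
```

With equally sized groups, each level here splits into 6, 2, and 2 rows. With unequal group sizes, the per-level counts only approximate the requested allocations.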
Cutpoint Validation Column
The selected method if a cutpoint column is assigned. This method partitions the data into sets based on cutpoints in the cutpoint column, which typically contains time series values. Use this method when you want to assign your data to holdback sets based on time periods. The training set consists of rows between the first cutpoint and the second cutpoint. The validation set consists of rows between the second and third cutpoints. The test set consists of the remaining rows. These sets are chosen based on options in the Set Cutpoints report.
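The cutpoint rule described above can be sketched in Python as follows. This is an illustration, not JMP's implementation: the function name `cutpoint_validation_column` is hypothetical, and treating each interval as including its lower cutpoint and excluding its upper cutpoint is an assumption, because the text does not say how boundary values are handled.

```python
def cutpoint_validation_column(values, cut1, cut2, cut3):
    """Return one set label per row based on three cutpoints.

    values holds the numeric cutpoint column. Boundary handling (lower bound
    included, upper bound excluded) is an assumption.
    """
    labels = []
    for value in values:
        if cut1 <= value < cut2:
            labels.append("Training")
        elif cut2 <= value < cut3:
            labels.append("Validation")
        else:
            # Every other row goes to Test, following the description literally.
            labels.append("Test")
    return labels


days = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
labels = cutpoint_validation_column(days, cut1=1, cut2=7, cut3=9)
```

In this example, days 1 through 6 form the training set, days 7 and 8 form the validation set, and days 9 and 10 form the test set.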