In this example, you use the Cluster Variables platform as a dimension-reduction tool for modeling. The Penta.jmp sample data table contains 15 variables used to predict the response variable, log RAI. Use Cluster Variables to reduce this number.
1.
|
2.
|
Select Analyze > Clustering > Cluster Variables.
|
3.
|
4.
|
Click OK.
|
5.
|
Click the Variable Clustering red triangle and select Save Cluster Components.
|
1.
|
Click the red triangle next to Variable Clustering and select Launch Fit Model.
|
2.
|
3.
|
Be careful not to include Obs Name.
4.
|
Select the box next to Keep dialog open.
|
5.
|
Click Run.
|
6.
|
8.
|
Click Run.
|
The model that includes the five Cluster Components as the only predictors explains a substantial amount of the variation in the response, with an adjusted Rsquare of 0.784. The model that uses all fifteen predictors has only a slightly higher adjusted Rsquare of 0.853 (Fit Least Squares Report for Model with All Continuous Predictors).