A commonly used transformation raises the response to some power. Box and Cox (1964) formalized and described this family of power transformations. The formula for the transformation is constructed to provide a continuous definition in terms of the parameter λ, and so that the error sums of squares are comparable. Specifically, the following equation provides the family of transformations:
The Box Cox Y Transformation option fits transformations from λ = –2 to 2 in increments of 0.2. To choose a proper value of λ, the likelihood function for each of these transformations is computed. They are computed under the assumption that the errors are independent and normal with mean zero and variance σ2. The value of λ that maximizes the likelihood is selected. This value also minimizes the SSE over the values of λ. The value of λ that maximizes the likelihood is found using a quadratic interpolation between the two incremental grid points surrounding the grid point with the smallest SSE.
The Box-Cox Transformations report displays a plot showing the sum of squared errors (SSE) values against the values of λ. The horizontal red line on the plot represents a one-sided 95% confidence interval for λ. This confidence interval is based on the confidence region defined in Box and Cox (1964, p. 216). The confidence region is defined by the following inequality:
Creates a new data table containing parameter estimates and SSE values for all λ from –2 to 2, in increments of 0.2.
1.
|
2.
|
Select Analyze > Fit Model.
|
3.
|
4.
|
Make sure that the Degree box has a 2 in it.
|
5.
|
6.
|
Click Run.
|
Figure 2.54 Box Cox Y Transformation
The plot shows that the best values of λ are between 0.1 and 2.0. The value that JMP selects, using interpolation between the best two values in the 0.2-unit grid of λ values, is 1.124.
7.
|
(Optional) To see the SSE values used to construct the graph, select Table of Estimates.
|