This example illustrates the benefits of the Fractional Weights (Bayesian Bootstrap) option for a small data table. The data consist of a response, Y, measured on three samples of each of seven different soil types. A scientist is interested in finding a confidence interval for the mean response for the wabash soil type.
Because each soil type has only three observations, the simple bootstrap has the potential to exclude all three of the observations for wabash from a bootstrap sample. The Fractional Weights option ensures that all observations for every soil type are represented in all bootstrap samples.
The scientist examines the distribution of wabash sample means from both bootstrap methods:
1.
|
2.
|
Select Analyze > Fit Y by X.
|
3.
|
4.
|
5.
|
Click OK.
|
6.
|
Select Means/Anova from the Oneway Analysis red triangle menu.
|
7.
|
8.
|
Type 1000 for the Number of Bootstrap Samples.
|
9.
|
(Optional) To match the results in Bootstrap Results for a Simple Bootstrap, type 12345 for the Random Seed.
|
10.
|
Click OK.
|
The missing values in Bootstrap Results for a Simple Bootstrap represent bootstrap iterations in which none of the observations for a given soil type were selected for the bootstrap sample.
11.
|
Select Analyze > Distribution.
|
12.
|
13.
|
Click OK.
|
Distribution of wabash Means from a Simple Bootstrap shows the distribution of wabash means from the simple bootstrap analysis. Notice the following:
–
|
The Summary Statistics report indicates that the number of rows containing bootstrap means for wabash is N = 961. Although you conducted 1,000 iterations, 39 bootstrap samples did not contain any of the three observations for wabash.
|
–
|
The histogram of sample means is not smooth, with peaks at the two extremes. The three values for wabash are 38.2, 37.8, and 31.9. The peak at the low end of the distribution results from bootstrap samples that contain only the value 31.9. The peak at the high end results from bootstrap samples that contain one or both of the values 38.2 and 37.8.
|
1.
|
In the Oneway Analysis report, right-click the Mean column in the Means for Oneway Anova report and select Bootstrap.
|
2.
|
Type 1000 for the Number of Bootstrap Samples.
|
3.
|
(Optional) To match the results in Bootstrap Results for a Bayesian Bootstrap, type 12345 for the Random Seed.
|
4.
|
Select the Fractional Weights option.
|
5.
|
Click OK.
|
There are no missing values in the Bayesian Bootstrap results table. All 21 rows in the Snapdragon.jmp data table are included, with varying bootstrap weights, in each bootstrap sample.
6.
|
Select Analyze > Distribution.
|
7.
|
8.
|
Click OK.
|
The Bayesian Bootstrap produces a much smoother distribution for the wabash sample means. All 1,000 bootstrap samples include the three observations for wabash. For each iteration, the wabash sample mean is calculated using different fractional weights.