The Fit Mixture option adds the Mixture outline to the report where you can fit a mixture distribution to the data. For an example, see Fit Mixture Example.
where Fi(x) is one of the supported distributions, k is the number of components in the mixture, and the wi are positive weights that sum to 1. The Fit Mixture option attempts to identify clusters of observations that are drawn from each of the component distributions, Fi(x). It estimates the parameters of the mixture and the probability that an observation is drawn from any given component.
The fitting methodology is based on assumptions about the underlying clusters, called the Starting Value Method. Suppose that you designate k distributions. There are three Starting Value Methods:
•
|
Separable Clusters assumes that the ingredient distributions affect some observations more profoundly than others. For separable clusters, each of the k densities has an identifiable mode and defines a cluster.
|
•
|
Overlapping Clusters assumes a situation that is intermediate between Single Cluster and Separable Clusters. Some densities stand out, but others jointly affect a portion of the observations. In this case, there are m clusters in the data, where m is less than k, the total number of densities.
|
Select the number of components in the mixture distribution that have the given distribution. The sum of the Quantity values is k, the number of densities in the mixture.
Select a method that reflects your assumptions about the mixture. See Model Fit and Mixture Starting Value Methods.
Click Go to fit the desired mixture. The Model List is updated with the model that you fit, and a report with the name of the mixture model is added.
The Model List report lists the mixture distributions that you fit. The report provides the number of parameters, the number of actual observations, and the AICc, -2*LogLikelihood, and BIC statistics for each mixture distribution. For more details about these statistics, see Likelihood, AICc, and BIC in the Fitting Linear Models book.
•
|
The Comparison Criterion red triangle option does not affect the order of models in the Model List.
|
•
|
Parameter estimates are given for each distribution in the mixture. The Parameter column also includes parameters called Portion <i>, where i = 1, 2, .., k-1. These are estimates of the weights wi for the mixture. Since the weights sum to 1, the kth weight can be computed from the first k - 1 weights.
Shows four types of profilers for the combined mixture distribution F. See Mixture Profiler Options for a description of their red triangle options.
For each mixture density, saves a column to the data table containing the probability that an observation belongs to that density. For the formulas used in the calculation, see Fit Mixture Save Predictions Formulas.