Statistical Details for the Naive Bayes Algorithm

For the latest version of JMP Help, visit JMP.com/help.

Predictive and Specialized Modeling > Naive Bayes > Statistical Details for the Naive Bayes Platform > Statistical Details for the Naive Bayes Algorithm

Publication date: 04/21/2023

Statistical Details for the Naive Bayes Algorithm

The Naive Bayes method classifies an observation into the class for which its probability of membership, given the values of its features, is highest. The method assumes that the features are conditionally independent within each class.

Denote the possible classifications by C1, …, CK. Denote the features, or predictors, by X1, X2, …, Xp.

The conditional probability that an observation with predictor values x1, x2, …, xp belongs in the class Ck is computed as follows:

Equation shown here ,

where R is a regularization constant. In the formula above, the conditional probability that an observation with Xj = xj belongs to the class Ck, P(xj|Ck), is determined as follows:

• If Xj is categorical:

Equation shown here

• If Xj is continuous:

Equation shown here

Here, φ is the standard normal density function, and m and s are the mean and standard deviation, respectively, of the predictor values within the class Ck.

The unconditional probability that an observation belongs in class Ck, P(Ck), is computed as follows:

Equation shown here

An observation is classified into the class for which its conditional probability is the largest.

Note: In the formula for P(Ck), 0.5 is the prior bias factor. This value is the default value. To change the prior default factor, go to File > Preferences > Platforms > Naive Bayes, select the Prior Bias check box and change the value.

Want more information? Have questions? Get answers in the JMP User Community (community.jmp.com).