Due to the number of tests being performed and the manner in which the results are to be used, the p-values might require either adjustment or
transformation in order to adequately control false positives or false discovery rates across all tests.
Analysis of genomics data can produce thousands of p-values. Under a global null hypothesis of no associations and assuming that the tests are mutually independent, the
distribution of the
p-values is uniform. The chance of observing one or more small
p-values increases directly with the number of tests conducted. In order to control for global error rates, such as the Familywise Error Rate or the False Discovery Rate, during multiple, simultaneous comparisons,
p-values often should be statistically adjusted to account for multiple testing. The
P-value Adjustment process provides you with a variety of multiple-testing adjustment methods along with log-based transformations, which allow for the rapid and easy discernment of highly significant
p-values.
Running the P-Value Adjustment process generates an output data set (identified by the
_pva suffix) by adding additional columns containing the adjusted
p-values to the input data set. One new column is generated for each
P-value
variable specified on the
dialog.