The
Test Set Average Accuracy Range
tab contains the following elements:
Accuracy is a measure of the proportion of the test set samples that are predicted correctly. Learning curves are constructed by using a succession of different sized subsets of the full data and assessing
cross validation
performance on each.
Sample size
is plotted on the
x
-axis while the cross validation performance metric is plotted on the
y
-axis. The primary goal of this process is to determine whether adding more samples will change performance. This is achieved by inspecting the slope of the curves, especially toward the right-hand side. If the curves have a slope similar to that show in this example, it is likely that adding more samples will improve performance. If the slopes are flat, adding more samples will likely not make much of a difference. Refer to
Accuracy
for more information about this statistic.