Find Threshold (Meta)
Synopsis
Determines confidence thresholds based on misclassification costs, also possible to define costs for the option non-classified.
Description
This operator uses a set of class weights and also allows a weight for the fact that an example is not classified at all (marked as unknown). Based on the predictions of the model of the inner learner this operator optimized a set of thresholds regarding the defined weights.
This operator might be very useful in cases where it is better to not classify an example then to classify it in a wrong way. This way, it is often possible to get very high accuracies for the remaining examples (which are actually classified) for the cost of having some examples which must still be manually classified.
Input
training set
Output
model
example set
This is an example set output port
Parameters
class weights
The weights for all classes, empty: using 1 for all classes. The costs for not classifying at all are defined with class name '?'.
allow unkown predictions
This indicates if unkown predictions are allowed. If checked, the costs for unkown predictions must be specified.
predict unknown costs
Use this cost value for predicting an example as unknown.
training ratio
Use this amount of input data for model learning and the rest for threshold optimization.
number of iterations
Defines the number of optimization iterations.
use local random seed
Indicates if a local random seed should be used.
local random seed
Specifies the local random seed