Skip to main content

Find Threshold (Meta)

Synopsis

Determines confidence thresholds based on misclassification costs, also possible to define costs for the option non-classified.

Description

This operator uses a set of class weights and also allows a weight for the fact that an example is not classified at all (marked as unknown). Based on the predictions of the model of the inner learner this operator optimized a set of thresholds regarding the defined weights.

This operator might be very useful in cases where it is better to not classify an example then to classify it in a wrong way. This way, it is often possible to get very high accuracies for the remaining examples (which are actually classified) for the cost of having some examples which must still be manually classified.

Input

training set

Output

model

example set

This is an example set output port

Parameters

class weights

The weights for all classes, empty: using 1 for all classes. The costs for not classifying at all are defined with class name '?'.

allow unkown predictions

This indicates if unkown predictions are allowed. If checked, the costs for unkown predictions must be specified.

predict unknown costs

Use this cost value for predicting an example as unknown.

training ratio

Use this amount of input data for model learning and the rest for threshold optimization.

number of iterations

Defines the number of optimization iterations.

use local random seed

Indicates if a local random seed should be used.

local random seed

Specifies the local random seed