Skip to main content

Smote Upsampling

Synopsis

This operator implements the Synthetic Minority Over-sampling Technique as proposed by Chawla et. al., Journal of Artificial Intelligence Research 16 (2002), 321 -- 357.

Description

In the first step the ExampleSet is filtered to only consider examples of the minority class. Afterwards a search on the k nearest neighbours for all examples is performed. The algorithm then selects a random example and a random nearest neighbour for this example. A new example is created which is on the line between the two examples.

Input

exa

ExampleSet you want to upsample.

Output

ups

The original ExampleSet with the attached synthetic examples.

ori

The original ExampleSet.

Parameters

Number of neighbours

In SMOTE we calculate the k nearest neighborhood. This parameter defines the number of neighbors to consider.

Normalize

If checked range transformation to [0,1] is performed to make distance calculation solid.

Equalize classes

If activated as many new examples as needed to balance the classes are drawn.

Upsampling size

Defines the number of examples you want to create.

Auto detect minority class

If activated the class to upsample is the class with the least occurrences.

Minority class

Defines the class you want to upsample.

Round integers

Round Integer attributes to the next Integer.

Nominal change rate

Probability to change a nominal value to the nominal value of it's nearest neighbor.

Use local random seed

This parameter indicates if a local random seed should be used.

Local random seed

If the use local random seed parameter is checked this parameter determines the local random seed.