Skip to main content

Select by Random

Synopsis

This operator selects a random subset of attributes of the given ExampleSet.

Description

The Select by Random operator selects attributes randomly from the input ExampleSet. If the use fixed number of attributes parameter is set to true, then the required number of attributes is specified through the number of attributes parameter. Otherwise, a random number of attributes is selected. The randomization can be changed by changing the seed value in the corresponding parameters. This operator can be useful in combination with the Loop Parameters operator or can be used as a baseline for significance test comparisons for feature selection techniques.

Input

example set

This input port expects an ExampleSet. It is the output of the Retrieve operator in the attached Example Process. The output of other operators can also be used as input. It is essential that meta data should be attached with the data for the input because attributes are specified in their meta data. The Retrieve operator provides meta data along-with the data.

Output

example set

The ExampleSet with selected attributes is output of this port.

original

The ExampleSet that was given as input is passed without changing to the output through this port. This is usually used to reuse the same ExampleSet in further operators or to view the ExampleSet in the Results Workspace.

Parameters

Use fixed number of attributes

This parameter specifies if a fixed number of attributes should be selected.

Number of attributes

This parameter is only available when the use fixed number of attributes parameter is set to true. This parameter specifies the number of attributes which should be randomly selected.

Use local random seed

This parameter indicates if a local random seed should be used for randomization. Using the same value of local random seed will produce the same ExampleSet. Changing the value of the local seed changes the randomization, thus the ExampleSet will have a different set of attributes.

Local random seed

This parameter specifies the local random seed. This parameter is only available if the use local random seed parameter is set to true.