Select by Random
Synopsis
This operator selects a random subset of attributes of the given ExampleSet.
Description
The Select by Random operator selects attributes randomly from the input ExampleSet. If the use fixed number of attributes parameter is set to true, then the required number of attributes is specified through the number of attributes parameter. Otherwise, a random number of attributes is selected. The randomization can be changed by changing the seed value in the corresponding parameters. This operator can be useful in combination with the Loop Parameters operator or can be used as a baseline for significance test comparisons for feature selection techniques.
Input
example set
This input port expects an ExampleSet. It is the output of the Retrieve operator in the attached Example Process. The output of other operators can also be used as input. It is essential that meta data should be attached with the data for the input because attributes are specified in their meta data. The Retrieve operator provides meta data along-with the data.
Output
example set
The ExampleSet with selected attributes is output of this port.
original
The ExampleSet that was given as input is passed without changing to the output through this port. This is usually used to reuse the same ExampleSet in further operators or to view the ExampleSet in the Results Workspace.
Parameters
Use fixed number of attributes
This parameter specifies if a fixed number of attributes should be selected.
Number of attributes
This parameter is only available when the use fixed number of attributes parameter is set to true. This parameter specifies the number of attributes which should be randomly selected.
Use local random seed
This parameter indicates if a local random seed should be used for randomization. Using the same value of local random seed will produce the same ExampleSet. Changing the value of the local seed changes the randomization, thus the ExampleSet will have a different set of attributes.
Local random seed
This parameter specifies the local random seed. This parameter is only available if the use local random seed parameter is set to true.