Skip to main content

Generate Massive Data

Synopsis

Generates huge amounts of data for testing purposes.

Description

Generates huge amounts of data in either sparse or dense format. This operator can be used to check if huge amounts of data can be handled for a given process setup without creating the correct format / writing special purpose input operators.

Output

output

The output of this operator.

Parameters

number examples

The number of generated examples.

number attributes

The number of attributes.

sparse fraction

The fraction of default attributes.

sparse representation

Indicates if the example should be internally represented in a sparse format.

use local random seed

Indicates if a local random seed should be used.

local random seed

Specifies the local random seed