Extract Peaks
Synopsis
This operator performs a peak transformation on one or more time series attributes and provides the properties of the peaks as features.
Description
This operator performs a peak transformation on one or more time series. The method to detect the peaks can be selected by the parameter peak detection method. For a detailed description of the peak detection algorithm, please see the help text of the corresponding Peak Transformation operators.
For each detected peak in the time series, the following features are extracted:
peak_<nr>_value: The value of the extremum (minimum / maximum) of the peakpeak_<nr>_extremum_position: The position (series index) of the extremumpeak_<nr>_width: The width of the peak (in units of the indices attributes. For date time indices attributes this is milliseconds)peak_<nr>_high_low_amplitude: The amplitude between the highest and lowest value in the peak (and one slice left and right of the peak)peak_<nr>_type: The type (minimum / maximum) of the peak
If the parameter add index features is selected, additional features are extracted:
peak_<nr>_left_position: The position (series index) of the left side of the peakpeak_<nr>_center: The center position (series index) of the peakpeak_<nr>_right_position: The position (series index) of the right side of the peak
If an indices attribute is provided, additional features are extracted:
peak_<nr>_extremum_index_value: The value of the indices attribute at the extremumpeak_<nr>_left_position: The value of the indices attribute at the left side of the peak (only extracted if parameter add index features is selected)peak_<nr>_center: The value of the indices attribute at the center of the peak (only extracted if parameter add index features is selected)peak_<nr>_right_position: The value of the indices attribute at the right side of the peak (only extracted if parameter add index features is selected)
The extracted features for every detected peak are provided as an ExampleSet at the features output port of the operator. In addition the original time series and the peaked time series (all non-peak values are set to null) are provided at the peaked output port of the operator.
Depending on the parameter add time series name the ExampleSet will have one example with attributes for all combination of time series and features, or n examples, one example per time series. In combination with the Process Windows operator, this operator can be used to calculate features of windows of time series as a preparation for a general machine learning problem.
This operator works only on numerical time series.
Input
example set
The ExampleSet which contains the time series data as attributes.
Output
features
The ExampleSet which contains the calculated peak features as attributes. Depending on the parameter add time series name the ExampleSet will have one example with attributes for all combination of time series and features, or n examples, one example per time series.
original
The ExampleSet that was given as input is passed through without changes.
peaked
This data set contains the original time series and the peaked time series. The peaked time series contains the values of the original time series for the areas where a peak was detected and missing values for non-peak areas. It can be used to compare the peaked time series and the original time series data.
Parameters
Attribute filter type
This parameter allows you to select the filter for the time series attributes selection filter; the method you want to select the attributes which holds the time series values. Only numeric attributes can be selected as time series attributes. The different filter types are:
- all: This option selects all attributes of the ExampleSet to be time series attributes. This is the default option.
- single: This option allows the selection of a single time series attribute. The required attribute is selected by the attribute parameter.
- subset: This option allows the selection of multiple time series attributes through a list (see parameter attributes). If the meta data of the ExampleSet is known all attributes are present in the list and the required ones can easily be selected.
- regular_expression: This option allows you to specify a regular expression for the time series attribute selection. The regular expression filter is configured by the parameters regular expression, use except expression and except expression.
- value_type: This option allows selection of all the attributes of a particular type to be time series attributes. It should be noted that types are hierarchical. For example real and integer types both belong to the numeric type. The value type filter is configured by the parameters value type, use value type exception, except value type.
- block_type: This option allows the selection of all the attributes of a particular block type to be time series attributes. It should be noted that block types may be hierarchical. For example value_series_start and value_series_end block types both belong to the value_series block type. The block type filter is configured by the parameters block type, use block type exception, except block type.
- no_missing_values: This option selects all attributes of the ExampleSet as time series attributes which do not contain a missing value in any example. Attributes that have even a single missing value are not selected.
- numeric_value_filter: All numeric attributes whose examples all match a given numeric condition are selected as time series attributes. The condition is specified by the numeric condition parameter.
Attribute
The required attribute can be selected from this option. The attribute name can be selected from the drop down box of the parameter if the meta data is known.
Attributes
The required attributes can be selected from this option. This opens a new window with two lists. All attributes are present in the left list. They can be shifted to the right list, which is the list of selected time series attributes.