You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 22 Next »

Statistical functions

Box plot

The box plot function will calculate a set of statistical values for each of the selected timeseries and displays these values in the chart as a Box and Whisker Plot.

In the table the following list of statical values is shown for each of the timeseries:

Statistical variables
Minimum outlier: minimum outlier value for selection.
Minimum regular value: minimum value for selection that is not defined as an outlier. Also known as the Whiskers
25%: 25th percentile
Median: 50th percentile
Mean: average value for selection
75%: 75th percentileMaximum regular value: maximum value for selection that is not defined as an outlier. Also known as the WhiskersMaximum regular value: maximum value for selection that is not defined as an outlier. Also known as the Whiskers
Maximum outlier: maximum outlier value for selection.

The chart shows the same values as the table, hover the chart also can include some extra values that are not shown in the table. These are the outliers and the far-out indicators.

Far-out indicator: indicates that there are values that lie outside the plotted range of the axis.
Single outlier: a single outlier value.
Clustered outlier: multiple outliers that are located too close together to be plotted separately.

Outliers: cases where the values are between 1.5 and 3 box-lengths from the 75th percentile or 25th percentile.
Farout values: cases where the values are more than 3 box-lengths from the 75th percentile or 25th percentile.

Calendar aggregation

Central moving average

Display lows below value

A scatterplot is made where the x-axis shows the duration of a 'low' (=values within this low-area are all beneath the given reference level), the y-axis shows the normalized difference between the parameter value and the reference level. The reference level can be altered by entering a value into the input field associated with this statistical function. After clicking 'Apply' the result time series array is returned.
If no reference level is entered, then the 'low' areas are determined according to the maximum available value of the input time series array.

Display peaks above value

A scatterplot is made where the x-axis shows the duration of a 'peak' (=values within this peak-area are all above the given reference level), the y-axis shows the normalized difference between the parameter value and the reference level. The reference level can be altered by entering a value into the input field associated with this statistical function. After clicking 'Apply' the result time series array is returned.
If no reference level is entered, then the 'peak' areas are determined according to the minimum available value of the input time series array.

Duration curve

A convenient way to show the variation of hydrological quantities through time may be done by means of duration curves. For the selected time period the values of the selected quantity are sorted descending (durationExceedence) or ascending (durationNonExceedence). When the duration curve is plotted in the timeseries display, the x-axis will show the entire length in time of the selected view period. Percentages are shown as duration with respect to the entire chosen view period.

In the configuration of this statistical function there is the option to ommit missing values which may occur in the selected view period. If this option is set to true, all entries with missing values will be disregarded before the duration curve is calculated. If this option is not defined (default) or is set to false, missing values will be added to the the end of the array. In this case the plotted duration curve will never reach the 100%.

An example of the durationExceedence plot is given here:

An example of the durationNonExceedence plot is given here:

When selecting more than one location it could occur that the view periods of these selected timeseries do not cover the same period in time. In this case it is difficult to make a correct comparison of the calculated duration curves because they are analysed on different periods in time. A warning message will be given in order to ensure that the user is aware of this. The pop-up message will be shown each time the user zooms in or out until all view periods are an exact match.

Frequency distribution

The frequency distribution function divides the distance between the min and max value of the timeseries by the number of samples to create a classification. It then evaluate each value in the timeseries and assigns it to a class. The result is a frequency distribution diagram.

Gaussian curve

Scatter plot

  • No labels