Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Spatial correlation_tests_

Anchor
IDH_spatial_correlation
IDH_spatial_correlation

Anchor
IDH_correlation
IDH_correlation

General

The function described here is applicable to quality and quantity parameters with a spatial character, like rainfall, temperature, evaporation, etc., but sampled at a number of stations (point measurements).
The two correlation functions, the correlation and the semivariance, give an indication of similarities in mechanisms causing a phenomenon to exhibit. Both functions use the co-ordinates of the selected series in their calculation together with the measured values.

The correlation and semivariance functions both make use the covariance. The covariance of two random variables is defined as the expectation of the product between the respective deviations from their mean. The sample estimator C(x~1~ ,x~2~ ) for the population covariance is computed from:

Series codes

A series can be selected by pointing with your mouse on the series and pressing your left mouse button, a check mark will be shown in the check box of the series ID. To select, or de-select, all the series in one go, click your right mouse button when your mouse pointer is located on top of the series list box and select <Select All>.
Remember that a minimum of 5 series must be selected.

Correlation

Regional patterns from pair-wise combined series can be studied of different time steps by means of the (lag-zero-) cross-correlation coefficient. The cross-correlation is described as the normalised covariance between two stochastic variables. The normalised covariance (=correlation) of two series is derived from dividing the covariance by the product of the standard deviations of each series. The covariance between two stochastic variables can be calculated with:

where
m = mean of Z(x)
n = number of measurements
The sample estimates S(x 1 ) and S(x 1 ) for the population standard deviations are computed from:

Thus the equation for computing the sample correlation coefficient is:

Note:
The correlation coefficient and covariance may be affected by a few eccentric data pairs. A good alignment of a few extreme pairs can dramatically improve an otherwise poor correlation coefficient. Conversely, an otherwise good correlation could be ruined by the poor alignment of a few extreme pairs.

Cross-correlation functions

Functions that describe the relation between inter-station correlation and inter-station distance, the so-called cross-correlation functions, can be determined empirically from a graph or calculated by least squares fitting. The objective is to find the relationship between inter-station correlation and inter-station distance.
There are many types of functions, i.e. higher degree functions, power functions, exponential functions or Bessel functions. HYMOS uses the power function for calculation of the correlation distance:
r(h) = r(0) * Exp(-h/h0) + a
where:
h = distance between two stations
h0 = correlation distance
a = threshold in function
r(0) = correlation at distance zero
The following information must be entered when using this function:

Number of time steps

This option gives the freedom to use an interval of analysis. If, for example, the series interval is month and summers are to be considered (from April to September). Then in the start date April is indicated and the number of time steps is 6. For each year only the monthly values from April to September will be dealt with in the calculation.

Lower Boundary / Upper Boundary

By entering values for the boundaries, only the values between these values will be taken in the calculation.

Correlation distance function

The values for r(0) and a must be entered, these values are input values for the correlation distance function. The maximum inter-station distance is the maximum distance between stations for which the correlation will be calculated. This distance is also the maximum distance in the graph.

Semivariance

Wiki Markup
An estimation of the semivariance can be made from the available n measurements _Zi_ with locations _X_ _~i~_ and _X_ _~j~_ . Each pair of measurements can be associated with a lag, or distance vector _h=X_ _~i~_ _-X_ _~j~_ . The pairs are then grouped in a limited number of lag classes in order to have a significant number of pairs in each class. The mean value of the semivariance of all pairs in a class with a mean distance _h_ _~m~_ , may be estimated of half the spatial variance _var\[Z\(x)-Z(x+h)\]_ divided by the possible pairs of measurements in that distance class. The traditional non-parametric estimator is:
!image009.gif!  
where:
g(h)  = the estimated semivariance for the distance class _h_ ,
Z(x~i~ ),Z(x~i~ +h) = the measured values within a distance class _h_ . 
n(h) = the number of pairs in the distance class _h_ .
\\
The resulting plot is called an experimental semivariogram. The semivariogram shows the expected difference in value of two points against its distance. 
\\
Any found experimental semivariogram is only a reflection of the true semivariogram. Kitanidis \[1993\] gave some useful guidelines to obtain a reasonable semivariogram:

...