Variogram

In spatial statistics the theoretical variogram is a function describing the degree of spatial dependence of a spatial random field or stochastic process.
In the case of a concrete example from the field of gold mining, a variogram will give a measure of how much two samples taken from the mining area will vary in gold percentage depending on the distance between those samples. Samples taken far apart will vary more than samples taken close to each other.

Definition

Semivariogram

The semivariogram was first defined by Matheron as half the average squared difference between points separated at distance. Formally
where is a point in the geometric field, and is the value at that point. For example, suppose we are interested in iron content in soil samples in some region or field. would be the content of iron at some location, where has coordinates of latitude, longitude, and depth. The triple integral is over 3 dimensions. is the separation distance of interest. To obtain the semivariogram for a given, all pairs of points at that exact distance would be sampled. In practice it is impossible to sample everywhere, so the empirical variogram is used instead.

Variogram

The variogram is defined as the variance of the difference between field values at two locations across realizations of the field :
or in other words is twice the semivariogram. If the spatial random field has constant mean, this is equivalent to the expectation for the squared increment of the values between locations and :
In the case of a stationary process, the variogram and semivariogram can be represented as a function of the difference between locations only, by the following relation :
If the process is furthermore isotropic, then the variogram and semivariogram can be represented by a function of the distance only :
The indexes or are typically not written. The terms are used for all three forms of the function. Moreover, the term "variogram" is sometimes used to denote the semivariogram, and the symbol is sometimes used for the variogram, which brings some confusion.

Properties

According to the theoretical variogram has the following properties:

The semivariogram is nonnegative, since it is the expectation of a square.
The semivariogram at distance 0 is always 0, since.
A function is a semivariogram if and only if it is a conditionally negative definite function, i.e. for all weights subject to and locations it holds:
which corresponds to the fact that the variance of is given by the negative of this double sum and must be nonnegative.
As a consequence the semivariogram might be non continuous only at the origin. The height of the jump at the origin is sometimes referred to as nugget or nugget effect.
If the covariance function of a stationary process exists it is related to variogram by
For a non-stationary process the square of the difference between expected values at both points must be added:
If a stationary random field has no spatial dependence, the semivariogram is the constant everywhere except at the origin, where it is zero.
is a symmetric function.
Consequently, is an even function.
If the random field is stationary and ergodic, the corresponds to the variance of the field. The limit of the semivariogram is also called its sill.
Empirical variogram and application

Generally an empirical variogram is needed, because sample information is not available for every location. The sample information for example could be concentration of iron in soil samples, or pixel intensity on a camera. Each piece of sample information has coordinates for a 2D sample space where and are geographical coordinates. In the case of the iron in soil, the sample space could be 3 dimensional. If there is temporal variability as well then could be a 4 dimensional vector. For the case where dimensions have different units then a scaling factor can be applied to each to obtain a modified Euclidean distance.
Sample observations are denoted. Samples may be taken at total different locations. This would provide as set of samples at locations. Generally plots show the semivariogram values as a function of sample point separation. In the case of empirical semivariogram, separation distance bins are used rather than exact distances, and usually isotropic conditions are assumed. Then, the empirical semivariogram can be calculated for each bin:
Or in other words, each pair of points separated by are found. These form the set of points. The number of these points in this bin is. Then for each pair of points, the square of the difference in the observation is found. These squared differences are added together and normalized by the natural number. By definition the result is divided by 2 for the semivariogram at this separation.
For computational speed, only the unique pairs of points are needed. For example, for 2 observations pairs taken from locations with separation only need to be considered, as the pairs do not provide any additional information.
The empirical variogram is used in geostatistics as a first estimate of the variogram needed for spatial interpolation by kriging.
According to, for observations from a stationary random field, the empirical variogram with lag tolerance 0 is an unbiased estimator of the theoretical semivariogram, due to:

Variogram parameters

The following parameters are often used to describe variograms:

nugget : The height of the jump of the semivariogram at the discontinuity at the origin.
sill : Limit of the variogram tending to infinity lag distances.
range : The distance in which the difference of the variogram from the sill becomes negligible. In models with a fixed sill, it is the distance at which this is first reached; for models with an asymptotic sill, it is conventionally taken to be the distance when the semivariance first reaches 95% of the sill.
Variogram models

The empirical variogram cannot be computed at every lag distance and due to variation in the estimation it is not ensured that it is a valid variogram, as defined above. However some Geostatistical methods such as kriging need valid semivariograms. In applied geostatistics the empirical variograms are thus often approximated by model function ensuring validity. Some important models are :

The exponential variogram model
The spherical variogram model
The Gaussian variogram model

The parameter has different values in different references, due to the ambiguity in the definition of the range. E.g. is the value used in. The function is 1 if and 0 otherwise.

Discussion

Three functions are used in geostatistics for describing the spatial or the temporal correlation of observations: these are the correlogram, the covariance and the semivariogram. The last is also more simply called variogram. The sampling variogram, unlike the semivariogram and the variogram, shows where a significant degree of spatial dependence in the sample space or sampling unit dissipates into randomness when the variance terms of a temporally or in-situ ordered set are plotted against the variance of the set and the lower limits of its 99% and 95% confidence ranges.
The variogram is the key function in geostatistics as it will be used to fit a model of the temporal/spatial correlation of the observed phenomenon. One is thus making a distinction between the experimental variogram that is a visualisation of a possible spatial/temporal correlation and the variogram model that is further used to define the weights of the kriging function. Note that the experimental variogram is an empirical estimate of the covariance of a Gaussian process. As such, it may not be positive definite and hence not directly usable in kriging, without constraints or further processing. This explains why only a limited number of variogram models are used: most commonly, the linear, the spherical, the Gaussian and the exponential models.

Related concepts

The squared term in the variogram, for instance, can be replaced with different powers: A madogram is defined with the absolute difference,, and a rodogram is defined with the square root of the absolute difference,. Estimators based on these lower powers are said to be more resistant to outliers. They can be generalized as a "variogram of order α",
in which a variogram is of order 2, a madogram is a variogram of order 1, and a rodogram is a variogram of order 0.5.
When a variogram is used to describe the correlation of different variables it is called cross-variogram. Cross-variograms are used in co-kriging.
Should the variable be binary or represent classes of values, one is then talking about indicator variograms. Indicator variogram is used in indicator kriging.

Example studies

Empirical variograms for the spatiotemporal variability of column-averaged carbon dioxide was used to determine coincidence criteria for satellite and ground-based measurements.
Empirical variograms were calculated for the density of a heterogeneous material.
Empirical variograms are calculated from observations of strong ground motion from earthquakes. These models are used for seismic risk and loss assessments of spatially-distributed infrastructure.

Popular movies

The Hunger Games (film) - 2012 American dystopian action thriller science fiction-adventure film directed by Gary Ross and based on Suzanne Collins’s 2008 novel of the same name. It is the first insta...
untitled Captain Marvel sequel - part of Marvel Cinematic Universe....
Killers of the Flower Moon (film project) - Killers of the Flower Moon - film project in United States of America. It was presented as drama, detective fiction, thriller. The film project starred Leonardo Dicaprio, Robert De Niro. Director of...
Five Nights at Freddy's (film) - Five Nights at Freddy's - film published in 2017 in United States of America. Scenarist of the film - Scott Cawthon....

Popular books

Book of Revelation - The Book of Revelation is the final book of the New Testament, and consequently is also the final book of the Christian Bible. Its title is derived from the first word of the Koine Greek text: apok...
Book of Genesis - account of the creation of the world, the early history of humanity, Israel's ancestors and the origins...
Gospel of Matthew - The Gospel According to Matthew is the first book of the New Testament and one of the three synoptic gospels. It tells how Israel's Messiah, rejected and executed in Israel, pronounces judgement on ...
Michelin Guide - Michelin Guides are a series of guide books published by the French tyre company Michelin for more than a century. The term normally refers to the annually published Michelin Red Guide , the oldest...
Psalms - The Book of Psalms , commonly referred to simply as Psalms , the Psalter or "the Psalms", is the first book of the Ketuvim , the third section of the Hebrew Bible, and thus a book of th...
Ecclesiastes - Ecclesiastes is one of 24 books of the Tanakh , where it is classified as one of the Ketuvim . Originally written c. 450–200 BCE, it is also among the canonical Wisdom literature of the Old Tes...
The 48 Laws of Power - non-fiction book by American author Robert Greene. The book...

Popular television series

The Crown (TV series) - historical drama web television series about the reign of Queen Elizabeth II, created and principally written by Peter Morgan, and produced by Left Bank Pictures and Sony Pictures Tel...
Friends - American sitcom television series, created by David Crane and Marta Kauffman, which aired on NBC from September 22, 1994, to May 6, 2004, lasting ten seasons. With an ensemble cast sta...
Young Sheldon - spin-off prequel to The Big Bang Theory and begins with the character Sheldon...
Modern Family - American television mockumentary family sitcom created by Christopher Lloyd and Steven Levitan for the American Broadcasting Company. It ran for eleven seasons, from September 23...
Loki (TV series) - upcoming American web television miniseries created for Disney+ by Michael Waldron, based on the Marvel Comics character of the same name. It is set in the Marvel Cinematic Universe, shar...
Game of Thrones - American fantasy drama television series created by David Benioff and D. B. Weiss for HBO. It...
Shameless (American TV series) - American comedy-drama television series developed by John Wells which debuted on Showtime on January 9, 2011. It...

Variogram

Definition

Semivariogram

Variogram

Properties

Empirical variogram and application

Variogram parameters

Variogram models

Discussion

Related concepts

Example studies