Randomization


Randomization is the process of making something random; in various contexts this involves, for example:
Randomization is not haphazard. Instead, a random process is a sequence of random variables describing a process whose outcomes do not follow a deterministic pattern, but follow an evolution described by probability distributions. For example, a random sample of individuals from a population refers to a sample where every individual has a known probability of being sampled. This would be contrasted with nonprobability sampling where arbitrary individuals are selected.

Applications

Randomization is used in statistics and in gambling.

Statistics

Randomization is a core principle in statistical theory, whose importance was emphasized by Charles S. Peirce in "Illustrations of the Logic of Science" and "A Theory of Probable Inference". Randomization-based inference is especially important in experimental design and in survey sampling. The first use of "randomization" listed in the Oxford English Dictionary is its use by Ronald Fisher in 1926.

Randomized experiments

In the statistical theory of design of experiments, randomization involves randomly allocating the experimental units across the treatment groups. For example, if an experiment compares a new drug against a standard drug, then the patients should be allocated to either the new drug or to the standard drug control using randomization. Randomization reduces confounding by equalising so-called factors that have not been accounted for in the experimental design.

Survey sampling

uses randomization, following the criticisms of previous "representative methods" by Jerzy Neyman in his 1922 report to the International Statistical Institute.

Resampling

Some important methods of statistical inference use resampling from the observed data. Multiple alternative versions of the data-set that "might have been observed" are created by randomization of the original data-set, the only one observed. The variation of statistics calculated for these alternative data-sets is a guide to the uncertainty of statistics estimated from the original data.

Gambling

Randomization is used extensively in the field of gambling. Because poor randomization may allow a skilled gambler to take advantage, much research has been devoted to effective randomization. A classic example of randomizing is shuffling playing cards.

Techniques

Although historically "manual" randomization techniques were common, nowadays automated techniques are mostly used. As both selecting random samples and random permutations can be reduced to simply selecting random numbers, random number generation methods are now most commonly used, both hardware random number generators and pseudo-random number generators.

Optimization

Randomization is used in optimization to alleviate the computational burden associated to robust control techniques: a sample of values of the uncertainty parameters is randomly drawn and robustness is enforced for these values only. This approach has gained popularity by the introduction of rigorous theories that permit one to have control on the probabilistic level of robustness, see scenario optimization.
Non-algorithmic randomization methods include: