Back

ⓘ Exponentially modified Gaussian distribution. In probability theory, an exponentially modified Gaussian distribution describes the sum of independent normal and ..




Exponentially modified Gaussian distribution
                                     

ⓘ Exponentially modified Gaussian distribution

In probability theory, an exponentially modified Gaussian distribution describes the sum of independent normal and exponential random variables. An exGaussian random variable Z may be expressed as Z = X + Y, where X and Y are independent, X is Gaussian with mean μ and variance σ 2, and Y is exponential of rate λ. It has a characteristic positive skew from the exponential component.

It may also be regarded as a weighted function of a shifted exponential with the weight being a function of the normal distribution.

                                     

1. Definition

The probability density function pdf of the exponentially modified normal distribution is

f x ; μ, σ, λ = λ 2 e λ 2 μ + λ σ 2 − 2 x erfc ⁡ μ + λ σ 2 − x 2 σ, {\displaystyle fx;\mu,\sigma,\lambda={\frac {\lambda }{2}}e^

This density function is derived via convolution of the normal and exponential probability density functions.

                                     

2. Alternative forms for computation

An alternative but equivalent form of the EMG distribution is used for description of peak shape in chromatography. This is as follows

f x ; h, μ, σ, τ = h σ τ π 2 exp ⁡ 1 2 σ τ 2 − x − μ τ) erfc ⁡ 1 2 σ τ − x − μ σ), {\displaystyle fx;h,\mu,\sigma,\tau={\frac {h\sigma }{\tau }}{\sqrt {\frac {\pi }{2}}}\exp \left{\frac {1}{2}}\left{\frac {\sigma }{\tau }}\right^{2}-{\frac {x-\mu }{\tau }}\right)\operatorname {erfc} \left{\frac {1}{\sqrt {2}}}\ \left{\frac {\sigma }{\tau }}-{\frac {x-\mu }{\sigma }}\right\right),} 1

where

h {\displaystyle h} is the amplitude of Gaussian, τ = 1 λ {\displaystyle \tau ={\frac {1}{\lambda }}} is exponent relaxation time.

This function cannot be calculated for some values of parameters for example, τ=0 because of arithmetic overflow. Alternative, but equivalent form of writing the function was proposed by Delley:

f x ; h, μ, σ, τ = h exp ⁡ − 1 2 x − μ σ 2) σ τ π 2 erfcx ⁡ 1 2 σ τ − x − μ σ), {\displaystyle fx;h,\mu,\sigma,\tau=h\exp \left-{\frac {1}{2}}\left{\frac {x-\mu }{\sigma }}\right^{2}\right){\frac {\sigma }{\tau }}{\sqrt {\frac {\pi }{2}}}\operatorname {erfcx} \left{\frac {1}{\sqrt {2}}}\ \left{\frac {\sigma }{\tau }}-{\frac {x-\mu }{\sigma }}\right\right),} 2

where erfcx ⁡ t = exp ⁡ t 2 ⋅ erfc ⁡ t {\displaystyle \operatorname {erfcx} t=\exp t^{2}\cdot \operatorname {erfc} t} is a scaled complementary error function

In the case of this formula arithmetic overflow is also possible, region of overflow is different from the first formula, except for very small τ.

For small τ it is reasonable to use asymptotic form of the second formula:

f x ; h, μ, σ, τ = h exp ⁡ − 1 2 x − μ σ 2) 1 + x − μ τ σ 2, {\displaystyle fx;h,\mu,\sigma,\tau={\frac {h\exp \left-{\frac {1}{2}}\left{\frac {x-\mu }{\sigma }}\right^{2}\right)}{1+{\frac {\leftx-\mu \right\tau }{\sigma ^{2}}}}},} 3

Decision on formula usage is made on the basis of the parameter z = 1 2 σ τ − x − μ σ {\displaystyle z={\frac {1}{\sqrt {2}}}\left{\frac {\sigma }{\tau }}-{\frac {x-\mu }{\sigma }}\right}:

for z < 0 computation should be made according to the first formula, for 0 ≤ z ≤ 6.71 10 7 in the case of double-precision floating-point format according to the second formula, and for z > 6.71 10 7 according to the third formula.

Mode position of apex, most probable value is calculated using derivative of formula 2; inverse of scaled complementary error function erfcxinv is used for calculation. The apex is always located on the original unmodified Gaussian.

                                     

3. Parameter estimation

There are three parameters: the mean of the normal distribution μ, the standard deviation of the normal distribution σ and the exponential decay parameter τ = 1 / λ. The shape K = τ / σ is also sometimes used to characterise the distribution. Depending on the values of the parameters, the distribution may vary in shape from almost normal to almost exponential.

The parameters of the distribution can be estimated from the sample data with the method of moments as follows:

m = μ + τ, {\displaystyle m=\mu +\tau,} s 2 = σ 2 + τ 2, {\displaystyle s^{2}=\sigma ^{2}+\tau ^{2},} γ 1 = 2 τ 3 σ 2 + τ 2 3 / 2, {\displaystyle \gamma _{1}={\frac {2\tau ^{3}}{\sigma ^{2}+\tau ^{2}^{3/2}}},}

where m is the sample mean, is the sample standard deviation, and γ 1 is the skewness.

Solving these for the parameters gives:

μ ^ = m − s γ 1 2 1 / 3, {\displaystyle {\hat {\mu }}=m-s\left{\frac {\gamma _{1}}{2}}\right^{1/3},} σ 2 ^ = s 2,} τ ^ = s γ 1 2 1 / 3. {\displaystyle {\hat {\tau }}=s\left{\frac {\gamma _{1}}{2}}\right^{1/3}.}


                                     

3.1. Parameter estimation Recommendations

Ratcliff has suggested that there be at least 100 data points in the sample before the parameter estimates should be regarded as reliable. Vincent averaging may be used with smaller samples, as this procedure only modestly distorts the shape of the distribution. These point estimates may be used as initial values that can be refined with more powerful methods, including maximum likelihood.

                                     

3.2. Parameter estimation Confidence intervals

There are currently no published tables available for significance testing with this distribution. The distribution can be simulated by forming the sum of two random variables one drawn from a normal distribution and the other from an exponential.

                                     

3.3. Parameter estimation Skew

The value of the nonparametric skew

mean − median standard deviation {\displaystyle {\frac

of this distribution lies between 0 and 0.31. The lower limit is approached when the normal component dominates, and the upper when the exponential component dominates.

                                     

4. Usage

The distribution is used as a theoretical model for the shape of chromatographic peaks. It has been proposed as a statistical model of intermitotic time in dividing cells. It is also used in modelling cluster ion beams. It is commonly used in psychology and other brain sciences in the study of response times. In a slight variant where the mean of the Normal component is set to zero, it is also used in Stochastic Frontier Analysis, as one of the distributional specifications for the composed error term that models inefficiency.

                                     

5. Related distributions

This family of distributions is a special or limiting case of the normal-exponential-gamma distribution. This can also be seen as a three-parameter generalization of a normal distribution to add skew; another distribution like that is the skew normal distribution, which has thinner tails. The distribution is a compound probability distribution in which the mean of a normal distribution varies randomly as a shifted exponential distribution.

A Gaussian minus exponential distribution has been suggested for modelling option prices. If such a random variable Y has parameters μ, σ, λ, then its negative -Y has an exponentially modified Gaussian distribution with parameters -μ, σ, λ, and thus Y has mean μ − 1 λ {\displaystyle \mu -{\tfrac {1}{\lambda }}} and variance σ 2 + 1 λ 2 {\displaystyle \sigma ^{2}+{\tfrac {1}{\lambda ^{2}}}}.

                                     
  • hypergeometric - Gaussian HyGG modes can be listed as the modified Bessel - Gaussian modes, the modified exponential Gaussian modes, and the modified Laguerre Gaussian
  • statistics, the multivariate normal distribution multivariate Gaussian distribution or joint normal distribution is a generalization of the one - dimensional
  • then the sum converges to a stable distribution with stability parameter equal to 2, i.e. a Gaussian distribution There are other possibilities as well
  • the gamma distribution is a two - parameter family of continuous probability distributions The exponential distribution Erlang distribution and chi - squared
  • mean. Two other distributions often used in test - statistics are also ratio distributions the t - distribution arises from a Gaussian random variable divided
  • where I0 z is the modified Bessel function of the first kind with order zero. In the context of Rician fading, the distribution is often also rewritten
  • constructed from the Gaussian distributions For a Gaussian process, all sets of values have a multidimensional Gaussian distribution Analogiusly, X t
  • parameter of the distribution Its complementary cumulative distribution function is a stretched exponential function. The Weibull distribution is related to
  • normal distribution Exponential random numbers  redirect to subsection of Exponential distribution Exponential smoothing Exponentially modified Gaussian distribution
  • Laplace distributions are extensions of the Laplace distribution and the asymmetric Laplace distribution to multiple variables. The marginal distributions of
  • been modified slightly: J: modified peak on right L: unimodal peak on left F: no peak flat Under this classification bimodal distributions are

Users also searched:

...
...
...