Laplace distribution

From Infogalactic: the planetary knowledge core
Jump to: navigation, search
Laplace
Probability density function
Probability density plots of Laplace distributions
Cumulative distribution function
Cumulative distribution plots of Laplace distributions
Parameters \mu location (real)
b > 0 scale (real)
Support x \in (-\infty; +\infty)\,
PDF \frac{1}{2\,b} \exp \left(-\frac{|x-\mu|}b \right) \,
CDF \begin{cases}
      \frac12 \exp \left( \frac{x-\mu}{b} \right) & \mbox{if }x < \mu
             \\[8pt]
          1-\frac12 \exp \left( -\frac{x-\mu}{b} \right) & \mbox{if }x \geq \mu
       \end{cases}
Mean \mu
Median \mu
Mode \mu
Variance 2b^2
Skewness 0
Ex. kurtosis 3
Entropy 1+\log(2b)
MGF \frac{\exp(\mu\,t)}{1-b^2\,t^2}\,\!\text{ for }|t|<1/b\,
CF \frac{\exp(\mu\,i\,t)}{1+b^2\,t^2}\,\!

In probability theory and statistics, the Laplace distribution is a continuous probability distribution named after Pierre-Simon Laplace. It is also sometimes called the double exponential distribution, because it can be thought of as two exponential distributions (with an additional location parameter) spliced together back-to-back, although the term 'double exponential distribution' is also sometimes used to refer to the Gumbel distribution. The difference between two independent identically distributed exponential random variables is governed by a Laplace distribution, as is a Brownian motion evaluated at an exponentially distributed random time. Increments of Laplace motion or a variance gamma process evaluated over the time scale also have a Laplace distribution.

Characterization

Probability density function

A random variable has a Laplace(μ, b) distribution if its probability density function is

f(x\mid\mu,b) = \frac{1}{2b} \exp \left( -\frac{|x-\mu|}{b} \right) \,\!
    = \frac{1}{2b}
    \left\{\begin{matrix}
      \exp \left( -\frac{\mu-x}{b} \right) & \text{if }x < \mu
      \\[8pt]
      \exp \left( -\frac{x-\mu}{b} \right) & \text{if }x \geq \mu
    \end{matrix}\right.

Here, μ is a location parameter and b > 0, which is sometimes referred to as the diversity, is a scale parameter. If μ = 0 and b = 1, the positive half-line is exactly an exponential distribution scaled by 1/2.

The probability density function of the Laplace distribution is also reminiscent of the normal distribution; however, whereas the normal distribution is expressed in terms of the squared difference from the mean μ, the Laplace density is expressed in terms of the absolute difference from the mean. Consequently the Laplace distribution has fatter tails than the normal distribution.

Differential equation

The pdf of the Laplace distribution is a solution of the following differential equation:

\begin{cases}
  \left\{\begin{array}{l}
  b f'(x)+f(x)=0 \\[8pt]
  f(0)=\frac{e^{\frac{\mu}{b}}}{2b}\end{array}\right\} & \text{if } x \geq \mu \\[8pt]
  \left\{\begin{array}{l}
  b f'(x)-f(x)=0 \\[8pt]
  f(0)=\frac{e^{-\frac{\mu}{b}}}{2b}\end{array}\right\} & \text{if } x < \mu
  \end{cases}

Cumulative distribution function

The Laplace distribution is easy to integrate (if one distinguishes two symmetric cases) due to the use of the absolute value function. Its cumulative distribution function is as follows:

\begin{align}
F(x) &= \int_{-\infty}^x \!\!f(u)\,\mathrm{d}u  = \begin{cases}
             \frac12 \exp \left( \frac{x-\mu}{b} \right) & \mbox{if }x < \mu \\
             1-\frac12 \exp \left( -\frac{x-\mu}{b} \right) & \mbox{if }x \geq \mu
            \end{cases} \\
&=\tfrac{1}{2} + \tfrac{1}{2} \sgn(x-\mu) \left(1-\exp \left(-\frac{|x-\mu|}{b} \right ) \right ).
\end{align}

The inverse cumulative distribution function is given by

F^{-1}(p) = \mu - b\,\sgn(p-0.5)\,\ln(1 - 2|p-0.5|).

Generating random variables according to the Laplace distribution

Given a random variable U drawn from the uniform distribution in the interval (−1/2, 1/2], the random variable

X=\mu - b\,\sgn(U)\,\ln(1 - 2|U|)

has a Laplace distribution with parameters μ and b. This follows from the inverse cumulative distribution function given above.

A Laplace(0, b) variate can also be generated as the difference of two i.i.d. Exponential(1/b) random variables. Equivalently, a Laplace(0, 1) random variable can be generated as the logarithm of the ratio of two iid uniform random variables.

Parameter estimation

Given N independent and identically distributed samples x1, x2, ..., xN, the maximum likelihood estimator \hat{\mu} of μ is the sample median,[1] and the maximum likelihood estimator of b is

\hat{b} = \frac{1}{N} \sum_{i = 1}^{N} |x_i - \hat{\mu}|

(revealing a link between the Laplace distribution and least absolute deviations).

Moments

\mu_r' = \bigg({\frac{1}{2}}\bigg) \sum_{k=0}^r \bigg[{\frac{r!}{k! (r-k)!}} b^k \mu^{(r-k)} k! \{1 + (-1)^k\}\bigg]=\frac{m^{n+1}}{2 b}\left(e^{m /b} E_{-n}(m/b)-e^{-m/b} E_{-n}(-m/b)\right)

where E_n( ) is the generalized exponential integral function E_n(x)=x^{n-1}\Gamma(1-n,x).

Related distributions

  • If X ~ Laplace(μ, b) then kX + c ~ Laplace(kμ + c, kb).
  • If X ~ Laplace(0, b) then |X| ~ Exponential(b−1).
  • If X, Y ~ Exponential(λ) then XY ~ Laplace(0, λ−1) .
  • If X ~ Laplace(μ, b) then |X − μ| ~ Exponential(b−1).
  • If X ~ Laplace(μ, b) then X ~ EPD(μ, b, 0).
  • If X1, ... X4 ~ N(0, 1) then X1X2X3X4 ~ Laplace(0, 1).
  • If Xi ~ Laplace(μ, b) then \frac{\displaystyle 2 \sum_{i=1}^n |X_i-\mu|}{b} \sim \chi^2(2n) \, (Chi-squared distribution)
  • If X, Y ~ Laplace(μ, b) then  \tfrac{|X-\mu|}{|Y-\mu|} \sim \operatorname{F}(2,2) (F-distribution)
  • If X, Y ~ U(0, 1) then log(X/Y) ~ Laplace(0, 1).
  • If X ~ Exponential(λ) and Y ~ Bernoulli(0.5) independent of X, then X(2Y − 1) ~ Laplace(0, λ−1).
  • If X ~ Exponential(λ) and Y ~ Exponential(ν) independent of X, then λX − νY ~ Laplace(0, 1) .
  • If X has a Rademacher distribution and Y ~ Exp(λ) then XY ~ Laplace(0, 1/λ)
  • If V ~ Exponential(1) and Z ~ N(0, 1) independent of V, then X = \mu + b \sqrt{2 V}Z \sim \mathrm{Laplace}(\mu,b).
  • If X ~ GeometricStable(2, 0, λ, 0) then X ~ Laplace(0, λ).
  • The Laplace distribution is a limiting case of the hyperbolic distribution.
  • If X|Y ~ Normal(μ, σ = Y) with Y ~ Rayleigh(b) then X ~ Laplace(μ, b).

Relation to the exponential distribution

A Laplace random variable can be represented as the difference of two iid exponential random variables.[2] One way to show this is by using the characteristic function approach. For any set of independent continuous random variables, for any linear combination of those variables, its characteristic function (which uniquely determines the distribution) can be acquired by multiplying the corresponding characteristic functions.

Consider two i.i.d random variables X, Y ~ Exponential(λ). The characteristic functions for X, −Y are

\frac{\lambda }{-i t+\lambda }, \quad \frac{\lambda }{i t+\lambda }

respectively. On multiplying these characteristic functions (equivalent to the characteristic function of the sum of therandom variables X + (−Y)), the result is

\frac{\lambda ^2}{(-i t+\lambda ) (i t+\lambda )} = \frac{\lambda ^2}{t^2+\lambda ^2}.

This is the same as the characteristic function for Z ~ Laplace(0,1/λ), which is

\frac{1}{1+\frac{t^2}{\lambda ^2}}.

Sargan distributions

Sargan distributions are a system of distributions of which the Laplace distribution is a core member. A pth order Sargan distribution has density[3][4]

f_p(x)=\tfrac{1}{2} \exp(-\alpha |x|)  \frac{\displaystyle 1+\sum_{j=1}^p \beta_j \alpha^j |x|^j}{\displaystyle 1+\sum_{j=1}^p j!\beta_j},

for parameters α ≥ 0, βj ≥ 0. The Laplace distribution results for p = 0.

Applications

The Laplacian distribution has been used in speech recognition to model priors on DFT coefficients [5] and in JPEG image compression to model AC coefficients [6] generated by a DCT.

The addition of noise drawn from a Laplacian distribution, with scaling parameter appropriate to a function's sensitivity, to the output of a statistical database query is the most common means to provide differential privacy in statistical databases.

In regression analysis, the least absolute deviations estimate arises as the maximum likelihood estimate if the errors have a Laplace distribution.

History

This distribution is often referred to as Laplace's first law of errors. He published it in 1774 when he noted that the frequency of an error could be expressed as an exponential function of its magnitude once its sign was disregarded.[7][8]

Keynes published a paper in 1911 based on his earlier thesis wherein he showed that the Laplace distribution minimised the absolute deviation from the median.[9]

See also

References

  1. Lua error in package.lua at line 80: module 'strict' not found.
  2. Lua error in package.lua at line 80: module 'strict' not found.
  3. Everitt, B.S. (2002) The Cambridge Dictionary of Statistics, CUP. ISBN 0-521-81099-X
  4. Johnson, N.L., Kotz S., Balakrishnan, N. (1994) Continuous Univariate Distributions, Wiley. ISBN 0-471-58495-9. p. 60
  5. Lua error in package.lua at line 80: module 'strict' not found.
  6. Lua error in package.lua at line 80: module 'strict' not found.
  7. Laplace, P-S. (1774). Mémoire sur la probabilité des causes par les évènements. Mémoires de l’Academie Royale des Sciences Presentés par Divers Savan, 6, 621–656
  8. Wilson EB (1923) First and second laws of error. JASA 18, 143
  9. Keynes JM (1911) The principal averages and the laws of error which lead to them. J Roy Stat Soc, 74, 322–331

External links

  • Lua error in package.lua at line 80: module 'strict' not found.