Kullback-Leibler divergence #
Definition of Kullback-Leibler divergence and basic facts
If X, Y
are two G
-valued random variables, the Kullback--Leibler divergence is defined as
KL(X ‖ Y) := ∑ₓ 𝐏(X = x) log (𝐏(X = x) / 𝐏(Y = x))
.
Note that this definition only makes sense when X
is absolutely continuous wrt to Y
,
i.e., ∀ x, 𝐏(Y = x) = 0 → 𝐏(X = x) = 0
. Otherwise, the divergence should be infinite, but since
we use real numbers for ease of computations, this is not a possible choice.
Pretty printer defined by notation3
command.
Equations
- One or more equations did not get rendered due to their size.
If X, Y
are two G
-valued random variables, the Kullback--Leibler divergence is defined as
KL(X ‖ Y) := ∑ₓ 𝐏(X = x) log (𝐏(X = x) / 𝐏(Y = x))
.
Note that this definition only makes sense when X
is absolutely continuous wrt to Y
,
i.e., ∀ x, 𝐏(Y = x) = 0 → 𝐏(X = x) = 0
. Otherwise, the divergence should be infinite, but since
we use real numbers for ease of computations, this is not a possible choice.
Equations
- One or more equations did not get rendered due to their size.
If X, Y
are two G
-valued random variables, the Kullback--Leibler divergence is defined as
KL(X ‖ Y) := ∑ₓ 𝐏(X = x) log (𝐏(X = x) / 𝐏(Y = x))
.
Note that this definition only makes sense when X
is absolutely continuous wrt to Y
,
i.e., ∀ x, 𝐏(Y = x) = 0 → 𝐏(X = x) = 0
. Otherwise, the divergence should be infinite, but since
we use real numbers for ease of computations, this is not a possible choice.
Equations
- One or more equations did not get rendered due to their size.
Pretty printer defined by notation3
command.
Equations
- One or more equations did not get rendered due to their size.
If X'
is a copy of X
, and Y'
is a copy of Y
, then KL(X' ‖ Y') = KL(X ‖ Y)
.
KL(X ‖ Y) ≥ 0
.
KL(X ‖ Y) = 0
if and only if Y
is a copy of X
.
If
If
The distribution of X + Z
is the convolution of the distributions of Z
and X
when these
random variables are independent.
Probably already available somewhere in some form, but I couldn't locate it.
The distribution of X + Z
is the convolution of the distributions of Z
and X
when these
random variables are independent.
Probably already available somewhere in some form, but I couldn't locate it.
If
If
If
Equations
- One or more equations did not get rendered due to their size.
Pretty printer defined by notation3
command.
Equations
- One or more equations did not get rendered due to their size.
Pretty printer defined by notation3
command.
Equations
- One or more equations did not get rendered due to their size.
If
Equations
- One or more equations did not get rendered due to their size.
If
KL(X|Z ‖ Y) ≥ 0
.