Ruzsa distance #
Here we define Ruzsa distance and establish its basic properties.
Main definitions #
rdist
: The Ruzsa distance $d[X ; Y]$ between two random variablescondRuzsaDist
: The conditional Ruzsa distance $d[X | Z ; Y | W]$ (or $d[X ; Y | W]$) between two random variables, conditioned on additional random variables
Main results #
rdist_triangle
: The Ruzsa triangle inequality for three random variables.kaimanovich_vershik
. $$H[X + Y + Z] - H[X + Y] \leq H[Y+ Z] - H[Y]$$ent_bsg
: If $Z=A+B$, then $$\sum_{z} P[Z=z] d[(A | Z = z) ; (B | Z = z)] \leq 3 I[A :B] + 2 H[Z] - H[A] - H[B]$$
Entropy depends continuously on the measure.
The Ruzsa distance rdist X Y
or d[X ; Y]
between two random variables is defined as
H[X'- Y'] - H[X']/2 - H[Y']/2
, where X', Y'
are independent copies of X, Y
.
Equations
- d[X ; μ # Y ; μ'] = H[fun (x : G × G) => x.1 - x.2 ; (MeasureTheory.Measure.map X μ).prod (MeasureTheory.Measure.map Y μ')] - H[X ; μ] / 2 - H[Y ; μ'] / 2
Instances For
The Ruzsa distance rdist X Y
or d[X ; Y]
between two random variables is defined as
H[X'- Y'] - H[X']/2 - H[Y']/2
, where X', Y'
are independent copies of X, Y
.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Pretty printer defined by notation3
command.
Equations
- One or more equations did not get rendered due to their size.
Instances For
The Ruzsa distance rdist X Y
or d[X ; Y]
between two random variables is defined as
H[X'- Y'] - H[X']/2 - H[Y']/2
, where X', Y'
are independent copies of X, Y
.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Pretty printer defined by notation3
command.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Explicit formula for the Ruzsa distance.
Ruzsa distance of random variables equals Ruzsa distance of the kernels.
Ruzsa distance depends continuously on the measure.
Ruzsa distance between random variables equals Ruzsa distance between their distributions.
Ruzsa distance depends continuously on the second measure.
If X', Y'
are copies of X, Y
respectively then d[X' ; Y'] = d[X ; Y]
.
If X, Y
are independent G
-random variables then d[X ; Y] = H[X - Y] - H[X]/2 - H[Y]/2
.
d[X ; Y] ≤ H[X]/2 + H[Y]/2
.
Applying an injective homomorphism does not affect Ruzsa distance.
d[X ; 0] = H[X] / 2
.
d[X ; Y] = d[Y ; X]
|H[X] - H[Y]| ≤ 2 d[X ; Y]
.
H[X - Y] - H[X] ≤ 2d[X ; Y]
.
H[X - Y] - H[Y] ≤ 2d[X ; Y]
.
d[X ; Y] ≥ 0
.
If $G$ is an additive group and $X$ is a $G$-valued random variable and $H\leq G$ is a finite subgroup then, with $\pi:G\to G/H$ the natural homomorphism we have (where $U_H$ is uniform on $H$) $\mathbb{H}(\pi(X))\leq 2d[X;U_H].$
Adding a constant to a random variable does not change the Rusza distance.
A variant of rdist_add_const
where one adds constants to both variables.
The improved entropic Ruzsa triangle inequality.
The entropic Ruzsa triangle inequality
The conditional Ruzsa distance d[X|Z ; Y|W]
.
Equations
- d[X | Z ; μ # Y | W ; μ'] = dk[ProbabilityTheory.condDistrib X Z μ ; MeasureTheory.Measure.map Z μ # ProbabilityTheory.condDistrib Y W μ' ; MeasureTheory.Measure.map W μ']
Instances For
Pretty printer defined by notation3
command.
Equations
- One or more equations did not get rendered due to their size.
Instances For
The conditional Ruzsa distance d[X|Z ; Y|W]
.
Equations
- One or more equations did not get rendered due to their size.
Instances For
The conditional Ruzsa distance d[X|Z ; Y|W]
.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Pretty printer defined by notation3
command.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Explicit formula for conditional Ruzsa distance $d[X|Z; Y|W]$ in a fintype.
Explicit formula for conditional Ruzsa distance $d[X|Z; Y|W]$.
$$ d[X|Z; Y|W] = d[Y|W; X|Z]$$
The conditional Ruzsa distance d[X ; Y|W]
.
Equations
- One or more equations did not get rendered due to their size.
Instances For
The conditional Ruzsa distance d[X ; Y|W]
.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Pretty printer defined by notation3
command.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Pretty printer defined by notation3
command.
Equations
- One or more equations did not get rendered due to their size.
Instances For
The conditional Ruzsa distance d[X ; Y|W]
.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Conditional Ruzsa distance equals Ruzsa distance of associated kernels.
Explicit formula for conditional Ruzsa distance d[X ; Y | W]
.
Alternate formula for conditional Ruzsa distance d[X ; Y | W]
when T is a Fintype.
Version of condRuzsaDist'_prod_eq_sum
when W
has finite codomain.
Explicit formula for conditional Ruzsa distance d[X ; Y | W]
, in integral form.
Conditioning by a constant does not affect Ruzsa distance.
If $(X,Z)$
and $(Y,W)$
are independent, then
d[X | Z ; Y | W] = H[X'- Y' | Z', W'] - H[X'|Z']/2 - H[Y'|W']/2
.
Formula for conditional Ruzsa distance for independent sets of variables.
The conditional Ruzsa distance is unchanged if the sets of random variables are replaced with copies.
The Kaimanovich-Vershik inequality. H[X + Y + Z] - H[X + Y] ≤ H[Y + Z] - H[Y]
.
A version of the Kaimanovich-Vershik inequality with some variables negated.
The entropic Balog-Szemerédi-Gowers inequality. Let $A, B$ be $G$-valued random variables on
$\Omega$, and set $Z := A+B$. Then
$$\sum_{z} P[Z=z] d[(A | Z = z) ; (B | Z = z)] \leq 3 I[A :B] + 2 H[Z] - H[A] - H[B]. $$
TODO: remove the hypothesis of Fintype G
from here and from condIndep_copies'
Suppose that $(X, Z)$ and $(Y, W)$ are random variables, where $X, Y$ take values in an abelian group. Then $$d[X | Z ; Y | W] \leq d[X ; Y] + \tfrac{1}{2} I[X : Z] + \tfrac{1}{2} I[Y : W]$$
Let $X, Y, Z$ be random variables taking values in some abelian group, and with $Y, Z$ independent. Then we have $$d[X ; Y + Z] -d[X ; Y] \leq \tfrac{1}{2} (H[Y+ Z] - H[Y])$$ $$= \tfrac{1}{2} d[Y ; Z] + \tfrac{1}{4} H[Z] - \tfrac{1}{4} H[Y]$$ and $$d[X ; Y|Y+ Z] - d[X ; Y] \leq \tfrac{1}{2} \bigl(H[Y+ Z] - H[Z]\bigr)$$ $$= \tfrac{1}{2} d[Y ; Z] + \tfrac{1}{4} H[Y] - \tfrac{1}{4} H[Z]$$