PFR

3 Entropic Ruzsa calculus

In this section G will be a finite additive group. (May eventually want to generalize to infinite G.)

Lemma 3.1 Negation preserves entropy
#

If X is G-valued, then H[X]=H[X].

Proof

If X,Y are G-valued, then H[X±Y|Y]=H[X|Y] and H[X±Y,Y]=H[X,Y].

Proof

If X,Y are G-valued random variables on Ω, we have

max(H[X],H[Y])I[X:Y]H[X±Y].
Proof

If X,Y are G-valued random variables on Ω and Z is another random variable on Ω then

max(H[X|Z],H[Y|Z])I[X:Y|Z]H[X±Y|Z],
Proof
Corollary 3.5 Independent lower bound on sumset

If X,Y are independent G-valued random variables, then

max(H[X],H[Y])H[X±Y].
Proof

One random variable is said to be a copy of another if they have the same distribution.

Lemma 3.6 Copy preserves entropy

If X is a copy of X then H[X]=H[X].

Proof

Let Xi:ΩiSi be random variables for i=1,,k. Then if one gives i=1kSi the product measure of the laws of Xi, the coordinate functions (xj)j=1kxi are jointly independent random variables which are copies of the X1,,Xk.

Proof
Definition 3.8 Ruzsa distance
#

Let X,Y be G-valued random variables (not necessarily on the same sample space). The Ruzsa distance d[X;Y] between X and Y is defined to be

d[X;Y]:=H[XY]H[X]/2H[Y]/2

where X,Y are (the canonical) independent copies of X,Y from Lemma 3.7.

Lemma 3.9 Distance from zero
#

If X is a G-valued random variable and 0 is the random variable taking the value 0 everywhere then

d[X;0]=H(X)/2.
Proof
Lemma 3.10 Copy preserves Ruzsa distance

If X,Y are copies of X,Y respectively then d[X;Y]=d[X;Y].

Proof
Lemma 3.11 Ruzsa distance in independent case

If X,Y are independent G-random variables then

d[X;Y]:=H[XY]H[X]/2H[Y]/2.
Proof
Lemma 3.12 Distance symmetric
#

If X,Y are G-valued random variables, then

d[X;Y]=d[Y;X].
Proof
Lemma 3.13 Distance controls entropy difference
#

If X,Y are G-valued random variables, then

|H[X]H[Y]|2d[X;Y].
Proof
Lemma 3.14 Distance controls entropy growth

If X,Y are independent G-valued random variables, then

H[XY]H[X],H[XY]H[Y]2d[X;Y].
Proof
Lemma 3.15 Distance nonnegative
#

If X,Y are G-valued random variables, then

d[X;Y]0.
Proof
Lemma 3.16 Projection entropy and distance
#

If G is an additive group and X is a G-valued random variable and HG is a finite subgroup then, with π:GG/H the natural homomorphism we have (where UH is uniform on H)

H(π(X))2d[X;UH].
Proof
Lemma 3.17 Improved Ruzsa triangle inequality
#

If X,Y,Z are G-valued random variables on Ω with (X,Y) independent of Z, then

H[XY]H[XZ]+H[ZY]H[Z]
1

This is an improvement over the usual Ruzsa triangle inequality because X,Y are not assumed to be independent. However we will not utilize this improvement here.

Proof
Lemma 3.18 Ruzsa triangle inequality
#

If X,Y,Z are G-valued random variables, then

d[X;Y]d[X;Z]+d[Z;Y].
Proof
Definition 3.19 Conditioned Ruzsa distance
#

If (X,Z) and (Y,W) are random variables (where X and Y are G-valued) we define

d[X|Z;Y|W]:=z,wP[Z=z]P[W=w]d[(X|Z=z);(Y|(W=w))].

similarly

d[X;Y|W]:=wP[W=w]d[X;(Y|(W=w))].
Lemma 3.20 Alternate form of distance

The expression d[X|Z;Y|W] is unchanged if (X,Z) or (Y,W) is replaced by a copy. Furthermore, if (X,Z) and (Y,W) are independent, then

d[X|Z;Y|W]=H[XY|Z,W]H[X|Z]/2H[Y|W]/2

and similarly

d[X;Y|W]=H[XY|W]H[X]/2H[Y|W]/2.
Proof
Lemma 3.21 Kaimanovich-Vershik-Madiman inequality
#

Suppose that X,Y,Z are independent G-valued random variables. Then

H[X+Y+Z]H[X+Y]H[Y+Z]H[Y].
Proof
Lemma 3.22 Existence of conditional independent trials
#

For X,Y random variables, there exist random variables X1,X2,Y on a common probability space with (X1,Y),(X2,Y) both having the distribution of (X,Y), and X1,X2 conditionally independent over Y in the sense of Definition 2.28.

Proof
Lemma 3.23 Balog-Szemerédi-Gowers
#

Let A,B be G-valued random variables on Ω, and set Z:=A+B. Then

zP[Z=z]d[(A|Z=z);(B|Z=z)]3I[A:B]+2H[Z]H[A]H[B].
2

Proof
Lemma 3.24 Upper bound on conditioned Ruzsa distance

Suppose that (X,Z) and (Y,W) are random variables, where X,Y take values in an abelian group. Then

d[X|Z;Y|W]d[X;Y]+12I[X:Z]+12I[Y:W].

In particular,

d[X;Y|W]d[X;Y]+12I[Y:W].
Proof
Lemma 3.25 Comparison of Ruzsa distances, I

Let X,Y,Z be random variables taking values in some abelian group of characteristic 2, and with Y,Z independent. Then we have

d[X;Y+Z]d[X;Y]12(H[Y+Z]H[Y])=12d[Y;Z]+14H[Z]14H[Y].

and

d[X;Y|Y+Z]d[X;Y]12(H[Y+Z]H[Z])=12d[Y;Z]+14H[Y]14H[Z].
Proof
Lemma 3.26 Comparison of Ruzsa distances, II
#

Let X,Y,Z,Z be random variables taking values in some abelian group, and with Y,Z,Z independent. Then we have

d[X;Y+Z|Y+Z+Z]d[X;Y]12(H[Y+Z+Z]+H[Y+Z]H[Y]H[Z]).
Proof