3 Entropic Ruzsa calculus
In this section will be a finite additive group. (May eventually want to generalize to infinite .)
Lemma
3.1
Negation preserves entropy
Lemma
3.2
Shearing preserves entropy
If are -valued, then and .
Lemma
3.3
Lower bound of sumset
If are -valued random variables on , we have
Corollary
3.4
Conditional lower bound on sumset
If are -valued random variables on and is another random variable on then
Proof
▶
This follows from Lemma 3.3 by conditioning to and summing over (weighted by ).
Corollary
3.5
Independent lower bound on sumset
If are independent -valued random variables, then
One random variable is said to be a copy of another if they have the same distribution.
Lemma
3.6
Copy preserves entropy
Lemma
3.7
Existence of independent copies
Let be random variables for . Then if one gives the product measure of the laws of , the coordinate functions are jointly independent random variables which are copies of the .
Definition
3.8
Ruzsa distance
Let be -valued random variables (not necessarily on the same sample space). The Ruzsa distance between and is defined to be
where are (the canonical) independent copies of from Lemma 3.7.
Lemma
3.9
Distance from zero
If is a -valued random variable and is the random variable taking the value everywhere then
Proof
▶
This is an immediate consequence of the definitions and and .
Lemma
3.10
Copy preserves Ruzsa distance
If are copies of respectively then .
Lemma
3.11
Ruzsa distance in independent case
If are independent -random variables then
Lemma
3.12
Distance symmetric
If are -valued random variables, then
Lemma
3.13
Distance controls entropy difference
If are -valued random variables, then
Lemma
3.14
Distance controls entropy growth
If are independent -valued random variables, then
Lemma
3.15
Distance nonnegative
If are -valued random variables, then
Lemma
3.16
Projection entropy and distance
If is an additive group and is a -valued random variable and is a finite subgroup then, with the natural homomorphism we have (where is uniform on )
Lemma
3.17
Improved Ruzsa triangle inequality
If are -valued random variables on with independent of , then
This is an improvement over the usual Ruzsa triangle inequality because are not assumed to be independent. However we will not utilize this improvement here.
Lemma
3.18
Ruzsa triangle inequality
If are -valued random variables, then
Proof
▶
By Lemma 3.10 and Lemmas 3.7, 3.11, it suffices to prove this inequality assuming that are defined on the same space and are independent. But then the claim follows from Lemma 3.17 and Definition 3.8.
Definition
3.19
Conditioned Ruzsa distance
If and are random variables (where and are -valued) we define
similarly
Lemma
3.20
Alternate form of distance
The expression is unchanged if or is replaced by a copy. Furthermore, if and are independent, then
and similarly
Lemma
3.21
Kaimanovich-Vershik-Madiman inequality
Suppose that are independent -valued random variables. Then
Proof
▶
From Corollary 2.20 we have
However, using Lemmas 2.24, 2.2 repeatedly we have , and . The claim then follows from a calculation.
Lemma
3.22
Existence of conditional independent trials
For random variables, there exist random variables on a common probability space with both having the distribution of , and conditionally independent over in the sense of Definition 2.28.
Lemma
3.23
Balog-Szemerédi-Gowers
Let be -valued random variables on , and set . Then
Proof
▶
Let and (and , which by abuse of notation we call ) be conditionally independent trials of relative to as produced by Lemma 3.22, thus and are coupled through the random variable , which by abuse of notation we shall also call .
Observe from Lemma 3.11 that the left-hand side of 2 is
since, crucially, and are independent for all .
Applying submodularity (Corollary 2.21) gives
We estimate the second, third and fourth terms appearing here. First note that, by Corollary 2.30 and Lemma 2.2 (noting that the tuple determines the tuple since )
Next observe that
Finally, we have
Substituting 7, 8 and 9 into 4 yields
and so by Corollary 2.19
Since
and similarly for , we see that 3 is bounded by as claimed.
Lemma
3.24
Upper bound on conditioned Ruzsa distance
Suppose that and are random variables, where take values in an abelian group. Then
In particular,
Lemma
3.25
Comparison of Ruzsa distances, I
Let be random variables taking values in some abelian group of characteristic , and with independent. Then we have
and
Lemma
3.26
Comparison of Ruzsa distances, II
Let be random variables taking values in some abelian group, and with independent. Then we have
Proof
▶
By Lemma 3.25 (with a change of variables) we have
Adding this to 10 gives the result. □