6 Entropy version of PFR
Throughout this chapter, , and are -valued random variables.
Definition
6.2
functional
If are two -valued random variables, then
Lemma
6.3
depends only on distribution
If are copies of , then .
Definition
6.4
-minimizer
A pair of -valued random variables are said to be a -minimizer if one has
for all -valued random variables .
Proposition
6.5
has minimum
A pair of -minimizers exist.
Proof
▶
By Lemma 6.3, only depends on the probability distributions of . This ranges over a compact space, and is continuous. So has a minimum.
6.1 Basic facts about minimizers
In this section we assume that are -minimizers. We also write .
Lemma
6.6
Distance lower bound
For any -valued random variables , one has
Lemma
6.7
Conditional distance lower bound
For any -valued random variables and random variables , one has
Proof
▶
Apply Lemma 6.6 to conditioned random variables and then average.
6.2 First estimate
We continue the assumptions from the preceding section.
Let be independent random variables, with copies of and copies of . (This is possible thanks to Lemma 3.7.)
We also define the quantity
Lemma
6.8
Fibring identity for first estimate
Lemma
6.9
Lower bound on distances
Lemma
6.10
Lower bound on conditional distances
Lemma
6.11
Upper bound on distance differences
Proof
▶
Immediate from Lemma 3.25 (and recalling that is defined to be ).
Lemma
6.12
First estimate
One can also extract the following useful inequality from the proof of the above lemma.
Lemma
6.13
Entropy bound on quadruple sum
With the same notation, we have
6.3 Second estimate
We continue the assumptions from the preceding section. We introduce the quantity
Lemma
6.14
Distance between sums
Lemma
6.16
Second estimate
6.4 Endgame
Let be as before, and introduce the random variables
and
Lemma
6.17
Symmetry identity
Lemma
6.18
Bound on conditional mutual informations
Proof
▶
From the definitions of and Lemma 6.17, we see that
Applying Lemma 6.12 and Lemma 6.16 we have the inequalities
We conclude that
and the claim follows from some calculation.
Lemma
6.19
Bound on distance increments
Proof
▶
By Lemma 3.26 (taking , , and , so that and ) we have, noting that ,
Further applications of Lemma 3.26 give
and
where . To treat , first note that this equals , since for a fixed choice of we have (here we need some helper lemma about Ruzsa distance). Now we may apply Lemma 3.26 to obtain
Summing these six estimates and using Lemma 6.13, we conclude that
as required.
Proof
▶
Obvious because we are in characteristic two.
For the next two lemmas, let be a -valued random variable such that holds identically. Set
Lemma
6.21
Constructing good variables, I
(Note: in the paper, this lemma was phrased in a more intuitive formulation that is basically the contrapositive of the one here. Similarly for the next two lemmas.)
Proof
▶
We apply Lemma 3.23 with there. Since , the conclusion is that
The right-hand side in 4 can be rearranged as
using the fact (from Lemma 2.2) that all three terms are equal to and hence to each other. We also have
by Lemma 3.24, and similarly
Putting the above observations together, we have
where we introduce the notation
On the other hand, from Lemma 6.6 we have , and the claim follows.
Lemma
6.22
Constructing good variables, II
Proof
▶
Average Lemma 6.21 over all six permutations of .
Theorem
6.23
-decrement
Let be tau-minimizers. Then .
Proof
▶
Set . Applying Lemma 6.22 with any random variables such that holds identically, we deduce that
Note that is still defined by 3 and thus depends on . In particular we may apply this for
for in the range of (which is a valid choice by Lemma 6.20) and then average over with weights , to obtain
where
Putting this together with Lemma 6.18 and Lemma 6.19, we conclude that
since the quantity is non-negative (by Lemma 6.12), and its coefficient in the above expression is non-positive provided that , which is certainly the case with Definition 6.1. Moreover, from Definition 6.1 we have . It follows that , as desired.
6.5 Conclusion
Theorem
6.24
Entropy version of PFR
Let , and suppose that are -valued random variables. Then there is some subgroup such that
where is uniformly distributed on . Furthermore, both and are at most .
Note: a “stretch goal” for this project would be to obtain a ‘decidable‘ analogue of this result (see the remark at the end of Section 2 for some related discussion).