Analytic Number Theory Exponent Database

10 Large value additive energy

10.1 Additive energy

Definition 10.1 Additive energy
#

Let \(W\) be a finite set of real numbers. The additive energy \(E_1(W)\) of such a set is defined to be the number of quadruples \((t_1,t_2,t_3,t_4) \in W\) such that

\[ |t_1 + t_2 - t_3 - t_4| \leq 1. \]

We remark that in additive combinatorics, the variant \(E_0(W)\) of the additive energy is often studied, in which \(t_1+t_2-t_3-t_4\) is not merely required to be \(1\)-bounded, but in fact vanishes exactly. However, this version of additive energy is less relevant for analytic number theory applications.

Lemma 10.2 Basic properties of additive energy
  • If \(W\) is a finite set of reals, then

    \[ E_1(W) \asymp \int _\mathbf{R}|\# \{ (t_1,t_2) \in W: |t_1+t_2 - x| \leq 1\} |^2\ dx. \]

    More generally, for any \(r{\gt}0\) we have

    \[ E_1(W) \asymp r^{O(1)} \int _\mathbf{R}|\# \{ (t_1,t_2) \in W: |t_1+t_2 - x| \leq r\} |^2\ dx. \]
  • If \(W\) is a finite set of reals, then

    \[ E_1(W) \asymp \int _{-1}^1 |\sum _{t \in W} e(t\theta )|^4\ d\theta . \]
  • If \(W_1,\dots ,W_k\) are finite sets of reals, then

    \[ E_1(W_1 \cup \dots \cup W_k)^{1/4} \ll E_1(W_1)^{1/4} + \dots + E_1(W_k)^{1/4}. \]
  • If \(W\) is \(1\)-separated and contained in an interval of length \(T \geq 1\), then

    \[ (\# W)^2, (\# W)^4/T \ll E_1(W) \ll (\# W)^3. \]
  • If \(W\) is contained in an interval \(I\), which is in turn split into \(K\) equally sized subintervals \(J_1,\dots ,J_K\), then

    \[ E_1(W)^{1/3} \ll \sum _{k=1}^K E_1(W \cap J_k)^{1/3}. \]

Note that the lower bound of \((\# W)^4 / T\) would be expected to be attained if the set \(W\) is distributed “randomly” and is reasonably large (of size \(\gg \sqrt{T}\)). So getting upper bounds of the additive energy of similar strength to this lower bound can be viewed as a statement of “pseudorandomness” (or “Gowers uniformity”) of this set.

Proof

For (i), we just prove the first estimate, as the second follows from the first by several applications of the triangle inequality. The right-hand side can be expanded as

\[ \sum _{t_1,t_2,t_3,t_4 \in W} |\{ x: |t_1+t_2-x|, |t_3+t_4-x| \leq 1 \} |. \]

Every quadruple contributing to \(E_1(W)\) then contributes \(\gg 1\) to the right-hand side, giving the upper bound. To get the matching lower bound, note that

\[ \sum _{t_1,t_2,t_3,t_4 \in W} |\{ x: |t_1+t_2-x|, |t_3+t_4-x| \leq 1/2 \} \leq E_1(W) \]

and hence

\[ E_1(W) \gg \int _\mathbf{R}|\# \{ (t_1,t_2) \in W: |t_1+t_2 - x| \leq 1/2\} |^2\ dx. \]

The upper bound then follows from the triangle inequality.

For (ii), we can upper bound the indicator function of \([-1,1]\) by the Fourier transform of a non-negative bump function \(\varphi \), so that the right-hand side is bounded by

\[ \sum _{t_1,t_2,t_3,t_4 \in W} \varphi (t_1+t_2-t_3-t_4) \]

which is then bounded by \(O(E_1(W))\) by choosing the support of \(\varphi \) appropriately. The lower bound is established similarly (using the arguments in (i) to adjust the error tolerance \(1\) in the constraint \( |t_1+t_2 - x| \leq 1\) as necessary.)

For (iii), first observe we may remove duplicates and assume that the \(W_i\) are disjoint, then we can use (ii) and the triangle inequality.

For (iv), the first lower bound comes from considering the diagonal case \(t_1=t_3, t_2 = t_4\) and the upper bound comes from observing that once \(t_1,t_2,t_3\) are fixed, there are only \(O(1)\) choices for \(t_4\) thanks to the \(1\)-separated hypothesis. Finally, observe that

\[ \int _\mathbf{R}|\# \{ (t_1,t_2) \in W: |t_1+t_2 - x| \leq 1\} |\ dx = (2 \# W)^2 \]

hence by Cauchy–Schwarz

\[ \int _\mathbf{R}|\# \{ (t_1,t_2) \in W: |t_1+t_2 - x| \leq 1\} |^2\ dx \gg (\# W)^2/T \]

and the claim follows from (i).

For (v), write \(a_k := E_1(W \cap J_k)^{1/4}\). Each tuple \((t_1,t_2,t_3,t_4)\) that contributes to \(E_1(W)\) is associated to a tuple \(J_{k_1}, J_{k_2}, J_{k_3}, J_{k_4}\) of intervals with \(k_1+k_2-k_3-k_4=O(1)\). By modifying the proof of (ii), the total contribution of such a tuple of intervals is

\[ \ll \int _\mathbf{R}\prod _{j=1}^4 |\sum _{t \in W \cap J_{k_j}} e(t\theta )|\ d\theta \]

which by Cauchy–Schwarz is bounded by

\[ \ll a_{k_1} a_{k_2} a_{k_3} a_{k_4}. \]

Thus we see that

\[ E_1(W) \ll \sum _{m=O(1)} a * a * \tilde a * \tilde a(m) \]

where \(\tilde a_k := a_{-k}\) and \(*\) denotes convolution on the integers. By Young’s inequality we then have

\[ E_1(W) \ll \| a\| _{\ell ^{4/3}}^4 \]

and the claim follows.

We remark that (v) can also be proven using [ 42 , Lemma 4.8, (4.2) ] .

We will also study the following related quantity. Given a set \(W\) and a scale \(N{\gt}1\), let \(S(N,W)\) denote the double zeta sum

\begin{equation} \label{snw-def} S(N,W) := \sum _{t,t' \in W} \left|\sum _{n \in [N,2N]} n^{-i(t-t')} \right|^2. \end{equation}
1

We caution that this normalization differs from the one in [ 144 ] , where \(n^{-1/2-i(t-t')}\) is used in place of \(n^{-i(t-t')}\). This sum may also be rearranged as

\begin{equation} \label{snw} S(N,W) = \sum _{n,m \in [N,2N]} |R_W(n/m)|^2 \end{equation}
2

where \(R_W\) is the exponential sum

\[ R_W(x) := \sum _{t \in W} x^{it}. \]

From the first formula it is clear that \(S(N,W)\) is monotone non-decreasing in \(W\), and from the second formula one has the triangle inequality

\begin{equation} \label{S-triangle} S\left(N, \bigcup _{i=1}^k W_i \right)^{1/2} \leq \sum _{i=1}^k S(N,W_i)^{1/2} \end{equation}
3

when the \(W_i\) are disjoint, and hence also when they are not assumed to be disjoint, thanks to the monotonicity.

The following Cauchy–Schwarz inequality is also useful:

Lemma 10.3 Cauchy–Schwarz and double \(\zeta \)-sums

[ 22 , Lemma 3.4 ] If \(W,W'\) are finite sets of reals, \(N{\gt}1\), and \(a_n\) is a \(1\)-bounded sequence for \(n \in [N,2N]\), then

\begin{equation} \label{ttww} \sum _{t \in W, t' \in W'} \left|\sum _{n \in [N,2N]} a_n n^{-it} \right|^2 \leq S(N,W)^{1/2} S(N,W')^{1/2}. \end{equation}
4

In particular

\[ \sum _{t \in W} \left|\sum _{n \in [N,2N]} a_n n^{-it} \right|^2 \leq S(N,W). \]
Proof

The left-hand side of 4 can be rewritten as

\[ \sum _{n,m \in [N,2N]} a_n \overline{a_m} \overline{R_W}(n/m) R_{W'}(n/m). \]

The claim is now immediate from 2 and the Cauchy–Schwarz inequality.

To relate \(S(N,W)\) to \(E_1(W)\), we first observe the following lemma, implicit in [ 104 ] and made more explicit in [ 88 , Lemma 11.4 ] .

Lemma 10.4 Energy controlled by third moment

Suppose that \((N,T,V,(a_n)_{n \in [N,2N]},J,W)\) is a large value pattern with \(T \geq 1\) and \(1 \leq N \ll T^{O(1)}\). Then

\[ V^2 E_1(W) \ll T^{o(1)} \sum _{n,m \in [N,2N]} |R_W(n/m)|^3 + T^{-50}. \]
Proof

By hypothesis, we have

\[ V^2 E_1(W) \leq \sum _{t_1,t_2,t_3,t_4 \in W: |t_1+t_2-t_3-t_4| \leq 1} \left|\sum _{n \in [N,2N]} a_n n^{-it_4} \right|^2. \]

By standard Fourier arguments (see [ 88 , Lemma 11.3 ] ), we can bound

\[ \left|\sum _{n \in [N,2N]} a_n n^{-it_4}\right| \ll T^{o(1)} \int _{t: |t-t_4| \leq T^{o(1)}} \left|\sum _{n \in [N,2N]} a_n n^{-it}\right|\ dt + T^{-100}. \]

Since each \(t_1,t_2,t_3\) generates at most \(O(1)\) choices for \(t_4\), we conclude that

\[ V^2 E_1(W) \ll T^{o(1)} \sum _{t_1,t_2,t_3 \in W} \int _{s: |s| \leq T^{o(1)}} \left|\sum _{n \in [N,2N]} a_n n^{-i(t_1+t_2-t_3+s)} \right|^2\ ds + T^{-50}, \]

The right-hand side can be rewritten as

\[ T^{o(1)} \sum _{n,m \in [N,2N]} a_n \overline{a_m} (n/m)^{-is} \overline{R_W}(n/m)^2 R_W(n/m) + T^{-50}, \]

and the claim then follows from the triangle inequality.

Thus, \(S(N,W)\) involves a second moment of \(R_W\), while the energy \(E_1(W)\) is related to the third moment. Using the trivial bound \(|R_W(x)| \leq |W|\) we can then obtain the trivial bound

\begin{equation} \label{energy-triv} V^2 E_1(W) \ll T^{o(1)} |W| S(N,W) + T^{-50} \end{equation}
5

It is then natural to introduce the fourth moment

\[ S_4(N,W) := \sum _{n,m \in [N,2N]} |R_W(n/m)|^4 \]

since from Hölder’s inequality one now has

\begin{equation} \label{v1w} V^2 E_1(W) \ll T^{o(1)} S(N,W)^{1/2} S_4(N,W)^{1/2} + T^{-50} \end{equation}
6

(cf. [ 104 , Lemma 3 ] ). The quantity \(S_4(N,W)\) can also be expressed as

\[ S_4(N,W) = \sum _{t_1,t_2,t_3,t_4 \in W} \left|\sum _{n \in [N,2N]} n^{-i(t_1+t_2-t_3-t_4)}\right|^2. \]

One can bound this quantity by an \(S(N,W)\) type expression:

Lemma 10.5

If \(W \subset [-T,T]\) is \(1\)-separated and \(1 \leq N \ll T^{O(1)}\), then one has

\[ S_4(N,W) \ll T^{o(1)} u^2 S(N,U) + T^{-100} \]

for some \(1 \leq u \ll |W|\) and \(1\)-separated subset \(U\) of \([-2T,2T]\) with

\begin{equation} \label{v1} u |U| \ll |W|^2 \end{equation}
7

and

\begin{equation} \label{v2} u^2 |U| \ll E_1(W). \end{equation}
8

This result appears implicitly in [ 104 , p. 229 ] , and is made more explicit in the proof of [ 88 , Lemma 11.6 ] .

Proof

One can bound

\[ S_4(N,W) \ll T^{o(1)} \sum _{t_1,t_2,t_3,t_4 \in W} \int _{t = t_1+t_2-t_3-t_4+O(T^{o(1)})} \left|\sum _{n \in [N,2N]} n^{-it}\right|^2\ dt + T^{-100}, \]

and hence

\[ S_4(N,W) \ll T^{o(1)} \sum _{t_1,t_2 \in [-2N,2N] \cap \mathbf{Z}} f(t_1) f(t_2) \int _{t = t_1-t_2+O(T^{o(1)})} \left|\sum _{n \in [N,2N]} n^{-it}\right|^2\ dt + T^{-100} \]

where \(f\) is the counting function

\[ f(t) := |\{ (t_1,t_2) \in W: |t-t_1-t_2| \leq 1 \} |. \]

Note that \(f\) is integer valued and bounded above by \(|W|\). By dyadic decomposition, one can then find \(1 \leq u \ll |W|\) and a subset \(U\) of \([-2N,2N] \cap \mathbf{Z}\) such that \(f(t) \asymp u\) for \(t \in U\), and

\[ S_4(N,W) \ll T^{o(1)} \sum _{t_1,t_2 \in U} u^2 \int _{t = t_1-t_2+O(T^{o(1)})} \left|\sum _{n \in [N,2N]} n^{-it} \right|^2\ dt + T^{-100} \]

which we can rearrange as

\[ S_4(N,W) \ll T^{o(1)} u^2 \int _{s = O(T^{o(1)})} \sum _{n,m \in [N,2M]} (n/m)^{is} |R_U(n/m)|^2\ ds + T^{-100} \]

and hence by the triangle inequality

\[ S_4(N,W) \ll T^{o(1)} v^2 S(N,V) + T^{-100}. \]

Also, by double counting one easily verifies the claims 7, 8. The claim follows.

10.2 Large value additive energy region

Because the cardinality \(|W|\) and additive energy \(E_1(W)\) of a set \(W\) are correlated with each other, as well as with the double zeta sum \(S(N,W)\), we will not be able to consider them separately, and instead we will need to consider the possible joint exponents for these two quantities. We formalize this via the following set:

Definition 10.6 Large value energy region
#

The large value energy region \(\mathcal{E}\subset \mathbf{R}^5\) is defined to be the set of all fixed tuples \((\sigma ,\tau ,\rho ,\rho ^*,s)\) with \(1/2 \leq \sigma \leq 1\), \(\tau , \rho , \rho ' \geq 0\), such that there exists a large value pattern \((N,T,V,(a_n)_{n \in [N,2N]},J,W)\) with \(N{\gt}1\) unbounded, \(V = N^{\sigma +o(1)}\), \(T = N^{\tau +o(1)}\), \(V = N^{\sigma +o(1)}\), \(|W| = N^{\rho +o(1)}\), \(E_1(W) = N^{\rho ^*+o(1)}\) and \(S(N,W) = N^{s+o(1)}\).

We define the zeta large value energy region \(\mathcal{E}_\zeta \subset \mathbf{R}^5\) similarly, but where now \((N,T,V,(a_n)_{n \in [N,2N]},J,W)\) is required to be a zeta large value pattern.

Clearly we have

Lemma 10.7 Trivial containment

We have \(\mathcal{E}_\zeta \subset \mathcal{E}\).

These regions are related to \(\mathrm{LV}(\sigma ,\tau )\) and \(\mathrm{LV}_{\zeta }(\sigma ,\tau )\) as follows:

For any fixed \(1/2 \leq \sigma \leq 1, \tau \geq 0\), we have

\[ \mathrm{LV}(\sigma ,\tau ) = \sup \{ \rho : (\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\} \]

and

\[ \mathrm{LV}_\zeta (\sigma ,\tau ) = \sup \{ \rho : (\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}_\zeta \} \]

In particular, we have \(\rho \leq \mathrm{LV}(\sigma ,\tau )\) for all \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\), and \(\rho \leq \mathrm{LV}_\zeta (\sigma ,\tau )\) for all \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}_\zeta \).

Proof

Clear from definition.

Inspired by this, we can define

Definition 10.9

For any fixed \(1/2 \leq \sigma \leq 1, \tau \geq 0\), we define

\[ \mathrm{LV}^*(\sigma ,\tau ) := \sup \{ \rho ^*: (\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\} \]

and

\[ \mathrm{LV}^*_\zeta (\sigma ,\tau ) := \sup \{ \rho ^*: (\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}_\zeta \} . \]

Thus these exponents are upper bounds for the additive energy of large values of Dirichlet polynomials which may or may not be of zeta function type.

As usual, we have an equivalent non-asymptotic definition of the large value energy region:

Lemma 10.10 Non-asymptotic form of large value energy region

Let \(1/2 \leq \sigma \leq 1\), \(\tau \geq 0\), \(\rho , \rho ^* \geq 0\), and \(s \in \mathbf{R}\) be fixed. Then the following are equivalent:

  • \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\).

  • For every \(\varepsilon {\gt}0\) there exists \(C, \delta {\gt} 0\) such that there is a large value pattern \((N,T,V,(a_n)_{n \in [N,2N]},J,W)\) with \(N \geq C\), \(N^{\tau -\delta } \leq T \leq N^{\tau +\delta }\), \(N^{\sigma -\delta } \leq V \leq N^{\sigma +\delta }\), \(N^{\rho -\varepsilon } \leq |W| \leq N^{\rho +\varepsilon }\), \(N^{\rho ^*-\varepsilon } \leq E_1(W) \leq N^{\rho ^*+\varepsilon }\), and \(N^{s-\varepsilon } \leq S(N, W) \leq N^{s+\varepsilon }\).

Similarly with \(\mathcal{E}\) replaced by \(\mathcal{E}_\zeta \), and with \((N,T,V,(a_n)_{n \in [N,2N]},J,W)\) required to be a zeta large value pattern.

This lemma is proven by a routine expansion of the definitions, and is omitted.

Lemma 10.11 Basic properties
  • (Monotonicity in \(\sigma \)) If \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\), then \((\sigma ',\tau ',\rho ,\rho ^*,s) \in \mathcal{E}\) for all \(1/2 \leq \sigma ' \leq \sigma \) and \(\tau ' \geq \tau \).

  • (Subdivision) If \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\) and \(0 \leq \tau ' \leq \tau \), then amongst all tuples \((\sigma ,\tau ', \rho ', (\rho ')^*,s') \in \mathcal{E}\) with \(\rho ' \leq \rho \), \((\rho ')^* \leq \rho ^*\), and \(s' \leq s\), there exists a tuple with

    \[ \rho \leq \rho ' + \tau - \tau '; \]

    there exists a tuple with

    \[ \rho ^* \leq \rho ' + 3\min (\rho -\rho ',\tau -\tau '); \]

    and there exists a tuple with

    \[ s \leq s' + 2\min (\rho -\rho ',\tau -\tau '). \]

    (But it may not be the same tuple that satisfies all three properties.)

  • (Trivial bounds) If \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\), one has

    \[ 2\rho , 4\rho -\tau \leq \rho ^* \leq 3 \rho . \]
Proof

The claim (i) is trivial, so we turn to (ii). By definition, there exists a large value pattern \((N,T,V,(a_n)_{n \in [N,2N]},J,W)\) with \(N {\gt} 1\) unbounded, \(T = N^{\tau +o(1)}\), \(V = N^{\sigma +o(1)}\), \(|W| = N^{\rho +o(1)}\), \(E_1(W) = N^{\rho ^*+o(1)}\), and \(S(N,W) = N^{s+o(1)}\). We now partition \(J\) into \(N^{\tau -\tau '+o(1)}\) subintervals \(I\) of length \(N^{\tau '+o(1)}\), and subdivide \(W\) into \(W_I\) accordingly. By dyadically pigeonholing, we can then subdivide this collection \(I\) of intervals into \(N^{o(1)}\) subcollections, where on each subcollection there exists fixed \(\rho ', (\rho ')^*, s'\) such that \(|W_I| = N^{\rho '+o(1)}\), \(E_1(W_I) = N^{(\rho ')^* + o(1)}\), and \(S(N,W_I) = N^{s'+o(1)}\). Since \(W_I \subset W\), this forces \(\rho ' \leq \rho \), \((\rho ')^* \leq \rho ^*\), and \(s' \leq s\). From Definition 10.6 we see that \((\sigma ,\tau ', \rho ', (\rho ')^*,s') \in \mathcal{E}\).

By the pigeonhole principle, one of these subcollections must contribute at least \(N^{-o(1)}\) of the cardinality of \(W\). Since there are at most \(N^{\tau -\tau '+o(1)}\) intervals in this collection, we must have \(\rho \leq \rho ' + \tau - \tau '\) in this case.

By Lemma 10.2(iii), we also know that a (possibly different subcollection) must contribute at least \(N^{-o(1)}\) of the additive energy of \(W\). The number of intervals in this subcollection is at most \(\min ( N^{\tau -\tau '+o(1)}, N^{\rho -\rho '+o(1)})\). Applying Lemma 10.2(iii) again, we conclude \(\rho ^* \leq (\rho ^*)' + 3\min (\rho -\rho ',\tau -\tau ')\).

Finally, from 3, we know that a (possibly yet another subcollection) must contribute at least \(N^{-o(1)}\) of the double zeta sum \(S(N,W)\). The number of intervals in this subcollection is at most \(\min ( N^{\tau -\tau '+o(1)}, N^{\rho -\rho '+o(1)})\). Applying Lemma 3(iii) again, we conclude \(s \leq s' + 2\min (\rho -\rho ',\tau -\tau ')\).

Lemma 10.12 Raising to a power

If \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\), and \(k \geq 1\), then amongst all tuples \((\sigma ,\tau /k,\rho ',(\rho ')^*,s') \in \mathcal{E}\) with \(\rho ' \leq \rho /k\), \((\rho ')^* \leq \rho ^*/k\), and \(s' \leq s/k\), there exists a tuple with \(\rho ' = \rho /k\), there exists a tuple with \((\rho ')^* = \rho ^*/k\), and there exists a tuple with \(s' = s/k\). (These may be three different tuples.)

Proof

By definition, there exists a large value pattern \((N,T,V,(a_n)_{n \in [N,2N]},J,W)\) with \(N {\gt} 1\) unbounded, \(T = N^{\tau +o(1)}\), \(V = N^{\sigma +o(1)}\), \(|W| = N^{\rho +o(1)}\), \(E_1(W) = N^{\rho ^*+o(1)}\), and \(S(N,W) = N^{s+o(1)}\). Observe that

\[ \left( \sum _{n \in [N,2N]} a_n n^{-it} \right)^k = \sum _{n \in [N^k, 2^k N^k]} b_n n^{-it} \]

for some coefficients \(b_n = O(n^{o(1)})\). In particular, partitioning \([N^k, 2^k N^k]\) into \(O(1)\) sub-intervals \([N',2N']\) with \(N' = N^{k+o(1)}\), we can partition \(W\) into \(O(1)\) subcollections \(W_{N'}\), such that

\[ \left| \sum _{n \in [N',2N']} b_n n^{-it} \right| \gg V^k = (N')^{\sigma +o(1)} \]

for all \(t \in W_{N'}\). Again by the pigeonhole principle, one of the \(W_{N'}\) must have cardinality \(N^{\rho +o(1)}\), one must have energy \(N^{\rho ^*+o(1)}\), and one must have double zeta sum \(N^{s+o(1)}\) (but these may be different \(W_{N'}\)). Each of these \(W_{N'}\) then give the different conclusions to the lemma.

Morally speaking, one should be able to obtain equality in all three conclusions of Lemma 10.12 simultaneously, i.e. that \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\) essentially implies \((\sigma ,\tau /k,\rho /k,\rho ^*/k,s/k) \in \mathcal{E}\). This is because in practice one frequently controls \(\mathcal{E}\) by computing a containment region \(\mathcal{E}_1\) that possesses precisely the required monotonicity property. Specifically, we have

Lemma 10.13 Monotonicity criterion

Let \(\mathcal{E}_1\) be the intersection of sets \(E_i\), each of the form

\begin{align*} \{ (\sigma , \tau , \rho , \rho ^*, s) \in \mathbf{R}^5: \rho \le f_1(\rho ^*, s), \rho ^* \le f_2(\rho , s), s \le f_3(\rho , \rho ^*)\} \end{align*}

for some monotonically increasing functions \(f_1, f_2, f_3\) (that possibly also depend on \(\sigma \) and \(\tau \)).

Suppose amongst all tuples \((\sigma , \tau , \rho ', (\rho ^*)', s') \in \mathcal{E}_1\) with \(\rho ' \le \rho \), \((\rho ^*)' \le \rho ^*\) and \(s' \le s\), there exists a tuple with \(\rho ' = \rho \), a tuple with \((\rho ^*)' = \rho ^*\) and a tuple with \(s' = s\) (not necessarily the same tuple each time). Then, \((\sigma , \tau , \rho , \rho ^*, s) \in \mathcal{E}_1\).

Proof

Suppose that \((\sigma , \tau , \rho , (\rho ^*)', s') \in \mathcal{E}_1\) for some \((\rho ^*)'\le \rho ^*\) and \(s \le s'\) so that also \((\sigma , \tau , \rho , (\rho ^*)', s') \in E_i\). Then by definition \(\rho \le f_1((\rho ^*)', s')\). Since \(f_1\) is monotonically increasing (with respect to both \(\rho ^*\) and \(s\)), one has \(\rho \le f_1(\rho ^*, s)\). Similarly, \((\sigma , \tau , \rho ', \rho ^*, s') \in \mathcal{E}_1\) implies \(\rho ^* \le f_2(\rho , s)\) and \((\sigma , \tau , \rho ', (\rho ^*)', s) \in \mathcal{E}_1\) implies \(s \le f_3(\rho , \rho ^*)\), which together imply \((\sigma , \tau , \rho , \rho ^*, s) \in E_i\) by definition. Hence \((\sigma , \tau , \rho , \rho ^*, s) \in \mathcal{E}_1\) since \(\mathcal{E}_1\) is the intersection of sets \(E_i\).

Lemma 10.14 Raising to a power, alternative formulation

Let \(k\) be a positive integer, \(\mathcal{E}_1 \subseteq \mathbf{R}^{5}\) be a set satisfying the monotonicity criterion of Lemma 10.13 and

\[ \mathcal{E}_k := \{ (\sigma , \tau , \rho , \rho ^*, s) \in \mathbf{R}^5: (\sigma , \tau /k, \rho /k, \rho ^*/k, s/k) \in \mathcal{E}_1\} . \]

If \(\mathcal{E}\subseteq \mathcal{E}_1\) then \(\mathcal{E} \subseteq \mathcal{E}_k\).

Proof

Suppose that \((\sigma , \tau , \rho , \rho ^*, s) \in \mathcal{E}\subseteq \mathcal{E}_1\). By Lemma 10.12 and Lemma 10.13, \((\sigma , \tau /k, \rho /k, \rho ^*/k, s/k) \in \mathcal{E}_1\), hence by definition \((\sigma , \tau , \rho , \rho ^*, s) \in \mathcal{E}_k\).

10.3 Known relations for the large value energy region

Theorem 10.15 Reflection principle

[ 144 , § 11.5 ] If \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\) with \(\sigma \geq 3/4\) and \(\tau {\gt}1\), then for any integer \(k \geq 1\), either \(\rho \leq 2-2\sigma \), or there exists \(0 {\lt} \alpha \leq k(\tau -1)\) and \((\sigma , \tau /\alpha , \rho /\alpha , \rho ^*/\alpha , s'/\alpha ) \in \mathcal{E}\) such that

\[ \rho \leq \min ( 2-2\sigma , k(3-4\sigma )/2 + s' - 1). \]
Proof

By definition, there exists a large value pattern \((N,T,V,(a_n)_{n \in [N,2N]},J,W)\) with \(N{\gt}1\) unbounded, \(T = N^{\tau +o(1)}\), \(|W| = N^{\rho +o(1)}\), \(E_1(W) = N^{\rho ^*+o(1)}\) and \(S(N,W) = N^{s+o(1)}\). By [ 144 , (11.58) ] , one has

\[ |W|^2 \ll T^{o(1)} \left( |W| N^{2-2\sigma } + N^{1-2\sigma } |W|^2 + N^{(3-4\sigma )/2} \int _{v = O(T^{o(1)})} \sum _{t,t' \in W} \left|\sum _{n \leq 4T/N} n^{-1/2+it-it'+iv}\right|\ dv \right). \]

Since \(\sigma {\gt}1/2\), the \(N^{1-2\sigma } |W|^2\) term can be dropped. Applying Hölder’s inequality and dyadic pigeonholing as in [ 144 , (11.59) ] , we conclude that

\[ |W| \ll T^{o(1)} \left(N^{2-2\sigma } + N^{k(3-4\sigma )/2} \left(\sum _{t,t' \in W} \left|\sum _{n \in [N',2N']} b_n n^{-1/2+it-it'+iv}\right|^2\right)^{1/2}\right) \]

for some \(v = O(T^{o(1)})\) and coefficients \(b_n = O(T^{o(1)})\), and some \(N' \ll (4T/N)^k\). After passing to a subsequence if necessary, we may assume that \(N' = N^{\alpha +o(1)}\) for some \(0 \leq \alpha \leq k(\tau -1)\). If \(\alpha =0\) then the second term here is negligible compared to the first and we obtain \(\rho \leq 2-2\sigma \), so suppose that \(\alpha {\gt} 0\). Using [ 144 , Lemma 11.1 ] to eliminate the \(b_n n^{-1/2+iv}\) coefficients, we conclude that

\[ |W| \ll T^{o(1)} (N^{2-2\sigma } + N^{k(3-4\sigma )/2-1} S(N', W)). \]

By construction, we have \(S(N',W) = (N')^{s'/\alpha +o(1)} = N^{s'+o(1)}\) for some tuple \((\sigma , \tau /\alpha , \rho /\alpha , \rho ^*/\alpha , s'/\alpha ) \in \mathcal{E}\). The claim follows.

Heuristically one expects \(s \leq \max ( \rho +1, 2\rho )+1\) (see [ 144 , (11.63) ] ). There is one easy case in which this is true:

Lemma 10.16

If \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\) with \(\tau {\lt} 1\), then \(s \leq \max (\rho +1, 2\rho )+1\).

Recorded in literature.py as:
add_lver_ivic_1985()

Proof

By definition, there exists a large value pattern \((N,T,V,(a_n)_{n \in [N,2N]},J,W)\) with \(N{\gt}1\) unbounded, \(T = N^{\tau +o(1)}\), \(|W| = N^{\rho +o(1)}\), \(E_1(W) = N^{\rho ^*+o(1)}\) and \(S(N,W) = N^{s+o(1)}\). By the discussion after [ 144 , (11.63) ] , we have

\[ N^{-1} S(N,W) \ll T^\varepsilon ( |W| N + |W|^2 ) \]

for any fixed \(\varepsilon {\gt}0\), which gives the claim. .

Another bound is

Lemma 10.17

[ 144 , Lemma 11.2 ] If \((k,\ell )\) is an exponent pair with \(k{\gt}0\), and \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\), then

\[ s \leq \max \left( \rho +1, 5\rho /3 + \tau /3, \frac{2+3k+4\ell }{1+2k+2\ell } \rho + \frac{k+\ell }{1+2k+2\ell } \tau \right) + 1. \]
Proof

By definition, there exists a large value pattern \((N,T,V,(a_n)_{n \in [N,2N]},J,W)\) with \(N{\gt}1\) unbounded, \(T = N^{\tau +o(1)}\), \(|W| = N^{\rho +o(1)}\), \(E_1(W) = N^{\rho ^*+o(1)}\) and \(S(N,W) = N^{s+o(1)}\). From [ 144 , Lemma 11.2 ] we have

\[ N^{-1} S(N,W) \ll |W| N + |W|^{5/3} T^{1/3+\varepsilon } + |W|^{\frac{2+3k+4\ell }{1+2k+2\ell }} T^{\frac{k+\ell +\varepsilon }{1+2k+2\ell }} \]

for any fixed \(\varepsilon {\gt}0\), which gives the claim.

Finally, we have the useful

Lemma 10.18 Heath-Brown bound on double sums

If \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\), then

\[ s \leq \max ( \rho +1, 2\rho , 5\rho /4+\tau /2) + 1. \]

Note that if \(\tau \leq 3/2\), the \(5\rho /4+\tau /2\) term is bounded by the convex combination \((3/4)(\rho +1)+(1/4)(2\rho )\) and may therefore be omitted.

Recorded in literature.py as:
add_lver_heath_brown_1979()

Proof

By definition, there exists a large value pattern \((N,T,V,(a_n)_{n \in [N,2N]},J,W)\) with \(N{\gt}1\) unbounded, \(T = N^{\tau +o(1)}\), \(|W| = N^{\rho +o(1)}\), \(E_1(W) = N^{\rho ^*+o(1)}\) and \(S(N,W) = N^{s+o(1)}\). From [ 106 , Theorem 1 ] or [ 144 , Lemma 11.5 ] , one has

\[ N^{-1} S(N,W) \ll T^\varepsilon ( |W| N + |W|^2 + |W|^{5/4} T^{1/2} ), \]

giving the claim.

Lemma 10.5 can be formulated in terms of the large value energy region as follows.

Lemma 10.19

If \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\), then there exists \((\sigma ,\tau ,\rho ',(\rho ')^*,s') \in \mathcal{E}\) and \(0 \leq \kappa \leq \rho \) such that

\[ \kappa + \rho ' \leq 2 \rho \]
\[ 2\kappa + \rho ' \leq \rho ^* \]

and

\[ \rho ^* + 2\sigma \leq \kappa + (s+s')/2. \]
Proof

By definition, there exists a large value pattern

\[ (N,T,V,(a_n)_{n \in [N,2N]},J,W) \]

with \(N \geq 1\) unbounded, \(T = N^{\tau +o(1)}\), \(V = N^{\sigma +o(1)}\), \(|W| = N^{\rho +o(1)}\), \(E_1(W) = N^{\rho ^*+o(1)}\), and \(S(N,W) = N^{s+o(1)}\). From 6 we have

\[ V^2 E_1(W) \ll T^{o(1)} S(N,W)^{1/2} S_4(N,W)^{1/2} + T^{-50}. \]

By Lemma 10.5, there exists \(1 \leq u \ll |W|\) and a \(1\)-separated subset \(U\) of \([-2T,2T]\) such that such that

\[ V^2 E_1(W) \ll T^{o(1)} u S(N,W)^{1/2} S(N,U)^{1/2} + T^{-50} \]

with 7, 8 holding. Since \(W\) is non-empty, \(E_1(W) \geq 1\) and \(V \geq N^{1/2} \geq 1\), so the \(T^{-50}\) error here may be discarded. Passing to a subsequence, we may assume that \(u = N^{\kappa +o(1)}\) for some \(0 \leq \kappa \leq \rho \), and that \(|U| = N^{\rho '+o(1)}\) for some \(\rho ' \geq 0\). Then we have \(S_2(N,U) = s'\) for some \((\sigma ,\tau ,\rho ',(\rho ')^*,s') \in \mathcal{E}\), and the claim follows.

These bounds on the double zeta sums can be used to control additive energies:

Theorem 10.20 Heath-Brown relation

[ 107 , (33) ] If \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\), then one has

\[ \rho ^* \leq 1-2\sigma + \frac{1}{2}\max \left(\rho +1, 2\rho , \frac{5}{4}\rho +\frac{\tau }{2}\right) + \frac{1}{2}\max \left(\rho ^*+1, 4\rho , \frac{3}{4}\rho ^*+\rho +\frac{\tau }{2}\right). \]

Recorded in literature.py as:
add_lver_heath_brown_1979b1()

Proof

By Lemma 10.19 we have

\[ \rho ^* + 2\sigma \leq \kappa + ( \max ( \rho +1, 2\rho , 5\rho /4+\tau /2)+ \max ( \rho '+1, 2\rho ', 5\rho '/4+\tau /2) )/2 + 1 \]

for some \(0 \leq \kappa \leq \rho \) with

\[ \kappa + \rho ' \leq 2\rho \]
\[ 2\kappa + \rho ' \leq \rho ^* \]

In particular,

\[ 2\kappa + 5\rho '/4 \leq 3\rho ^*/4 + \rho \]

and the claim follows after moving the \(\kappa \) inside the second maximum and performing some algebra.

Corollary 10.21 Simplified Heath-Brown relation

If \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\) and \(\tau \leq 3/2\), then

\[ \rho ^* \leq \max (3 \rho + 1-2\sigma , \rho +4-4\sigma , 5\rho /2 + (3-4\sigma )/2). \]

Recorded in literature.py as:
add_lver_heath_brown_1979b2()

This result essentially appears as [ 107 , Lemma 3 ] .

Proof

Apply the previous result. For \(\tau \leq 3/2\) we observe that \(5\rho /4+\tau /2\) is less than \(5\rho /4 + 3/4\), which is a convex combination of \(\rho +1\) and \(2\rho \). Similarly \(3\rho ^*/4+\rho +\tau /2\) is less than \(3\rho ^*/4+\rho +3/4\), which is a convex combination of \(\rho ^*+1\) and \(4\rho \). We conclude that

\[ \rho ^* \leq 1-2\sigma + \max (\rho +1, 2\rho )/2 + \max (\rho ^*+1, 4\rho )/2. \]

Thus \(\rho ^*\) is less than one of

\[ 1-2\sigma + (\rho +\rho ^*+2)/2, 1-2\sigma + (5\rho +1)/2, 1-2\sigma + (2\rho +\rho ^*+1)/2, 1-2\sigma + (6\rho )/2; \]

solving for \(\rho ^*\), we conclude

\[ \rho ^* \leq \max ( 4-4\sigma + \rho , (3-4\sigma )/2 + 5\rho /2, 3-4\sigma + 2\rho , 1-2\sigma + 3\rho ). \]

But since \(\sigma \geq 1/2\), \(3-4\sigma + 2\rho \) is less than \(5/2-3\sigma + 2\rho \), which is the mean of \(4-4\sigma +\rho \) and \(1-2\sigma +3\rho \). Thus

\[ \rho ^* \leq \max ( 4-4\sigma + \rho , (3-4\sigma )/2 + 5\rho /2, 1-2\sigma + 3\rho ), \]

which gives the claim.

Similarly, using Lemma 10.17 and Lemma 10.19, one has

Theorem 10.22

If \((k, \ell )\) be an exponent pair with \(k {\gt} 0\) and \((\sigma , \tau , \rho , \rho ^*, s)\in \mathcal{E}\), then

\begin{align*} \rho ^* \le 1 - 2\sigma & + \frac{1}{2}\max \left(\rho +1, \frac{5}{3}\rho + \frac{\tau }{3}, \frac{2+3k+4\ell }{1+2k+2\ell } \rho + \frac{k+\ell }{1+2k+2\ell } \tau \right)\\ & + \frac{1}{2}\max \left(\rho ^*+1, 4\rho , \frac{3}{4}\rho ^*+\rho +\frac{\tau }{2}\right). \end{align*}

Implemented at additive_energy.py as:
ep_to_lver(eph)

Proof

By Lemma 10.19 and Lemma 10.17, there exists some \((\sigma , \tau , \rho ', (\rho ^*)', s') \in \mathcal{E}\) satisfying

\begin{equation} \label{kappa-bounds} 0 \le \kappa \le \rho ,\qquad \kappa + \rho ' \le 2\rho ,\qquad 2\kappa + \rho ' \le \rho ^* \end{equation}
9

and

\begin{align*} \rho ^* + 2\sigma \le \kappa & + \frac{1}{2}\max \left( \rho +1, 5\rho /3 + \tau /3, \frac{2+3k+4\ell }{1+2k+2\ell } \rho + \frac{k+\ell }{1+2k+2\ell } \tau \right)\\ & + \frac{1}{2}\max \left(\rho ’ + 1, 2\rho ’, \frac{5}{4}\rho ’ + \frac{\tau }{2}\right) + 1. \end{align*}

The result follows by moving the \(\kappa \) term into the second maximum and applying 9.

Lemma 10.23 Second Heath-Brown relation

If \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\) then

\[ \rho \leq \max ( 2-2\sigma , \rho ^*/4 + \max (\tau /4 + k(3-4\sigma )/4, k\tau /4 + k(1-2\sigma )/2)) \]

for any positive integer \(k\).

Recorded in literature.py as:
add_lver_heath_brown_1979c(K)

Proof

By definition, there exists a large value pattern \((N,T,V,(a_n)_{n \in [N,2N]},J,W)\) with \(N{\gt}1\) unbounded, \(T = N^{\tau +o(1)}\), \(|W| = N^{\rho +o(1)}\), \(E_1(W) = N^{\rho ^*+o(1)}\) and \(S(N,W) = N^{s+o(1)}\). From [ 107 , Lemma 4 ] , we have

\[ |W| \ll T^\varepsilon \left( N^{2-2\sigma } + E_1(W)^{1/4} (T^{1/4} N^{k(3-4\sigma )/4} + T^{k/4} N^{k(1-2\sigma )/2})\right) \]

for any fixed \(\varepsilon {\gt}0\), giving the claim.

Lemma 10.24 Guth-Maynard relation

If \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\) then

\[ \rho \leq \max (2-2\sigma , 1-2\sigma + \max (S_1, S_2, S_3)/3) \]

where \(S_1, S_2, S_3\) are real numbers with

\[ S_1 \leq -10, \]
\[ S_2 \leq \max (2+2\rho , \tau +1+(2-1/k) \rho , 2 + 2\rho + (\tau /2 - 3\rho /4)/k ) \]

for any positive integer \(k\) and

\[ S_3 \leq 2\tau + \rho /2 + \rho ^*/2 \]

and also

\[ S_3 \leq \max ( 2\tau + 3\rho /2, \tau +1+\rho /2+\rho ^*/2). \]

Recorded in literature.py as:
add_lver_guth_maynard_2024a()

Proof

By definition, there exists a large value pattern \((N,T,V,(a_n)_{n \in [N,2N]},J,W)\) with \(N{\gt}1\) unbounded, \(T = N^{\tau +o(1)}\), \(|W| = N^{\rho +o(1)}\), \(E_1(W) = N^{\rho ^*+o(1)}\) and \(S(N,W) = N^{s+o(1)}\). By [ 88 , Proposition 4.6, (5.5) ] , one may bound

\[ |W| \ll N^{2-2\sigma } + N^{1-2\sigma } (S_1 + S_2 + S_3)^{1/3} \]

for three expressions \(S_1, S_2, S_3\) defined after [ 88 , (5.5) ] . From [ 88 , Proposition 5.1 ] we have

\[ S_1 \ll T^{-10}. \]

From [ 88 , Proposition 6.1 ] we have

\[ S_2 \ll T^{o(1)} ( N^2 |W|^2 + T N |W|^{2-1/k} + N^2 |W|^2 (\frac{T^{1/2}}{|W|^{3/4}})^{1/k} ). \]

From [ 88 , Proposition 8.1 ] we have

\[ S_3 \ll T^{2+o(1)} |W|^{1/2} E_1(W)^{1/2} \]

while from [ 88 , Proposition 10.1 ] we have

\[ S_3 \ll T^{2+o(1)} |W|^{3/2} + T^{1+o(1)} N |W|^{1/2} E_1(W)^{1/2}. \]

Combining all these bounds, we obtain the claim.

Lemma 10.25 Second Guth-Maynard relation

[ 88 , Lemma 1.7 ] If \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\) then

\[ \rho ^* \leq \rho + s - 2\sigma . \]

In particular, from Lemma 10.18 we see for \(\tau \leq 3/2\) that

\[ \rho ^* \leq \max (3\rho +1-2\sigma , 2\rho +2-2\sigma ). \]

Recorded in literature.py as:
add_lver_guth_maynard_2024b()

Proof

By definition, we can find a large value pattern \((N,T,V,(a_n)_{n \in [N,2N]},J,W)\) with \(N{\gt}1\) unbounded, \(T = N^{\tau +o(1)}\), \(V = N^{\sigma +o(1)}\), \(|W| = N^{\rho +o(1)}\), \(E_1(W) = N^{\rho ^*+o(1)}\), and \(S(N,W) = N^{s+o(1)}\). From 5 one has

\[ V^2 E_1(W) \ll T^{o(1)} |W| S(N,W) + T^{-50}. \]

Since \(W\) is non-empty, \(E_1(W) \geq 1\), and \(V \gg 1\), so the \(T^{-50}\) error can be discarded. The claim then follows.

Lemma 10.26 Third Guth-Maynard relation

If \((\sigma ,\tau ,\rho ,\rho ^*, s) \in \mathcal{E}\) and \(1 \leq \tau \leq 4/3\), then

\[ \rho ^* \leq \max (\rho +4-4\sigma , 21\rho /8+\tau /4+1-2\sigma , 3\rho +1-2\sigma ). \]

Recorded in literature.py as:
add_lver_guth_maynard_2024c()

Proof

By definition, there exists a large value pattern \((N,T,V,(a_n)_{n \in [N,2N]},J,W)\) with \(N{\gt}1\) unbounded, \(T = N^{\tau +o(1)}\), \(|W| = N^{\rho +o(1)}\), \(E_1(W) = N^{\rho ^*+o(1)}\) and \(S(N,W) = N^{s+o(1)}\). Applying [ 88 , Proposition 11.1 ] , we conclude that

\[ E_1(W) \ll T^{o(1)} ( |W| N^{4-4\sigma } + |W|^{21/8} T^{1/4} N^{1-2\sigma } + |W|^3 N^{1-2\sigma } ), \]

giving the claim.

We can put this all together to prove the Guth–Maynard large values theorem.

Theorem 10.27 Guth–Maynard large values theorem

[ 88 , Theorem 1.1 ] One has

\[ \mathrm{LV}(\sigma ,\tau ) \leq \max (2-2\sigma , 18/5 - 4 \sigma , \tau + 12/5 - 4\sigma ). \]

Recorded in literature.py as:
add_guth_maynard_large_values_estimate()
Derived in derived.py as:
prove_guth_maynard_large_values_theorem()
prove_guth_maynard_lvt_from_intermediate_lemmas()

Proof

For \(\sigma \leq 7/10\) this follows from Lemma 7.9, and for \(\sigma \geq 8/10\) it follows from Lemma 7.12. Thus we may assume that \(7/10 \leq \sigma \leq 8/10\). By subdivision (Lemma 7.7) it then suffices to treat the case \(\tau = 6/5\), that is to say to show that

\[ \rho \leq 18/5-4\sigma \]

whenever \((\sigma ,\tau ,\rho ,\rho ^*,s) \in \mathcal{E}\) with \(\tau =6/5\) and \(7/10 \leq \sigma \leq 8/10\).

Applying Lemma 10.24 and discarding the very negative \(S_1\) term, we have

\[ \rho \leq \max (2-2\sigma , 1-2\sigma + \max (S_2, S_3)/3) \]

where \(S_2, S_3\) are real numbers with

\[ S_2 \leq \max (2+2\rho , \tau +1+(2-1/k) \rho , 2 + 2\rho + (\tau /2 - 3\rho /4)/k ) \]

for any positive integer \(k\) and

\[ S_3 \leq 2\tau + \rho /2 + \rho ^*/2 \]

and also

\[ S_3 \leq \max ( 2\tau + 3\rho /2, \tau +1+\rho /2+\rho ^*/2). \]

From the latter bound and Lemma 10.26, one has

\[ S_3 \leq \max ( 2\tau +3\rho /2, \tau +\rho +3-2\sigma , \tau +2\rho +3/2-\sigma , 9\tau /8+29\rho /16 + 3/2-\sigma ). \]

Inserting this and the \(S_2\) bound (with \(k=4\)) into the bound for \(\rho \) and simplifying (using \(\tau =6/5\)), we eventually obtain the desired bound \(\rho \leq 18/5-4\sigma \).

We also record a variant of that theorem from the same paper:

Theorem 10.28 Additional Guth–Maynard large values estimate

For any \(1/2 {\lt} \sigma {\lt} 1\), \(\tau \geq 1\), and natural number \(k\) one has

\begin{equation} \label{general-tau} \begin{split} \mathrm{LV}(\sigma ,\tau ) \leq \max \Big(& 2-2\sigma , 5-6\sigma , (4-6\sigma )\frac{k}{k+1} + \frac{k}{k+1} \tau , (5-6\sigma ) \frac{4k}{4k+3} + \frac{2}{4k+3} \tau , \\ & 2-4\sigma + \frac{4}{3} \tau , 3-4\sigma + \frac{\tau }{2}, \frac{9}{2}-7\sigma +\tau , \frac{72-112\sigma }{19} + \frac{18}{19} \tau \Big). \end{split} \end{equation}
10

If one specializes to the case \(\sigma \geq 7/10\) and \(1 \leq \tau \leq 6/5\), one then has

\begin{equation} \label{second-bound} \mathrm{LV}(\sigma ,\tau ) \leq \max \Big(2-2\sigma , 3-4\sigma +\frac{\tau }{2}, (4-6\sigma )\frac{k}{k+1} + \frac{k}{k+1} \tau , (5-6\sigma ) \frac{4k}{4k+3} + \frac{2}{4k+3} \tau \Big). \end{equation}
13

and also

\begin{equation} \label{third-bound} \mathrm{LV}(\sigma ,\tau ) \leq \max \Big(2-2\sigma , 3-4\sigma +\frac{\tau }{2}, \frac{46-60\sigma }{5} + \frac{30\sigma -21}{5} \tau \Big). \end{equation}
14

Recorded in literature.py as:
add_guth_maynard_large_values_estimate2(Constants.LARGE_VALUES_TRUNCATION)
Derived in derived.py as:
prove_guth_maynard_intermediate_lvt()
prove_guth_maynard_intermediate_lvt2()

Proof

(Sketch) The first bound 10 is [ 88 , (12.1) ] , and is proven by the same methods used to prove Theorem 10.27. The second and third bounds 13, 14 are in [ 88 , Proposition 12.1 ] . The second bound 13 follows from the \(k=3\) case of Theorem 7.16 when \(\sigma \geq 39/50\), and from 10 together with Theorems 7.9, 7.12 for \(\sigma {\lt} 39/50\). The third bound 14 follows from Theorems 7.9, 7.12 when \(\tau \leq 4-4\sigma \), and for \(\tau {\gt} 4-4\sigma \) the bound follows from 13 after optimizing in \(k\) (see the proof of [ 88 , Proposition 12.1 ] for details).

Now we turn to another application of double zeta sums to large value theorems.

Theorem 10.29 Bourgain large values theorem

[ 21 ] Let \(1/2 {\lt} \sigma {\lt} 1\) and \(\tau {\gt} 0\), and let \(\rho := \mathrm{LV}(\sigma ,\tau )\). Let \(\alpha _1, \alpha _2 \geq 0\) be real numbers. Then either

\begin{equation} \label{rho1} \rho \leq \max ( \alpha _2 + 2 - 2 \sigma , -\alpha _2 + 2\tau +4-8\sigma , -2\alpha _1 + \tau + 12 - 16 \sigma ) \end{equation}
15

or else there exists \(s \geq 0\) such that

\begin{equation} \label{rs} \begin{split} & \frac{1}{2}\max (\rho +2, 2\rho +1, 5\rho /4 + \tau /2 + 1) + \frac{1}{2}\max (s+2, 2s+1, 5s/4 + \tau /2 + 1) \geq \\ & \qquad \max ( -2\alpha _1 + 2\sigma + s + \rho , -\alpha _1 - \alpha _2/2 + 2\sigma + s/2 + 3\rho /2). \end{split} \end{equation}
16

Proof

By Definition 7.2, we can find a large value pattern \((N,T,V,(a_n)_{n \in [N,2N]},J,R)\) with \(N{\gt}1\) unbounded, \(N \geq 1\), \(T = N^{\tau +o(1)}\), \(|R| = N^{\rho +o(1)}\), \(V = N^{\sigma +o(1)}\); we use \(R\) here instead of \(W\) to be consistent with the notation from [ 21 ] . Now set \(\delta _1 := N^{-\alpha _1}\), \(\delta _2 := N^{-\alpha _2}\). From [ 21 , (4.41), (4.42) ] , one has the inequality

\[ |R| \leq |R^{(1)}| + |R^{(2)}| \]

for certain sets \(R^{(1)}\) and \(R^{(2)}\) with the former set obeying the bound

\[ |R^{(1)}| \ll \delta _2^{-1} N^2 V^{-2} + \delta _2 T^2 N^4 V^{-8} + \delta _1^2 T N^{12} V^{-16}. \]

Hence, we either have

\[ |R| \ll \delta _2^{-1} N^2 V^{-2} + \delta _2 T^2 N^4 V^{-8} + \delta _1^2 T N^{12} V^{-16} \]

which implies 15, or else

\begin{equation} \label{rr} |R| \ll |R^{(2)}|. \end{equation}
19

Henceforth we assume that 19 holds. From [ 21 , (4.53), (4.54) ] we may upper bound

\begin{equation} \label{Exp-1} T^{-\varepsilon } \delta ' (\delta '')^2 V^2 |S| |R^{(2)}| + T^{-\varepsilon } \delta _1 V^2 |S|^{1/2} \sum _\alpha |R_\alpha |^{3/2} \end{equation}
20

by

\begin{equation} \label{Exp-2} \ll T^\varepsilon S(N,R^{(2)})^{1/2} S(N,S)^{1/2} \end{equation}
21

for arbitrarily small fixed \(\varepsilon \), some \(\delta ',\delta ''{\gt}0\) with \(\delta ' {\gt} T^{-\varepsilon } (\delta _1/\delta '')^2\) (see [ 21 , (4.37) ] ), some set \(S\) (which will be non-empty by [ 21 , (4.47) ] ), and some sets \(R_\alpha \) defined in [ 21 , (4.39) ] , where the double zeta sums \(S(N,W)\) are defined in 2. Applying Lemma 10.18, the latter expression is bounded by

\[ \ll T^\varepsilon (|R|N^2 + |R|^2 N + |R|^{5/4} T^{1/2} N)^{1/2} (|S| N^2 + |S|^2 N + |S|^{5/4} T^{1/2} N)^{1/2}. \]

Meanwhile, from [ 21 , (4.57) ] , the expression 20 is bounded from below by

\[ \gg T^{-2\varepsilon } (\delta _1^2 V^2 |S| |R| + \delta _1 \delta _2^{1/2} V^2 |S|^{1/2} |R|^{3/2}). \]

After passing to a subsequence, we can ensure that \(|S| = N^{s+o(1)}\) for some \(s {\gt} 0\). Combining these bounds and writing all expressions as powers of \(N\), we obtain the claim (after sending \(\varepsilon \to 0\)).

Corollary 10.30 Bourgain large values theorem, simplified version

[ 21 , Lemma 4.60 ] Let the notation be as above, but additionally assume \(\rho \leq \min (1, 4-2\tau )\). Then

\[ \rho \leq \max ( \alpha _2 + 2 - 2 \sigma , \alpha _1+\alpha _2/2 + 2-2\sigma , -\alpha _2 + 2\tau +4-8\sigma , -2\alpha _1 + \tau + 12 - 16 \sigma , 4\alpha _1 + 2+\max (1,2\tau -2)-4\sigma ). \]

In [ 21 ] this bound is only established in the case \(\tau \leq 3/2\) (in which case the condition on \(\rho \) simplifies to \(\rho \leq 1\), and the final term \(4\alpha _1 + 2+\max (1,2\tau -2)-4\sigma \) simplifies to \(4\alpha _1 + 3 -4\sigma \)), but the argument extends to the \(\tau {\gt} 3/2\) case without significant difficulty.

Proof

With \(\rho \leq \min (1,4-2\tau )\), \(5\rho /4+\tau /2+1\) and \(2\rho +1\) are both bounded by \(\rho +2\), hence

\[ \max (\rho +2, 2\rho +1, 5\rho /4 + \tau /2 + 1) = \rho +2. \]

Furthermore, \(5s/4+\tau /2+1\) is a convex combination of \(s+2\) and \(2s + 2\tau -2\), hence

\[ \max (s+2, 2s+1, 5s/4 + \tau /2 + 1) \leq \max (s+2, 2s + \max (1,2\tau -2)). \]

Thus 16 simplifies to

\[ (\rho +2)/2 + \max (s+2, 2s+\max (1,2\tau -2))/2 \geq \max ( -2\alpha _1 + 2\sigma + s + \rho , -\alpha _1 - \alpha _2/2 + 2\sigma + s/2 + 3\rho /2). \]

Thus either

\[ (\rho +2)/2 + (s+2)/2 \geq -\alpha _1 - \alpha _2/2 + 2\sigma + s/2 + 3\rho /2 \]

or

\[ (\rho +2)/2 + (2s+\max (1,2\tau -2))/2 \geq -2\alpha _1 + 2\sigma + s + \rho . \]

In both cases we may eliminate \(s\) and solve for \(\rho \) to obtain

\[ \rho \leq \alpha _1 + \alpha _2/2 + 2 - 2 \sigma \]

or

\[ \rho \leq 4\alpha _1 + 2 + \max (1,2\tau -2) - 4 \sigma , \]

giving the claim.

With the aid of computer assistance, one is able to produce an optimized version of the above large values theorem. We have

Corollary 10.31 Bourgain large values theorem, optimized version

For each row \((\rho _0, \alpha _1, \alpha _2, \mathcal{S})\) of Table 10.1, one has

\[ \rho \le \rho _0(\sigma , \tau ),\qquad (\sigma , \tau ) \in \mathcal{S}. \]

Recorded in literature.py as:
add_bourgain_large_values_estimate()

Derived in derived.py as:
prove_bourgain_large_values_theorem()

Proof

Follows from substituting the specified values of \(\alpha _1\) and \(\alpha _2\) and a routine calculation.

Table 10.1 Bounds on \(\mathrm{LV}(\sigma , \tau )\) for \(1/2 \le \sigma \le 1\) and \(\tau \ge 1\)

\(\rho _0(\sigma , \tau )\)

\(\alpha _1\)

\(\alpha _2\)

\(\mathcal{S}\)

\(\frac{16}{3} - \frac{20}{3}\sigma + \frac{1}{3}\tau \)

\(\frac{10}{3} - \frac{14}{3}\sigma + \frac{\tau }{3}\)

\(0\)

\(\begin{aligned} -1 + \tau \ge 0,\\ 10 - 14\sigma + \tau \ge 0,\\ 4 + 4\sigma - 5\tau \ge 0,\\ -11 + 16\sigma - \tau \ge 0. \end{aligned}\)

\(5 - 7\sigma + \frac{3}{4}\tau \)

\(\frac{7}{2} - \frac{9}{2}\sigma + \frac{\tau }{8}\)

\(-1 - \sigma + \frac{5}{4}\tau \)

\(\begin{aligned} 8 - 8\sigma - \tau \ge 0,\\ -16 + 20\sigma + \frac{1}{3}\tau \ge 0,\\ -6 + 10\sigma - \frac{7}{6}\tau \ge 0,\\ -4 - 4\sigma + 5\tau \ge 0. \end{aligned}\)

\(3 - 5\sigma + \tau \)

\(\frac{9}{2} - \frac{11}{2}\sigma \)

\(1 - 3\sigma + \tau \)

\(\begin{aligned} -8+8\sigma +\tau & \ge 0,\\ 2-6\sigma +2\tau & \ge 0,\\ -10+14\sigma -\frac{2}{3}\tau & \ge 0,\\ 6-2\sigma -2\tau & \ge 0. \end{aligned}\)

\(-4\sigma + 2\tau \)

\(0\)

\(1 - 3\sigma + \tau \)

\(\begin{aligned} -6+2\sigma +2\tau & \ge 0,\\ -12 + 12\sigma + \tau & \ge 0,\\ 1-\sigma & \ge 0. \end{aligned}\)

\(8 - 12\sigma + \frac{4}{3}\tau \)

\( 2 - 2\sigma - \frac{1}{6}\tau \)

\(6 - 10\sigma + \frac{4}{3}\tau \)

\(\begin{aligned} 15-21\sigma +\tau \ge 0,\\ 12 - 12\sigma - \tau \ge 0,\\ -\frac{3}{2} + \tau \ge 0,\\ 6-10\sigma +\frac{7}{6}\tau \ge 0.\\ \end{aligned}\)

\(2 - 2\sigma \)

\(0\)

\(0\)

\(\begin{aligned} 1-\sigma \ge 0,\\ -10+14\sigma -\tau \ge 0,\\ -1+\tau \ge 0,\\ -2+6\sigma -2\tau \ge 0. \end{aligned}\)

\(9 - 12\sigma + \frac{2}{3}\tau \)

\(\frac{3}{2} - 2\sigma + \frac{1}{6}\tau \)

\(11 - 16\sigma + \tau \)

\(\begin{aligned} \frac{3}{2} - \tau \ge 0,\\ -\frac{1}{2} + \sigma \ge 0,\\ -1 + \tau \ge 0,\\ 11 - 16\sigma + \tau \ge 0,\\ 16 - 20\sigma - \frac{1}{3}\tau \ge 0. \end{aligned}\)

The preprint of Kerr [ 165 ] contains additional large value theorems:

Lemma 10.32 Kerr large values theorem
  • [ 165 , Theorem 2 ] Let \(3/4 {\lt} \sigma \leq 1\), \(0 \leq \tau \leq 3/2\), and \(0 \leq \rho \leq \mathrm{LV}(\sigma ,\tau ), 1\) be fixed. Then for any fixed \(\alpha \geq 0\), one has

    \[ \rho \leq \max ( 2-2\sigma +\alpha , 2\tau +4-8\sigma -\alpha , \tau /3+16/3 -20\sigma /3 + \alpha /3, 2\tau /3+9-12\sigma ). \]
  • [ 165 , Theorem 3 ] Under the same hypotheses as (i), we have for any fixed integer \(k \geq 2\) obeying \(-\alpha {\lt} 4k\sigma -(\tau +3k-1)\) and \(-\alpha {\lt} 1 + \frac{1}{k-1} - \tau \) that

    \[ \rho \leq \max (2-2\sigma +\alpha , \tau /3+(3k+4)/3-(4k+4)\sigma /3 + \alpha /3). \]
  • [ 165 , Theorem 4 ] If \(25/32 {\lt} \sigma \leq 1\), \(\tau \geq 0\), and \(0 \leq \rho \leq \mathrm{LV}(\sigma ,\tau ), 1, 4-2\tau \) are fixed, then for any fixed \(\alpha \) with

    \[ 26 - 32 \sigma - \tau {\lt} - \alpha {\lt} 16\sigma -11-\tau \]

    one has

    \[ \rho \leq \max ( 2-2\sigma +\alpha , 2\tau +4-8\sigma -\alpha , -\tau +8-8\sigma +2\alpha , 10-12\sigma +2\alpha /3). \]
  • [ 165 , Theorem 5 ] If \(1/2 {\lt} \sigma \leq 1\), \(0 \leq \tau \leq 3/2\), and \(0 \leq \rho \leq \mathrm{LV}(\sigma ,\tau ), 1\) are fixed, then for any fixed \(\alpha \) with

    \[ - \alpha {\lt} -\tau + 8\sigma - 5 \]

    one has

    \[ \rho \leq \max (2-2\sigma +\alpha , 4\tau /3 +23/3 - 12\sigma - 2\alpha /3, 2\tau /3 + 14/3 - 20/3 ). \]