A Hilbert space lemma: Difference between revisions
New page: This is a finitised version of the lemma which is key to the ergodic proof of DHJ(2). :'''Lemma''' Let H be a Hilbert space with nested subspaces :<math>V_0 \subset V_1 \subset \ldots \su... |
No edit summary |
||
Line 1: | Line 1: | ||
== Motivation == | |||
The Khintchine recurrence theorem has a Hilbert space formulation: | |||
:'''Theorem''' If <math>U: H \to H</math> is a unitary operator, and 1, f are elements of H of norm at most 1 with <math>U^* 1 = 1</math> and <math>\langle f, 1 \rangle = \delta</math>, then there exist arbitrarily large n with <math>\langle f, U^n f \rangle \geq \delta^2 - o(1)</math>. | |||
It implies in particular that if A is a subset of [N] of density at least <math>\delta</math>, then there are many n for which <math>A \cap (A-n)</math> has density at least <math>\delta^2 - o(1)</math>. It can be proven by a variety of means, for instance it follows from the von Neumann ergodic theorem, that asserts that <math>\frac{1}{N} \sum_{n=1}^N U^n f</math> converges strongly to Pf for some orthogonal projection P, with P1 = 1. | |||
The argument relies on the ability to iterate a single operator U many times. In the situation relevant to DHJ, U corresponds to an operation such as flipping a 0 to a 1, and we cannot iterate any single operator more than once, let alone n times. Instead, what one has is an "IP system" of operators <math>U_1, U_2, \ldots</math>, each of which can be used exactly once (think of <math>U_i</math> as the operation of flipping the "i^th digit" from 0 to 1). We are then interested in lower bounds on expressions such as <math>\langle f, U_{i_n} \ldots U_{i_1} f \rangle</math> for some <math>0 < i_1 < \ldots < i_n</math>. | |||
== Preliminary lemma == | |||
A preliminary lemma in this direction is as follows. | |||
:'''Lemma''' Let H be a Hilbert space with nested subspaces | :'''Lemma''' Let H be a Hilbert space with nested subspaces | ||
Line 8: | Line 20: | ||
* (Boundedness) <math>1, f</math> have norm at most 1. | * (Boundedness) <math>1, f</math> have norm at most 1. | ||
* (Positive density) <math> \langle 1, f \rangle \geq \delta > 0</math>. | * (Positive density) <math> \langle 1, f \rangle \geq \delta > 0</math>. | ||
* ( | * (Measure preservation) <math>U_i^* 1 = 1</math> for all i. | ||
* (Substationarity) Each <math>U_i</math> has operator norm at most <math>1+O(\varepsilon)</math>. | * (Substationarity) Each <math>U_i</math> has operator norm at most <math>1+O(\varepsilon)</math>. | ||
* (Measurability) For each i, <math>U_i</math> maps <math>V_{i-1}</math> to <math>V_i</math>. | * (Measurability) For each i, <math>U_i</math> maps <math>V_{i-1}</math> to <math>V_i</math>. | ||
Line 15: | Line 27: | ||
: Then there exists <math>1 \leq i \leq m</math> such that <math> \langle f, U_i f \rangle > \delta^2 - o(1)</math>, where o(1) goes to zero as <math>\varepsilon \to 0</math>. | : Then there exists <math>1 \leq i \leq m</math> such that <math> \langle f, U_i f \rangle > \delta^2 - o(1)</math>, where o(1) goes to zero as <math>\varepsilon \to 0</math>. | ||
== | == Sketch of infinitary proof == | ||
By a compactness argument it suffices to treat the case <math>\varepsilon=0, m = \infty</math>. We may then restrict H to the closure of <math>\bigcup_i V_i</math>. The stability hypothesis is then asserting that the <math>U_i</math> converge in the weak operator topology to some limit P, and that <math>U_k U_j</math> also converge in this topology. By computing the limit <math>\lim_{k \to \infty} \lim_{j \to \infty} \langle U_k U_j v, w \rangle$ in two different ways, we conclude that <math>P^2 = P</math>, i.e. P is idempotent. On the other hand, from substationarity we know that P has operator norm at most 1. The two facts are only compatible when P is an orthonormal projection. It now suffices to show that <math> \langle f, Pf \rangle \geq \delta^2,</math>. But the left-hand side is <math>\|Pf\|^2</math>; meanwhile, positive density and measure preservation give us <math>P^* 1 = 1</math> and thus <math>\langle Pf, 1 \rangle \geq \delta</math>, and the claim follows from Cauchy-Schwarz. | |||
== Finitary proof == | |||
By the stability hypothesis, we have an operator <math>P_i = \Pi_{V_i} P_i \Pi_{V_i}</math> such that | By the stability hypothesis, we have an operator <math>P_i = \Pi_{V_i} P_i \Pi_{V_i}</math> such that | ||
Line 56: | Line 73: | ||
which by (3) is equal to | which by (3) is equal to | ||
:<math> \| P_{i_0} f \|^2 + o(1).</math> | :<math> \| P_{i_0} f \|^2 + o(1).</math> | ||
On the other hand, from positive density and | On the other hand, from positive density and measure preservation we have | ||
:<math> \langle U_{i_0+1} f, 1 \rangle \geq \delta</math> | :<math> \langle U_{i_0+1} f, 1 \rangle \geq \delta</math> | ||
and thus by (1) | and thus by (1) |
Revision as of 16:18, 23 February 2009
Motivation
The Khintchine recurrence theorem has a Hilbert space formulation:
- Theorem If [math]\displaystyle{ U: H \to H }[/math] is a unitary operator, and 1, f are elements of H of norm at most 1 with [math]\displaystyle{ U^* 1 = 1 }[/math] and [math]\displaystyle{ \langle f, 1 \rangle = \delta }[/math], then there exist arbitrarily large n with [math]\displaystyle{ \langle f, U^n f \rangle \geq \delta^2 - o(1) }[/math].
It implies in particular that if A is a subset of [N] of density at least [math]\displaystyle{ \delta }[/math], then there are many n for which [math]\displaystyle{ A \cap (A-n) }[/math] has density at least [math]\displaystyle{ \delta^2 - o(1) }[/math]. It can be proven by a variety of means, for instance it follows from the von Neumann ergodic theorem, that asserts that [math]\displaystyle{ \frac{1}{N} \sum_{n=1}^N U^n f }[/math] converges strongly to Pf for some orthogonal projection P, with P1 = 1.
The argument relies on the ability to iterate a single operator U many times. In the situation relevant to DHJ, U corresponds to an operation such as flipping a 0 to a 1, and we cannot iterate any single operator more than once, let alone n times. Instead, what one has is an "IP system" of operators [math]\displaystyle{ U_1, U_2, \ldots }[/math], each of which can be used exactly once (think of [math]\displaystyle{ U_i }[/math] as the operation of flipping the "i^th digit" from 0 to 1). We are then interested in lower bounds on expressions such as [math]\displaystyle{ \langle f, U_{i_n} \ldots U_{i_1} f \rangle }[/math] for some [math]\displaystyle{ 0 \lt i_1 \lt \ldots \lt i_n }[/math].
Preliminary lemma
A preliminary lemma in this direction is as follows.
- Lemma Let H be a Hilbert space with nested subspaces
- [math]\displaystyle{ V_0 \subset V_1 \subset \ldots \subset V_m \subset H }[/math].
- Suppose [math]\displaystyle{ V_0 }[/math] contains two elements 1, f.
- For each [math]\displaystyle{ 1 \leq i \leq m }[/math], let [math]\displaystyle{ U_i: H \to H }[/math] be a bounded linear operator. We assume the following axioms:
- (Boundedness) [math]\displaystyle{ 1, f }[/math] have norm at most 1.
- (Positive density) [math]\displaystyle{ \langle 1, f \rangle \geq \delta \gt 0 }[/math].
- (Measure preservation) [math]\displaystyle{ U_i^* 1 = 1 }[/math] for all i.
- (Substationarity) Each [math]\displaystyle{ U_i }[/math] has operator norm at most [math]\displaystyle{ 1+O(\varepsilon) }[/math].
- (Measurability) For each i, [math]\displaystyle{ U_i }[/math] maps [math]\displaystyle{ V_{i-1} }[/math] to [math]\displaystyle{ V_i }[/math].
- (Stability) For each [math]\displaystyle{ 0 \leq i \lt m-1 }[/math], the operators [math]\displaystyle{ \Pi_{V_i} U_j \Pi_{V_i} }[/math] for [math]\displaystyle{ i \lt j \leq m }[/math] and [math]\displaystyle{ \Pi_{V_i} U_k U_j \Pi_{V_i} }[/math] for [math]\displaystyle{ i \lt j \lt k \leq m }[/math] differ from each other by at most [math]\displaystyle{ O(\varepsilon) }[/math] in the operator norm, where [math]\displaystyle{ \Pi_{V_i} }[/math] is the orthogonal projection onto [math]\displaystyle{ V_i }[/math].
- (Parameter sizes) [math]\displaystyle{ \varepsilon }[/math] is sufficiently small depending on [math]\displaystyle{ \delta }[/math], and m is sufficiently large depending on [math]\displaystyle{ \delta, \varepsilon }[/math].
- Then there exists [math]\displaystyle{ 1 \leq i \leq m }[/math] such that [math]\displaystyle{ \langle f, U_i f \rangle \gt \delta^2 - o(1) }[/math], where o(1) goes to zero as [math]\displaystyle{ \varepsilon \to 0 }[/math].
Sketch of infinitary proof
By a compactness argument it suffices to treat the case [math]\displaystyle{ \varepsilon=0, m = \infty }[/math]. We may then restrict H to the closure of [math]\displaystyle{ \bigcup_i V_i }[/math]. The stability hypothesis is then asserting that the [math]\displaystyle{ U_i }[/math] converge in the weak operator topology to some limit P, and that [math]\displaystyle{ U_k U_j }[/math] also converge in this topology. By computing the limit [math]\displaystyle{ \lim_{k \to \infty} \lim_{j \to \infty} \langle U_k U_j v, w \rangle$ in two different ways, we conclude that \lt math\gt P^2 = P }[/math], i.e. P is idempotent. On the other hand, from substationarity we know that P has operator norm at most 1. The two facts are only compatible when P is an orthonormal projection. It now suffices to show that [math]\displaystyle{ \langle f, Pf \rangle \geq \delta^2, }[/math]. But the left-hand side is [math]\displaystyle{ \|Pf\|^2 }[/math]; meanwhile, positive density and measure preservation give us [math]\displaystyle{ P^* 1 = 1 }[/math] and thus [math]\displaystyle{ \langle Pf, 1 \rangle \geq \delta }[/math], and the claim follows from Cauchy-Schwarz.
Finitary proof
By the stability hypothesis, we have an operator [math]\displaystyle{ P_i = \Pi_{V_i} P_i \Pi_{V_i} }[/math] such that
- [math]\displaystyle{ \Pi_{V_i} U_j \Pi_{V_i} = P_i + o(1) }[/math] (1)
for all [math]\displaystyle{ i \lt j \leq m }[/math] and
- [math]\displaystyle{ \Pi_{V_i} U_k U_j \Pi_{V_i} = P_i + o(1) }[/math] (2)
for all [math]\displaystyle{ i \lt j \lt k \leq m }[/math], where o(1) denotes a quantity that goes to zero as [math]\displaystyle{ \varepsilon \to 0 }[/math] (in the operator norm for operators, or the Hilbert space norm for vectors). By the substationarity hypothesis, [math]\displaystyle{ P_i }[/math] has operator norm [math]\displaystyle{ 1+o(1) }[/math].
From (1) we see that [math]\displaystyle{ \| P_i f \|^2 }[/math] is monotone increasing in i up to errors of [math]\displaystyle{ o(1) }[/math]. By the pigeonhole principle, one can thus find [math]\displaystyle{ i_0 }[/math] such that
- [math]\displaystyle{ \| P_{i_0 + \lfloor 1/\varepsilon \rfloor} f\|^2 = \| P_{i_0} f \|^2 + o(1) }[/math]
which then implies that
- [math]\displaystyle{ P_i f = P_j f + o(1) }[/math] (3)
for [math]\displaystyle{ i_0 \leq i, j \leq i_0 + 1/\varepsilon }[/math]. A second application of the pigeonhole principle then locates [math]\displaystyle{ i_0 \leq i_1 \leq i_0 + 1/\varepsilon - 3 }[/math] such that
- [math]\displaystyle{ \| P_{i_1+3}^* P_{i_0} f\|^2 = \| P_{i_1}^* P_{i_0} f \|^2 + o(1) }[/math]
and thus
- [math]\displaystyle{ P_i^* P_{i_0} f = P_j^* P_{i_0} f + o(1) }[/math] (4)
for [math]\displaystyle{ i_1 \leq i,j \leq i_1+3 }[/math].
Now we compute the expression
- [math]\displaystyle{ \| P_{i_0} f \|^2 = \langle P_{i_0} f, P_{i_0} f \rangle. }[/math]
From (2), this is equal to
- [math]\displaystyle{ \langle U_{i_1+2} U_{i_1+1} f, P_{i_0} f \rangle + o(1). }[/math]
Applying (1), this is equal to
- [math]\displaystyle{ \langle P_{i_1+1} U_{i_1+1} f, P_{i_0} f \rangle + o(1). }[/math]
Moving the [math]\displaystyle{ P_{i_1+1} }[/math] temporarily to the other side of the inner product, applying (4), and then moving it back, this is equal to
- [math]\displaystyle{ \langle P_{i_1} U_{i_1+1} f, P_{i_0} f \rangle + o(1). }[/math]
which by (1) is equal to
- [math]\displaystyle{ \langle P_{i_1} P_{i_1} f, P_{i_0} f \rangle + o(1). }[/math]
Aplying (3), this is equal to
- [math]\displaystyle{ \langle P_{i_1} P_{i_0} f, P_{i_0} f \rangle + o(1). }[/math]
which is in turn equal to
- [math]\displaystyle{ \langle P_{i_0} f, P_{i_1}^* P_{i_0} f \rangle + o(1). }[/math]
On the other hand, by Cauchy-Schwarz and the boundedness of [math]\displaystyle{ P_{i_1} }[/math], this is bounded by [math]\displaystyle{ \|P_{i_0} f \|^2 + o(1) }[/math]. In order for Cauchy-Schwarz to be sharp, we are thus forced to conclude that
- [math]\displaystyle{ P_{i_1}^* P_{i_0} f = P_{i_0} f + o(1) }[/math]. (5)
Now, we look at the expression
- [math]\displaystyle{ \langle f, P_{i_0} f \rangle. }[/math]
By (5), this is
- [math]\displaystyle{ \langle P_{i_1} f, P_{i_0} f \rangle + o(1) }[/math]
which by (3) is equal to
- [math]\displaystyle{ \| P_{i_0} f \|^2 + o(1). }[/math]
On the other hand, from positive density and measure preservation we have
- [math]\displaystyle{ \langle U_{i_0+1} f, 1 \rangle \geq \delta }[/math]
and thus by (1)
- [math]\displaystyle{ \langle P_{i_0} f, 1 \rangle \geq \delta. }[/math]
By Cauchy-Schwarz we thus have
- [math]\displaystyle{ \| P_{i_0} f \|^2 \geq \delta^2 }[/math]
and the claim follows. [math]\displaystyle{ \Box }[/math]